Promising the Future: Virginity Pledges and
First Intercourse¹


Peter S. Bearman and Hannah Brückner
Columbia University 


Since 1993, in response to a movement sponsored by the Southern 
Baptist Church, over 2.5 million adolescents have taken public “vir- 
ginity” pledges, in which they promise to abstain from sex until 
marriage. This paper explores the effect of those pledges on the 
transition to first intercourse. Adolescents who pledge are much less 
likely to have intercourse than adolescents who do not pledge. The 
delay effect is substantial. On the other hand, the pledge does not 
work for adolescents at all ages. Second, pledging delays intercourse 
only in contexts where there are some, but not too many, pledgers. 
The pledge works because it is embedded in an identity movement. 
Consequently, the pledge identity is meaningful only in contexts 
where it is at least partially nonnormative. Consequences of pledging 
are explored for those who break their promise. Promise breakers 
are less likely than others to use contraception at first intercourse. 


Gonna give you all my love, boy/My fear is fading fast/Been 
saving it all for you/’Cause only love can last. 
(Madonna, “Like a Virgin”) 


INTRODUCTION 


Since 1993, in response to an organized social movement sponsored by 
the Southern Baptist Church, well over 2.5 million adolescents have taken 


1 Data for this article are drawn from the National Longitudinal Study of Adolescent
Health (Add Health), a program project designed by J. Richard Udry and Peter Bear-
man and funded by a grant from the National Institute of Child Health and Human
Development to the Carolina Population Center, University of North Carolina at 
Chapel Hill (HD31921), with cooperative funding participation by the National Cancer 
Institute; the National Institute of Alcohol Abuse and Alcoholism, the National In- 
stitute on Deafness and other Communication Disorders, the National Institute on 
Drug Abuse, the National Institute of General Medical Sciences, the National Institute 
of Mental Health; the Office of AIDS Research, National Institutes of Health (NIH), 
public “virginity” pledges, in which they promise to abstain from sex until
marriage. By the standard set by most social movements, the pledge 
movement has been a resounding success. Heralded in the popular press 
as a movement of, for, and by adolescents, the pledge movement has 
expanded to include hundreds of church, school, and college chapters. It 
is associated with the development of new products and cultural symbols 
(most notably in music) and is home to numerous interlocked Internet 
pages where adolescents can pledge online, purchase pledge merchandise, 
and interact with other pledgers in pledge-dedicated chat rooms. Pledgers 
can even go to summer camps where only other pledgers can be found. 
The movement has been successful in organizing mass rallies in which 
speakers extol the benefits of abstinence to stadiums full of eager ado- 
lescents. Its growth rate has been phenomenal, and with it, the movement 
has spawned a whole new subculture in which it is “cool” to say no to sex,
and where, according to one account, “virginity is hot” (Young and Mod- 
ern, May 1998, in an issue featuring the 100 secrets of Leonardo DiCaprio). 

The pledge movement is really a movement organized by adults for 
adolescents and so, not surprisingly, while pledging is for adolescents, the 
movement is also attractive to many adults.² Movement rhetoric stresses
orientations that are loosely labeled as “family values.” Supporters of the 
pledge like the idea that adolescents can take individual responsibility for 
their sexual life simply by promising to say no thank you to sex—at least 
until marriage. Others are less sympathetic to the
pledge movement. First, pledge critics simply do not believe that
something as simple as a pledge can work. In part, they feel uncomfortable
endorsing an element of a program (in this case, the pledge) that is based 
on assumptions that are antithetical to a liberal vision of the world, for 
example, the assumption that sex should only occur in the context of 
marriage. Critics do not necessarily oppose the idea of “saving” sex until 
marriage, but they think that it is an unrealistic foundation for social 


the Office of the Director, NIH; the National Center for Health Statistics, Office of 
Minority Health, Centers for Disease Control and Prevention, Health and Human 
Services (HHS), Office of the Assistant Secretary for Planning and Evaluation, HHS; 
and the National Science Foundation. Additional funding for the analyses reported in 
this article was provided by the Office of Population Affairs. The authors thank Chris 
Bachrach, Guang Guo, John Hutchins, Joe Rodgers, J. Richard Udry, and Harrison 
White for helpful comments. Address all correspondence to Peter Bearman, Institute 
for Social and Economic Research and Policy, 814 IAB, Columbia University, New 
York, New York 10027. E-mail: psb17@columbia.edu


2 The pledge movement is loosely organized. More than 80 autonomous organizations
sponsor public pledges, support chapter formation, and participate in organizing rallies 
of varied scale. Movement organizers claim disinterest in the number of adolescents
who have taken a pledge. They keep no formal records, and they have few formal
mechanisms in place to coordinate the efforts of participating organizations. The pledge 
movement is one of the activist arms of the more general abstinence-only movement 
policy. In part, critics find themselves uncomfortable with the pledge
because many pledge supporters think that sex education sends the wrong
message to adolescents.³ And critics do not like the fact that the pledge
movement rests on, and reinforces, the imagery of romantic love,⁴ not
because critics of the pledge are cynics, but because pledge rhetoric seems
to be associated with traditional gender roles in adolescence as well as
young adulthood.⁵

Whether pledge supporters or critics, most adults are concerned about 
adolescent sex, here defined as intercourse.⁶ Critics are more concerned
about the observable consequences of sex: heightened risk for teenage 
pregnancy, abortion, and sexually transmitted diseases (STDs) and in- 
creased likelihood of participation in other risk activities, such as drinking, 
smoking, and substance abuse (Resnick et al. 1997; Tubman, Windle, and 
Windle 1996a, 1996b; AGI 1994; Martin 1996; Thompson 1995). In this
view there is nothing wrong with sex per se.⁷ The problem with sex is
that it often has negative consequences, especially for girls, who typ-
ically feel worse after sex than before sex (Joyner and Udry 1998), and
who face the risk of unwanted pregnancy. Pledge supporters do not ar- 
ticulate their opposition to sex in terms of the consequences of sex, because 
this would appear to legitimize “good” sex. Instead, they stress moral 
systems that justify saying no thank you to sex in and of itself. 

To anticipate the main empirical findings reported in this article, there 
might be something for everyone. Pledge supporters will find comfort in 
the fact that the pledge has a substantial effect on the timing of first 
intercourse. Adolescents who pledge, controlling for all of the usual char- 


3 On this issue, feelings sometimes run strong. On October 26, 1998, after hearing an
earlier version of this article read at a Planned Parenthood of NYC workshop, the 
president and CEO of SIECUS (Sexuality Information and Education Council of the 
United States) led the assembled crowd of 300 or so participants in a rousing chant: 
“Abstinence programs do not work.” 

4 An irony is that, within the adolescent world, it is the rhetoric of romantic love that
provides boys with the key cultural mechanism by which they “work out a yes” from
girls (Kaminer 1996; Thompson 1995; Martin 1996).

5 Critics are also concerned that pledging sets standards that are unrealistic for most
adolescents. This may lead them to marry too young, inducing as a consequence 
marriages with higher chances of divorce. In 1988, the average time between sexual 
maturity and marriage was 11.8 years for young women and 12.5 years for young men 
(AGI 1994), which many believe is a long time to wait for sex. 

6 Of course, “sex” also involves more than intercourse. Most teenagers, and some adults,
think otherwise for the most part, though, and in this article we focus only on 
intercourse. 

7 Brooks-Gunn and Paikoff (1993), e.g., define “sexual well-being” not in terms of sexual
behavior per se but with respect to the consequences of such behavior for adolescents. 
In such a view, the task is to help adolescents avoid negative consequences of such 
behavior. 
acteristics of adolescents and their social contexts that are associated with
the transition to sex, are much less likely than adolescents who do not
pledge to have intercourse. The delay effect is substantial and robust.
Pledging delays intercourse for a long time. In this sense, the pledge
works.⁸ Critics of the pledge should find solace in two important ca-
veats. First, the pledge effect is strongly conditioned by age. Pledging does
not work for adolescents at all ages. Second, pledging delays intercourse
only in contexts where there are some, but not too many, pledgers. Too
few, and too many, pledgers in the adolescent world can negate the pledge
effect. The pledge effect is largely contextual.⁹

We reach these conclusions with data from the National Longitudinal 
Study of Adolescent Health (hereafter, Add Health), which was collected 
at the height of the pledge movement, the period between 1994 and 1996 
(Bearman, Jones, and Udry 1997). Add Health provides detailed infor- 
mation on the social contexts in which adolescents are embedded. The 
clustered design of Add Health allows us to investigate the effect of pledg- 
ing, in the same interacting communities, on pledgers and nonpledgers 
alike. The longitudinal design of Add Health allows us to order cause 
and effect without ambiguity. Because our analyses operate on complex 
data structures at multiple levels of observation, showing the findings we 
identify involves a number of steps. These steps are outlined below. 

We first briefly review the literature on sexual debut. Because the pledge 
is embedded in an identity movement, we consider the specific aspects of 
such movements that provide the foundation for both their success and 
failure. We then describe the Add Health study and the design and meth- 
ods of our analysis. We then describe the transition to first sexual inter-
course and look at differences between pledgers and others.¹⁰ Because we
restrict our analyses of pledging to adolescents for whom we can un- 


8 Later, we show that there are no obvious psychological downsides to pledging and
then having intercourse anyway. Pledgers who break their promise (hereafter, promise
breakers) and end up having intercourse feel no worse about themselves than other
adolescents who have intercourse. But promise breakers are less likely than others to
use contraception at first intercourse. This means that promise breakers are at greater
risk of the negative consequences of sex. That pledgers who have sex are likely to be
contraceptively unprepared is to be expected, for it is hard to imagine how one could 
both pledge to be a virgin until marriage and carry a condom while unmarried. 

9 The pledge does not operate by changing individuals in obvious and easy-to-measure
ways that are known to be correlates of adolescent sexual behavior. Just thinking about 
girls, those who are smart, unattractive, on sports teams, doing well in school, close 
to their parents, and have high self-esteem are less likely than others to have sex. 
Adolescents do not get smarter once they have pledged, they do not become less 
attractive, they do not gain or lose self-esteem, they do not suddenly join sports teams, 
do better in school, have better relations with their parents, or feel more attached to 
their schools. But they do delay intercourse. 

10 App. B reports a full multivariate model of the determinants of pledging.


ambiguously order intercourse and pledging, our sample is selective. We 
discuss the impact of sample selectivity for our results in appendix A. 
The impact is negligible for nonblack adolescents. 

We then turn to multivariate event history models of the transition to 
first intercourse. We focus on identifying main effects and observe the 
strong impact of the pledge on the transition to first intercourse, controlling 
for all of the relevant factors associated with pledging. As the transition 
dynamics are different for adolescent females than for males, we next test 
a wide range of gender interactions with a model focusing on individual- 
level and partnership characteristics. Because pledge rhetoric invokes the 
idiom of romantic love, we focus on the interaction between pledging, 
emotional commitment within relationships, and the transition to inter- 
course. The results show significant race and gender differences. Specif- 
ically, whereas emotional commitment within relationships is positively 
associated with the transition to sex within that relationship for most 
adolescents, the opposite is true for black males. Black males are not 
likely to have intercourse in relationships characterized by emotional 
closeness. They are likely to have intercourse in relationships that invoke 
little emotional commitment. 

Next, we explore context effects on the transition to first intercourse, 
specifically, the proportion of pledgers in a respondent’s school for both
pledgers and others. Here we identify the core contextual effect described 
above. We also show how pledging is driven by the proportion of pledgers 
in school. 

We then relax one important assumption underlying the event history 
models previously discussed. Specifically, we explore possible age de-
pendencies of the estimated parameter effects. Here, we ask whether the
presence and magnitude of factors affecting the transition to first inter-
course depend on the age at which their influence is exerted. We show
that most factors are critically sensitive to age. The pledge effect also 
depends on an adolescent’s age. 
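
The question of age dependence can be made concrete with a person-period layout, in which each adolescent contributes one row for every year of age at risk. What follows is only a minimal sketch under stated assumptions: the record layout, the function names `person_periods` and `hazard_by_age`, and the four toy records are hypothetical illustrations, not Add Health data and not the model specification used in this article.

```python
from collections import defaultdict

def person_periods(records):
    """Expand one record per adolescent into one row per year of age at risk.

    Each record is (start_age, end_age, event, pledged):
      start_age - age at which observation begins
      end_age   - age at first intercourse, or age at last interview if censored
      event     - True if first intercourse was observed at end_age
      pledged   - True if the adolescent reported taking a pledge
    """
    rows = []
    for start_age, end_age, event, pledged in records:
        for age in range(start_age, end_age + 1):
            had_event = event and age == end_age  # event only in the final year
            rows.append((age, pledged, had_event))
    return rows

def hazard_by_age(rows):
    """Empirical hazard: events at each age divided by persons at risk at that age,
    computed separately for pledgers and nonpledgers."""
    at_risk = defaultdict(int)
    events = defaultdict(int)
    for age, pledged, had_event in rows:
        at_risk[(age, pledged)] += 1
        events[(age, pledged)] += int(had_event)
    return {key: events[key] / at_risk[key] for key in at_risk}

# Hypothetical toy records, invented purely for illustration.
toy_records = [
    (14, 16, True, False),   # nonpledger, first intercourse at 16
    (14, 17, False, True),   # pledger, censored at 17
    (15, 15, True, False),   # nonpledger, first intercourse at 15
    (15, 17, True, True),    # pledger, first intercourse at 17
]
toy_rows = person_periods(toy_records)
toy_hazard = hazard_by_age(toy_rows)
```

Because age is an explicit column in this layout, an age-specific pledge effect amounts to comparing the pledger and nonpledger hazards within each age rather than pooling across ages.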

After the detailed analysis of the effects of pledging on sexual initiation, 
we look specifically at health outcomes and the health behavior of pledgers 
and others who have had intercourse. It is easy to say that the pledge is 
not likely to work in the long run because eventually most everyone will 
transition to intercourse before marriage. But this is misguided, for even 
if pledging works only to delay intercourse, and even if pledgers eventually 
have sex before they marry, pledging may be a way to bridge some of 
the difficult years in early adolescence without (unwanted) sexual activ-
ity.¹¹ Still, critics have argued that promise breakers will feel guilty after


11 About one-quarter of all adolescents report that they have felt pressure from peers 
to have sex (AGI 1994). Pledging may provide a useful tool for fending off such peer 
sex and thus lose self-esteem. Consequently, we first ask whether pledgers
are worse off in terms of the consequences of sexual debut for psycho- 
logical well-being than nonpledgers. We show that promise breakers are 
not worse off with respect to self-esteem. 

Finally, using a sample of adolescents who have had first intercourse 
within our observation window only, we model contraceptive behavior 
at first intercourse and examine whether pledgers are less contraceptively 
prepared than others. We show that there are differences between promise 
breakers and others in their contraceptive behavior. Pledgers are less likely 
to use contraceptives. We conclude by focusing on implications of identity 
movements for social policy. 


SOCIAL INFLUENCES ON ADOLESCENT SEXUAL DEBUT 


Our empirical work builds from a large literature on the determinants of 
first intercourse. The key insight that we exploit is the recognition that 
adolescents are embedded in multiple social contexts—families, schools, 
peer groups, and romantic relationships—and that these social contexts, 
in interaction with individual characteristics, are important determinants 
of adolescent behavior (Udry and Bearman 1998; Perry, Kelder, and 
Komro 1993; Brooks-Gunn and Paikoff 1993). Our goal in this short 
section is to identify the salient influences on sexual debut discussed in 
previous work in order to provide the foundation for a baseline model 
from which we can assess the pledge effect. 

Many adolescent “problem behaviors,” like sexual activity, drinking, and
smoking, are behaviors that are socially sanctioned, acceptable, and en-
joyable for many adults (Udry and Bearman 1998). Although these behaviors
are not socially sanctioned for adolescents, it is reasonable to think that
adolescents enjoy them as well. Consequently, one idea is that in the absence of social control,
all adolescents would engage in nonnormative (enjoyable) behavior.¹² It
follows that one determinant of sexual behavior is the extent to which 
adolescents are subject to, and influenced by, social controls that operate 
to constrain opportunity to engage in nonsanctioned behavior, or heighten 
the costs of such behavior. Conversely, social influence (especially the 
influence of partners, for sexual behavior) may operate to enhance the 
perceived benefits of nonnormative behavior. Many of the social context 
variables that we use in our baseline model derive from the social control 


pressure. For an analysis of peer effects on sexual debut and pregnancy risk among
girls, see Bearman and Brückner (1999). 

12 See Udry (1988) for a summary and critique of social control theory; Hofferth (1987)
for a summary of research with respect to sexual initiation, and Brooks-Gunn and 
Paikoff (1993) for a summary with respect to the sexual well-being of adolescents. 
perspective. The central idea is to capture through measurement the social
influences on adolescents that shape the costs, benefits, and opportunities 
for sexual initiation. 

Table 1 identifies the main social (and developmental) influences on 
sexual debut. Within each domain, we list the core variables identified in prior lit-
erature that we use in our analyses, along with their hypothesized relationship
to sexual debut. For each effect we note if it is hypothesized to
be protective (negative) or risk enhancing (positive). Many of these factors 
are likely to influence sexual behavior differently across the adolescent 
life course. There are also strong indications in the literature that the 
transition dynamic for girls is different than that for boys. Consequently, 
gender interactions across many of the variables capturing social influence 
are expected. Potential gender interactions are listed in table 1. Compar- 
atively little work has been done that focuses on age dependencies with 
respect to adolescent transition to first sex, but such dependencies are 
likely important. Physical and pubertal development, for example, is likely 
to be more differentiated among younger teens (Udry and Billy 1987). 
Consequently, it is reasonable to expect that the effect of physical devel- 
opment on the transition to first sex is stronger for younger adolescents 
than for older adolescents. Likewise, as adolescents get older, peers may 
become more influential, while parents’ influence may decline as adoles-
cent autonomy increases (Hofferth 1987; Bearman and Brückner 1999).
Consequently, we would expect to observe enhanced peer influence and 
declining parental influence on sexual behavior as adolescents age. For 
each influence listed, we note whether or not there is an expected age 
dependency. 


FAMILY CONTEXT AS SOCIAL INFLUENCE ON FIRST 
INTERCOURSE 


Family context has been shown to have consistent and strong effects on 
the timing of first intercourse. Teens living in single-parent families or 
with stepparents initiate sexual activity earlier than those in two-parent 
families (AGI 1994; Brooks-Gunn and Paikoff 1993; Hayes 1987; Harris 
1996; Miller 1998). Social control theory provides the central mechanism 
for this effect: adolescents in nontraditional families tend to be subject to 
less supervision, and/or more permissive attitudes toward sex, by adults. 
Less supervision provides greater opportunity for engagement in non- 
sanctioned behaviors, including sex. Adolescents’ relationship to their par- 
ents, net of socioeconomic background and family context, is also seen as 
an important correlate of sexual behavior. Adolescents’ feelings of closeness
and connectedness to parents have been shown to delay sexual activity, as
have parental disapproval of sex (Resnick et al. 1997; Jaccard, Dittus,
and Gordon 1996) and adolescents’ perception of their parents’ disap-
proval of sex (Dittus and Jaccard 1998).¹³ If parent attitudes toward sex
are not perceived to be strongly and consistently negative, adolescents 
may also think that the cost of early sexual initiation is lower. In general, 
connectivity to parents and siblings is protective because it increases the 
costs of behaviors that are not socially sanctioned. Parents’ educational 
attainment and occupational status have also been shown to delay sexual 
initiation (Miller 1998; Hofferth 1987). Typically, the costs associated with 
negative sequelae arising from early sexual initiation (including but not 
limited to pregnancy) are higher for adolescents from high socioeconomic 
status (SES) backgrounds. Consequently, the social control perspective 
hypothesizes that SES is positively associated with age at first sex. 


PEER AND PARTNERSHIP INFLUENCES 


Consistent with the expectations of control theory, Bearman and Brückner 
(1999) show that peer influences (net of family, individual, and social 
demographic factors) on the transition to first intercourse are generally 
“positive”; that is, they work to delay age at first intercourse. Connections
to peers, as with connections to parents, raise the costs of engaging in
nonsanctioned behaviors. In contrast, partnership effects are generally
“negative”; that is, they hasten the transition to first intercourse. The
more frequently and the earlier adolescents date, the earlier they have 
intercourse. Commitment to a romantic relationship also plays a role in 
the timing of first intercourse (Furstenberg 1976; Spanier 1975). Likewise, 
partner and partnership characteristics are seen to play a key role in the 
timing of first intercourse. Critical in this regard are the extent to which 
the partnership is embedded in larger social networks and whether it is 
characterized by emotional commitment. Partnerships that draw adoles-
cents away from peers (i.e., that isolate young persons) are more likely to
be associated with sex. Except for black males, emotional commitment 
within the partnership, controlling for partnership duration, is positively 
associated with the transition to sex within that partnership. These part- 
nership effects are thought to operate on adolescents by changing their 
perception of the costs and benefits of intercourse. 


13 Adolescent perceptions of parental attitudes about sex are accurate less than half
the time. There are few studies that explore gender differences in parents’ normative
influence on teenagers’ sexual behavior, although social control may be stronger, and 
undesirable behavior more negatively sanctioned, for girls (Martin 1996). In any case, 
Udry (1988) shows that social control indicators are more important for girls than for 
boys, whose transition appears to be more governed by opportunity and hormones. 
INDIVIDUAL, BIOLOGICAL, AND SOCIAL DEMOGRAPHIC
INFLUENCES 


Independent of background and family structure, adolescents who have 
reached puberty are more at risk of intercourse than those who have not.
Smarter adolescents are more likely to delay intercourse (Bearman and
Brückner 1999; Halpern et al. 2000). Cognitive skills, interests outside the
dating culture, as well as self-esteem are seen to be important delaying 
factors of sexual initiation for girls. Academically inclined girls are less 
interested in, and less interesting for, boys (Hayes 1987; Hofferth 1987). 
In general, the more adolescents do (constraining their opportunities and 
increasing their investments in socially sanctioned activities) the later they 
have intercourse. Participation in extracurricular activities (both aca-
demic clubs and sports teams) delays intercourse (Bearman and Brückner
1999), as do a cluster of variables associated with attachment and com- 
mitment to school (Resnick et al. 1997). These attachments draw adoles- 
cents into a wider circle of peers and activities, all of which raise the costs 
associated with early sexual debut. More religious adolescents tend to 
engage in sexual activities later than do their less religious counterparts 
(Thornton and Camburn 1989; Miller 1998; Bearman and Brückner 1999).
Behavioral measures of affiliation with one’s religion seem to be more 
important than membership in specific denominations (Resnick et al. 1997; 
Hayes 1987; Hofferth 1987). This suggests that it is not religious pre- 
scriptions against sex per se that are salient for delaying intercourse, but 
the effect of increased connectivity to others arising from participation in 
church and community groups. 

As noted above, these influences are likely to operate differently for 
adolescents of different ages. Connections to peers and partners are ex- 
pected to become more salient with age, whereas family influences are 
expected to be strongest for younger adolescents. Likewise, different path- 
ways to sexual intercourse for boys and girls are expected. In general, 
contextual factors are thought to be more salient for girls, since the risks 
associated with sexual activity are disproportionately shared by females. 
Consequently, social influences that shape adolescent investments in oth- 
ers—whether peers, partners, or parents—are likely to have a greater 
impact on transition dynamics for girls than for boys. 

Our interest in this article is in modeling the effect of taking a virginity 
pledge on the transition to first intercourse by sex, age, and social context. 
To be able to estimate the independent effect of a pledge, we need to 
estimate baseline models for sexual debut. The influences described above, 
arising in part but not exclusively from control theory, are those most 


14 This is more true for whites than for blacks (Udry 1988).
often seen to shape transition dynamics, and they provide the conceptual
foundation for our measurement of the peer, partner, family, and asso- 
ciational contexts in which adolescents find themselves. Because social 
demographic and developmental variables play an important role in struc-
turing opportunity, interest, and capacity to have sex, our models control
for well-established covariates of sexual debut—pubertal development, 
age, intelligence, race, self-esteem, SES, and a battery of attitudes toward 
sex, including but not limited to the perceived costs, benefits, and risks 
of sexual activity. 


IDENTITY MOVEMENTS 


The pledge is not a government program. Adolescents who pledge can 
get pledge cards, but they are not required to show them to anyone. 
Schools do not sponsor pledges, although they may allow pledge groups 
and clubs to use their grounds before or after school. They offer no special 
incentives for pledging, and pledgers are not rewarded for pledging by 
schools. Pledge groups may arise from church youth groups, but they are 
not isomorphic with them, and many nonpledgers and pledgers alike 
belong to church groups.¹⁵ Instead, the institutional foundation of the
pledge is the local pledge group. These are only loosely coordinated with 
each other through weak national leadership. The movement recruits
pledgers to local groups through the Internet, through church groups, 
through Christian music and rallies, but mostly through the energies of 
adults and adolescents in the community. The incentives the movement 
selectively provides to its adherents are identity and fellowship with other 
pledgers. The pledge identity is induced and sustained through interacting 
with other pledgers in the community who distinguish themselves from 
nonpledgers by their public pledge and their commitment to the group. 
The pledge movement, in this sense, is an identity movement—that is, a 
movement that provides a frame for self-understanding (and consequently 
action in pursuit of that understanding) effective only in the context of, 
and interaction with, similar others who constitute a self-conscious com- 
munity differentiated from others. The key to identity movements is that 
they need the other to induce the “self” (Hardin 1995; Laitin 1998; Bear- 
man 1991). 

Consequently, identity movements are prone to self-limiting dynamics 
governing recruitment and commitment. Specifically, as a group becomes 
successful, commitment to the identity declines. On one hand, the unique 


15 We considered membership in youth church groups as an independent variable
affecting pledging and/or pledge effectiveness, but it is not significant.
nature of the identity disappears as it becomes more normative (Hardin
1995; Calhoun 1991). Consequently, it no longer commands a strong grip 
on behavior. On the other hand, as the movement becomes more nor- 
mative, followers come into increased routine contact with nonfollowers, 
in-group interactions become less dominant, and new influences come to 
bear (Kim and Bearman 1997; Blau 1977). Recruitment dynamics shift 
as well, associated with the same threshold effect that governs commit- 
ment. If the movement successfully bridges the gap between heterodoxy 
and orthodoxy, there are no more reasons to join. It appears that identity 
movements are, by their nature, minority movements. 

If, as we argue, the pledge movement is an identity movement, it should 
be characterized by the same self-limiting dynamics governing recruitment 
and commitment described above. Specifically, within each focal point 
(the local community) we ought to be able to observe two interrelated 
outcomes: declining significance of the pledge effect at the point that 
pledging becomes normative, and declining impact of the number of 
pledgers on the probability of pledging (recruitment) after the pledge be- 
comes normative. We test for these expectations subsequently and find 
that they are confirmed. 


DATA, DESIGN, AND METHODS 


Data for the analyses reported on in this article are drawn from Add 
Health, a nationally representative study of American adolescents in 
grades 7-12. Add Health provides data from three waves of data collec- 
tion. Adolescents were included based on a multistage cluster sampling 
design. Add Health is a school-based study. The primary sampling frame 
was derived from the Quality Education Database, which lists all high 
schools in the United States. From this list, Add Health selected a stratified 
sample of 80 high schools (defined as schools with an eleventh grade and 
more than 30 students) with probability proportional to size. Schools were 
stratified by region, urbanicity, school type (public, private, parochial), 
ethnic mix, and size. For each high school selected, Add Health identified 
and recruited one of its feeder schools (typically a middle school) with 
probability proportional to its student contribution to the high school, 
yielding one pair of schools in 80 different communities. Schools varied 
in size from fewer than 100 students to more than 3,000 students. The 
Add Health sample includes private, religious, and public schools from 
communities located in urban, suburban, and rural areas of the country. 
The schools, and the students in them, are representative samples. Add 
Health replaced schools or school pairs in each stratum where a school did 
not agree to participate in the survey; however, nearly 80% of the schools 
that were contacted by Add Health agreed to participate in the study. 

From September 1994 until April 1995, in-school questionnaires were 
administered to all students in each sampled school. Each school admin- 
istration occurred on a single day within one 45-60-minute class period. 
Students not in school on the day of the administration did not complete 
the in-school instrument. Over 80% of all students completed the ques- 
tionnaire. Seven schools did not allow us to survey students in the school 
but did provide us with a roster; thus we have completed in-school ques- 
tionnaires from over 90,000 students attending 141 schools.'* 

The in-school questionnaire provided measurement on the social and 
demographic characteristics of respondents. Students were asked about 
the educational and occupational background of parents, their household 
structure, risk behaviors, visions of the future, self-esteem, and health 
status. Students were also asked to nominate their five best male and 
female friends. They reported on frequency of contacts with friends and 
the sports and extracurricular activities that they participated in during 
the school year. School administrators also completed a half-hour self- 
administered questionnaire in the first and third years of the study. 

For the second stage of data collection (the wave 1 in-home survey), 
Add Health obtained rosters of all enrolled students in each school. From 
the union of students on school rosters and students not on a roster who 
completed an in-school questionnaire, Add Health randomly selected a 
sample for a 90-minute in-home interview. Students who did not partic- 
ipate in the in-school survey were eligible to be selected for participation 
in the in-home main sample. Consequently, the wave 1 sample includes 
students who did not participate in the in-school survey as well as students 
who had dropped out of school. Add Health completed 20,745 wave 1 
in-home interviews, with an 80% response rate. Parental interviews are 
available for slightly more than 85% of all adolescents in the in-home 
sample. 

Data collected during the in-home phase of Add Health provide mea- 
surement on more sensitive health risk behaviors, such as drug and alcohol 
use, sexual behavior, and criminal activities in addition to detailed mea- 
surement of the student’s health status, health utilization, peer networks, 
decision making, family dynamics, aspirations, and attitudes. In addition, 
Add Health collected detailed data on romantic partnerships, including 
but not limited to the sequence of activities with partnerships and the 


1 The discussion is complicated by the fact that some schools are K-12, therefore 
counting as a single school pair. Add Health provides data on 80 school pairs and 148 
(not 160) unique schools. We lose data on 7 schools. For more detail on the Add Health 
design, see Bearman, Jones, and Udry (1997) and Udry and Bearman (1998). 
structure of romantic and sexual networks. The adolescent in-home in- 
terview was conducted using ACASI technology for all sensitive health 
status and health risk behavior questions. Adolescents listened to ques- 
tions through earphones and directly entered their responses into a laptop 
computer, thereby eliminating any potential for interviewer or parental 
effects on their responses. ACASI technology has been shown to reduce 
response bias associated with sensitive questions and nonnormative be- 
havioral items (Turner et al. 1998). 

The first wave of in-home interviews was initiated in May 1995 and 
completed in December 1995. Most interviews were conducted during the 
summer. Follow-up interviews (wave 2) with adolescents who participated 
in the first wave of the in-home survey were conducted between April 
and September 1996. Interviews were not attempted with wave 1 seniors 
in wave 2. Over 85% of all eligible wave 1 respondents participated in 
wave 2, resulting in 14,787 interviews. 

The analyses reported in this article make use of data from both in- 
home waves, the in-school interview, the parent interview, and school 
administrator surveys. School and network information comes from the 
in-school survey; all measures of adolescents’ and parents’ characteristics 
and attitudes are derived from the wave 1 in-home interview and parental 
questionnaire; and information on the timing of sexual initiation and 
romantic involvement is taken from both waves of in-home interviews.” 


MEASUREMENT 


The focus of this analysis is the effect of taking an abstinence pledge on 
the transition to first intercourse. Adolescents who were not married at 
the time of the first in-home interview were asked whether they had “ever 
taken a public or written pledge to remain a virgin until marriage.” In 
the following, we first describe social demographic variables, then turn 
to family and individual factors. Many of the contextual variables we use 
in this analysis are new or less commonly found in the literature. Spe- 
cifically, we explore network correlates (popularity and isolation), the char- 
acteristics of romantic partnerships (their emotional closeness and their 
social embeddedness), and school context. We describe these variables 
below. Appendix C offers more detail. 

As noted earlier, differences between the transition behavior of white 
and black adolescents are well documented in the literature (e.g., AGI 
1994; Brewster 1994; Furstenberg, Brooks-Gunn, and Morgan 1987). 


Y See Bearman et al. (1997) for a detailed description of the Add Health design and 
instrumentation. 
Therefore, we stratify the analysis by race and test for gender interactions 
within each group.'* Hispanic and Asian respondents are included with 
the white sample and identified with a dummy variable. We measure SES 
as a composite index of parents’ education and occupation, combining 
information from in-home adolescent and parental questionnaires. 

Family type is measured in three categories: adolescents living with 
both biological parents (comparison category); adolescents living only with 
their mother or father; and adolescents living with two adults one or both 
of whom are step- or foster parents. We control for perceived parental 
attitudes toward sex. To measure closeness to parents, we constructed an 
index from three questions (α = .87; see app. C for exact wording).”” 

Sexual debut is generally conditional on interest in and ability to have 
sex, both of which are associated with physical maturation and puberty. 
In this analysis, we make use of data on self-reported physical develop- 
ment. Specifically, we use an index constructed from the mean of three 
items (α = .65). Girls were asked about breast and body development, 
boys about facial and body hair growth. Both were asked whether they 
looked younger, about the same, or older than adolescents in their age/ 
sex group.” As an indicator for psychological well-being, we use a scale 
of self-esteem constructed from the 11 items reported in appendix C 
(α = .86) and rated on a Likert scale. Cognitive ability is measured with 
the Add Health PPVT (Peabody Picture Vocabulary Test; referred to hereafter 
as “Add Health verbal test score”). In previous research adolescents with 
greater cognitive skills have been found to delay intercourse (Hofferth 
1987; Halpern et al. 2000). There is some indication that the relationship 
between IQ and sex is curvilinear (Halpern et al. 2000; Bearman and 
Brückner 1999). We test this hypothesis and find little support for it with 
respect to either pledging or age of first sex. In addition, we control for 
grade point average (GPA) as an indicator for academic orientation and 
achievement. 
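
The indexes described above are means of several items, each reported with a reliability coefficient α (Cronbach's alpha). As a minimal sketch of how such an index and its α could be computed, assuming complete item responses: the function names and toy data below are hypothetical and are not taken from the Add Health files.

```python
from statistics import pvariance


def cronbach_alpha(items):
    """Cronbach's alpha for a list of items.

    `items` holds one list of scores per item; every list has one
    score per respondent, in the same respondent order.
    """
    k = len(items)
    n = len(items[0])
    # Total score per respondent across all items.
    totals = [sum(item[i] for item in items) for i in range(n)]
    sum_item_variances = sum(pvariance(item) for item in items)
    total_variance = pvariance(totals)
    return (k / (k - 1)) * (1 - sum_item_variances / total_variance)


def mean_index(items):
    """Index value per respondent: the mean of that respondent's item scores."""
    n = len(items[0])
    return [sum(item[i] for item in items) / len(items) for i in range(n)]


# Hypothetical toy data: three items answered by four respondents.
toy_items = [[1, 2, 4, 5], [2, 2, 4, 4], [1, 3, 5, 5]]
print(cronbach_alpha(toy_items), mean_index(toy_items))
```

Alpha rises as the items covary more strongly, which is why a three-item index can reach a value such as .87 when the items track one another closely.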

Adolescents’ religiosity is measured with three behavioral items (α = 
.85; see app. C for wording): frequency of praying, church attendance, 


** Black respondents are defined as those who either reported only “African-American” 
as their racial background or reported some other racial background but gave “African- 
American” as their main background. 

** We explored an array of factors associated with parental supervision, including but 
not limited to time with parents, parent attitudes toward sex, parental supervision, 
perceived autonomy of decision making, and the number of dinners with parents each 
week. None of these variables is significant; consequently, the final models we report 
do not include them. 

” Add Health also asked about age at menarche. One could imagine using this as a 
measure of physical development for females, but the correlation with the other items 
was low, and the index performed better in the multivariate models than the age at 
menarche variable. Also, there is no equivalent measure for boys. 
and importance of religion.” School attachment is constructed by taking 
the mean of five responses to questions measuring closeness to others at 
school, sense of belonging to the school community, and getting along 
with other students and teachers (see app. C). Participation in school sports 
activities is measured by a count of the activities in which an adolescent 
takes part. 


NETWORK, PARTNERSHIP, AND SCHOOL CONTEXT VARIABLES 


A simple measure of how popular adolescents are is in-degree, the number 
of nominations as friends (including romantic partners) they receive from 
others in their school. Because we need data on complete social networks 
to calculate in-degree, this information is taken from the self-administered 
in-school survey. Unpopular, socially isolated teens may be both less at- 
tractive and lacking opportunity to recruit romantic partners among their 
friends and friends’ friends. Since the distribution of the number of nom- 
inations is highly skewed, we constructed a dummy variable indicating 
unpopular teens.” 
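
The popularity measure can be illustrated with a short sketch. It counts received friendship nominations (in-degree) and flags low values with a dummy variable. The cutoff used for "unpopular" is not stated in this section, so the `cutoff` argument, the function names, and the toy nominations are all hypothetical.

```python
from collections import Counter


def in_degree(nominations, students):
    """Number of nominations each student receives.

    `nominations` is a list of (sender, receiver) pairs; students who
    receive no nominations get an in-degree of 0.
    """
    received = Counter(receiver for _, receiver in nominations)
    return {student: received.get(student, 0) for student in students}


def unpopular_dummy(degrees, cutoff):
    """1 if a student's in-degree is at or below `cutoff` (an assumed threshold), else 0."""
    return {student: int(degree <= cutoff) for student, degree in degrees.items()}


# Hypothetical toy network of four students.
toy_nominations = [("a", "b"), ("c", "b"), ("b", "a")]
toy_degrees = in_degree(toy_nominations, ["a", "b", "c", "d"])
print(toy_degrees, unpopular_dummy(toy_degrees, cutoff=0))
```

Dichotomizing sidesteps the skew in the nomination counts, at the cost of discarding variation among the students who are nominated at all.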

The in-home surveys contain a wealth of information about adolescents’ 
romantic relationships. For the 18 months before the first in-home inter- 
view, Add Health provides full retrospective information on start and 
end dates of relationships as well as a description of the behaviors oc- 
curring in each relationship.” In order to make full use of this information 
we construct four time-varying variables from the relationship questions. 
A dummy variable indicating whether or not the respondent is currently 
in a romantic relationship and a count of how many romantic relationships 
the respondent has had (including the current relationship) serve as mea- 


21 We also introduced a measure of parents’ religiosity using the same items from the 
parental questionnaire. Because this variable did not add anything to the model but 
created additional problems with missing data, we dropped it from the analysis re- 
ported below. In addition, we controlled for religious denomination (Roman Catholic, 
black Protestant, moderate Protestant, pledge supporters Protestant, other denomi- 
nation, none). Since the resulting five dummy variables did not contribute to the model 
after controlling for the behavioral measure, and, in particular, did not modify the 
impact of pledging, below we report only the estimates for religiosity for the sake of 
brevity. 
2 Highly popular adolescents do not differ in their transition behavior from those 
with average popularity (results not shown; see also Bearman and Brückner 1999). 
3 For the purpose of this analysis, we combine information on romantic and “liked” 
relationships. For the former, the associated name generator was, “In the last 18 months 
have you had a special romantic relationship with anyone?” For the latter, the 
survey asked whether the adolescent had done each of the following three things with 
the same person (excluding family members): holding hands, kissing someone on the 
mouth, and telling him or her “that you liked or loved them.” 
sures of exposure. The nature of the relationship is measured with two 
indexes. Social embeddedness of the relationship is constructed from three 
items asking about the occurrence of specific social events: going out 
together in a group, meeting each other’s parents, and telling others about 
the relationship (see app. C for the exact wording). We expect social 
embeddedness to delay the onset of sexual activity in the relationship, 
since embedded relationships are more visible to parents and peers and 
thus more subject to social control. Emotional commitment combines the 
following events: going out alone with one’s partner, seeing less of friends, 
thinking about oneself as a couple, giving each other presents, and saying 
“I love you” to each other (see app. A). We expect emotional commitment 
to increase the likelihood for initiating intercourse (Martin 1996; Thomp- 
son 1995). 

One central concern of this article is how the effect of pledging is 
mediated by adolescents’ social context. We subsequently show that the 
effect of an individual adolescent’s pledge on his or her transition to first 
intercourse depends on the prevalence of pledging in his or her school. 
The effects of pledging depend on the extent to which dating relationships 
are confined to the school itself. Where the social environment beyond 
the school context provides ample opportunity for romantic involvement, 
the school context is likely to matter less than in schools where romantic 
partnerships are largely limited to the school. We therefore define socially 
closed schools as schools where the overwhelming majority of adolescents’ 
friendships are within school. For each school, we calculated the ratio of 
opposite-sex friendship and romantic ties sent to other students in the 
same school over the total number of ties sent by students in the school.” 
As an indicator for social closure we use a cutoff point of 83% within 
school friendship and romantic partnership ties. This corresponds to the 
sixty-fifth percentile of the distribution of within school ties over all ties. 
In order to explore these context effects we introduce interaction effects 
between pledging, proportion of female/male pledgers in the school, and 
social closure.” 
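
The social closure indicator described above can be sketched as follows. For each school it computes the share of ties sent to students in the same school and applies the 83% cutoff reported in the text. The data layout (one boolean per tie), the function names, and the treatment of a share exactly at the cutoff as closed are assumptions made for illustration.

```python
CLOSURE_CUTOFF = 0.83  # within-school share reported in the text as the cutoff


def within_school_share(tie_is_internal):
    """Share of a school's sent ties that go to students in the same school.

    `tie_is_internal` holds one boolean per tie sent by students in the
    school: True if the receiver attends the same school.
    """
    ties = list(tie_is_internal)
    if not ties:
        raise ValueError("school has no ties")
    return sum(ties) / len(ties)


def socially_closed(shares, cutoff=CLOSURE_CUTOFF):
    """Flag each school whose within-school share meets the cutoff (assumed inclusive)."""
    return {school: share >= cutoff for school, share in shares.items()}


# Hypothetical toy data for two schools.
toy_shares = {
    "school_1": within_school_share([True] * 9 + [False]),
    "school_2": within_school_share([True] * 6 + [False] * 4),
}
print(socially_closed(toy_shares))
```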


™ We tested both the overall friendship pattern and only the opposite-sex friendship 
pattern for schools (r = .49). Both specifications yield the same results. We report the 
results obtained when using the opposite-sex friendship. 

* The distribution when using persons as unit of analysis corresponds very closely to 
that obtained when using schools as unit of analysis, that is, socially closed schools 
comprise about 30% of our schools (N = 44) and 30% of our adolescents. The results 
are not sensitive to choosing any particular cutoff above the median. We explore as 
well a whole array of other school context variables, including but not limited to, 
school type, centrality of pledgers in school, relative density of school, school size, and 
the interaction of school type, individual religiosity, and school religiosity. These var- 
iables are not significant, and we have trimmed them in the final models we present. 
We explore a whole array of items that measure adolescent attitudes 
toward sex, including but not limited to indexes that capture positive (it 
would help me relax, feel pleasure, be less lonely, gain the respect of my 
partner) and negative (I would feel guilty, it would upset my mother, I 
would lose my partner’s respect) motivations to have sex. These variables 
do not affect the results we report and are not included in the final trimmed 
models because they are available only for respondents 15 years old and 
older. Finally, we explored a set of measures that capture adolescent en- 
gagement with nonnormative behaviors, including delinquency, drinking, 
and trouble in school with teachers and peers.” 


MODELS 


We estimate a hazard rate model, modeling the duration from age at the 
wave 1 interview to first sexual intercourse. Data for respondents who 
never had sex are censored at the time of the second in-home interview. 
For this kind of problem, where we have both right censoring (the event 
has not yet occurred) and time varying covariates, event history methods 
are appropriate. 

There are reasons to assume that the transition rate will not be constant 
over time even when controlling for all kinds of factors affecting the 
transition to first intercourse. A flexible approach to this time dependence 
of a rate is provided by a piecewise constant model with time periods 
(Blossfeld and Rohwer 1995). In this model, the process time axis (age) 
is divided into periods. The rate may change between periods but is 
assumed to be constant within each period. Formally, the rate from the 
origin state j to the destination state k is defined as follows: 


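$$r_{jk}(t) = \exp\left[\alpha_l^{(jk)} + A^{(jk)}\beta^{(jk)}\right] \quad \text{if } t \in I_l,$$


where $\alpha_l^{(jk)}$ is a constant coefficient for the $l$th time period, $A^{(jk)}$ is a vector 
of covariates, and $\beta^{(jk)}$ is an associated vector of coefficients (Blossfeld and 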
Rohwer 1995, p. 111). The coefficients for the time periods express the 
baseline rate (comparable to an intercept in a regression model), while 
the covariates shift the baseline rate upward or downward.” The effect 
of the covariates on the rate is assumed to be proportional over time in 
this type of model. That means that the effect of the covariates in relative 


3 These measures do not affect the final results and are left out of the models we report. 
In general, engagement in the adolescent subculture at low levels (a little drinking, a 
little partying) is protective against early intercourse, since it is associated with greater 
connections to the peer world. 

7 This model is equivalent to a logistic regression with person-months as unit of 
analysis. 
terms is assumed to be constant over the course of adolescence, while the 
“baseline” rate is allowed to change over time. The division of the time 
axis into periods is arbitrary to some extent. The strategy we followed is 
to make the periods as small as possible while assuring that there are 
enough events (i.e., transitions) in each period to obtain stable and robust 
estimates for the rate. An extension of this basic model, the piecewise 
constant exponential model with period-specific effects, allows us to test 
the proportionality assumption by estimating a time-period-specific co- 
efficient for each of the independent variables in the model. 
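
A minimal sketch of the piecewise constant rate described above: the baseline coefficient depends on the age period, and the covariates shift the rate by the same proportion in every period. The cut points, baseline values, and function name are hypothetical illustrations, not the estimates or software used in this article; only the relative risk of 0.66 is a figure taken from the text.

```python
import math
from bisect import bisect_right


def piecewise_constant_rate(age, cut_points, alphas, covariates, betas):
    """Transition rate exp(alpha_l + A * beta) for the period containing `age`.

    `cut_points` are the ascending period boundaries; `alphas` holds one
    baseline coefficient per period, so it has one more entry than
    `cut_points`. A boundary age belongs to the later period.
    """
    if len(alphas) != len(cut_points) + 1:
        raise ValueError("need exactly one alpha per period")
    period = bisect_right(cut_points, age)
    linear_predictor = alphas[period] + sum(
        x * b for x, b in zip(covariates, betas)
    )
    return math.exp(linear_predictor)


# Hypothetical example: three periods and one dummy covariate (pledge).
cuts = [15.0, 16.0]
alphas = [math.log(0.03), math.log(0.05), math.log(0.07)]
betas = [math.log(0.66)]
for age in (14.5, 15.5, 16.5):
    pledger = piecewise_constant_rate(age, cuts, alphas, [1], betas)
    other = piecewise_constant_rate(age, cuts, alphas, [0], betas)
    print(age, pledger / other)
```

The ratio printed for each age is the same, which is the proportionality assumption: the baseline moves between periods while the relative effect of a covariate does not.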

In addition to right censoring, we also have to take left truncation into 
account, since we lose all respondents who already had sex by the time 
of the wave 1 interview. One solution to left truncation, which is easily 
implemented with the kind of model we will estimate, is to let the risk 
period for each person begin at the age they were interviewed for the first 
time (a detailed explanation and alternative solutions can be found in 
Guo [1993]).4 

Values for the time-varying relationship covariates are introduced in 
the model by means of episode splitting (Blossfeld and Rohwer 1995). 
Initially, each respondent contributes one episode from birth to first sex 
or interview date. When the values of time-dependent covariates change, 
the episode is split into two subepisodes. The first subepisode is assigned 
the start value of the covariate and is coded as right censored, while the 
second subepisode is assigned the new value of the time-varying covariate 
and the appropriate destination state. 
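
Episode splitting can be sketched in a few lines. The example below assumes a single time-varying covariate and lets the risk period begin at the age of the first interview, in line with the treatment of left truncation described earlier. The record layout and function name are hypothetical.

```python
def split_episode(start, end, event, changes, initial_value):
    """Split one episode at each change of a time-varying covariate.

    `start` is the age at which the risk period begins, `end` is the age
    at the event or at censoring, and `event` is 1 if the transition
    occurred and 0 if the episode is right censored. `changes` lists
    (age, new_value) pairs. Every subepisode except the last is coded
    as right censored.
    """
    episodes = []
    current_start, value = start, initial_value
    for age, new_value in sorted(changes):
        if age >= end:
            break  # changes at or after the end of observation are ignored
        if age > current_start:
            episodes.append(
                {"start": current_start, "end": age, "event": 0, "value": value}
            )
            current_start = age
        # Changes before the risk period begins only update the covariate value.
        value = new_value
    episodes.append(
        {"start": current_start, "end": end, "event": event, "value": value}
    )
    return episodes


# Hypothetical respondent: interviewed at 15, relationship starts at 16, event at 17.
for episode in split_episode(15.0, 17.0, 1, [(16.0, 1)], 0):
    print(episode)
```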


RESULTS 


Tables 2 and 3 provide basic descriptive information about the variables 
used in the analysis by sex, race, and pledge status. On the bivariate level, 
we find a number of similarities and differences between pledgers and 
others. For all groups, except for black females (table 3), perceived pa- 
rental disapproval of sex is higher for pledgers than for nonpledgers. 
Except for black males, pledgers are also closer to their parents than 
nonpledgers. Pledgers have lower verbal test scores than others (except 
for black females) but, for black females and nonblack males only, higher 
GPA, higher school attachment, and higher self-esteem. Female pledgers 
in the nonblack group (table 2) come from families with lower SES than 
others, and their romantic relationships tend to be more embedded in 
their peer groups. In all groups, pledgers are significantly more religious 


* The survivor functions in app. A, fig. A2, for the restricted sample are based on this 
correction for left truncation. 
than others. In a bivariate context, it is clear that pledgers come from 
more normative backgrounds than nonpledgers. Compared to nonpledg- 
ers, pledgers think their parents like them better. They may not be as 
smart. Bivariate differences between pledgers and others in age at sexual 
debut are discussed in appendix A. 


MULTIVARIATE ANALYSIS 


We turn now to the multivariate analysis of the transition to first inter- 
course. The goal is to test whether the differences between pledgers and 
others account for pledgers’ delayed sexual initiation (see app. A), or 
whether there is an effect of pledging that is independent of other char- 
acteristics of pledgers. A baseline model is reported in table 4. We then 
test gender interaction effects across both groups (tables 5 and 6). We then 
introduce the school context effect (model 2 in tables 7 and 8) and ask 
whether the estimated effects are stable across time as assumed in our 
basic model (tables 9 and 10). In tables 5-10 we report the results in terms 
of relative risk of experiencing the transition. In proportional hazard 
models, the effect of an independent variable on the baseline transition 
rate is commonly reported as the exponentiated coefficient, exp(β). The 
number corresponding to [exp(β) − 1] × 100 provides the expected 
percentage change of the baseline rate given a one-unit change in the 
associated covariate, other variables being equal. For an easier 
interpretation of the results, we report estimated exp(β) and the associated 95% 
confidence interval. For variables with confidence intervals that include 
1, we cannot reject the null hypothesis. For interaction effects for which 
we have specific hypotheses about the direction of the effect, we use a 
one-tailed significance test and report the associated 90% confidence 
interval. The relative risk for nonblack pledgers of 0.66 reported in table 
4, for example, means that for this group, the baseline rate is reduced by 
34%, that is, (0.66 − 1) × 100 = −34, for pledgers compared to others, other 
things being equal. 
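
The conversion from a coefficient to a relative risk and then to a percentage change is simple arithmetic. The helper names below are hypothetical, and the only figure taken from the text is the relative risk of 0.66.

```python
import math


def relative_risk(beta):
    """Relative risk implied by a proportional hazards coefficient."""
    return math.exp(beta)


def percent_change(rr):
    """Expected percentage change in the baseline rate for a one-unit change."""
    return (rr - 1.0) * 100.0


# Relative risk of 0.66 for nonblack pledgers, as reported in the text.
print(round(percent_change(0.66)))
```

A relative risk below 1 yields a negative percentage (a lower rate), and one above 1 yields a positive percentage.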

In table 4, we report the results for the hazard model for white, Asian, 
and Hispanic respondents. The time (age) axis is split into 11 periods of 
six months each, except for the first and last period, which, due to the 
small number of events, are longer. In this model, adolescent females are 
seen to transition at the same rate as males. Hispanic respondents are not 
different from whites. Asians’ risk of sexual initiation is about 33% lower 
than that of whites. Adolescents living with single parents or in step- or 
foster families have a higher risk of sexual debut than those living in 
families with two biological parents (27% and 24%, respectively). As in 
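previous research, higher SES decreases the risk of sexual debut. 

TABLE 4 

WHITE, ASIAN, HISPANIC 

                                      Relative Risk    95% CI 
Baseline rate, age period 1               .03*         .01-.07 
Baseline rate, age period 2               .05*         .02-.14 
Baseline rate, age period 3               .07*         .02-.18 
Baseline rate, age period 4               .05*         .02-.14 
Baseline rate, age period 5               .07*         .02-.18 
Baseline rate, age period 6               .08*         .03-.20 
Baseline rate, age period 7               .06*         .02-.16 
Baseline rate, age period 8               .08*         .03-.22 
Baseline rate, age period 9               .10*         .04-.27 
Baseline rate, age period 10              .11*         .04-.28 
Baseline rate, age period 11              .10*         .04-.27 
Female                                   1.18          .99-1.41 
Hispanic                                  .97          .79-1.19 
Asian                                     .67*         .46-.96 
Single-parent family                     1.27*        1.04-1.55 
Step- or foster family                   1.24*        1.02-1.50 
SES                                       .95*         .92-.99 
Closeness to parents                      .90          .78-1.03 
Perceived parental disapproval            .80*         .73-.88 
In romantic relationship                 3.31*        2.36-4.63 
N romantic partners                      1.16*        1.08-1.23 
Emotional commitment                     1.14*        1.05-1.24 
Embeddedness in peer group                .92          .82-1.04 
School attachment                         .83*         .74-.93 
Unpopular in school network               .81*         .66-.99 
Pubertal development                     1.16*        1.04-1.29 
Add Health verbal test score             1.00          .99-1.00 
GPA                                       .89*         .79-.99 
School sports participation               .93          .86-1.01 
Self-esteem                              1.06          .90-1.25 
Religiosity                               .84*         .78-.91 
Pledge                                    .66*         .52-.83 

BLACK 

                                      Relative Risk    95% CI 
Baseline rate, age period 1               .07*         .01-.58 
Baseline rate, age period 2               .09*         .01-.67 
Baseline rate, age period 3               ...          .01-.83 
Baseline rate, age period 4               .14          .02-1.06 
Female                                    ...          .56-1.23 
Single-parent family                      ...          .88-1.91 
Step- or foster family                    ...          .81-2.02 
SES                                       ...          .91-1.05 
Closeness to parents                      ...          .65-1.16 
Perceived parental disapproval            ...          .75-1.16 
In romantic relationship                 3.69*        1.88-7.26 
N romantic partners                      1.07          .89-1.27 
Emotional commitment                      .92          .77-1.10 
Embeddedness in peer group               1.01          .77-1.31 
School attachment                        1.05          .82-1.35 
Unpopular in school network               .71          .49-1.05 
Pubertal development                     1.02          .83-1.24 
Add Health verbal test score              .99          .98-1.01 
GPA                                       .77*         .60-.99 
School sports participation              1.08          .93-1.25 
Self-esteem                               .96          .66-1.40 
Religiosity                               .88          .75-1.04 
Pledge                                   1.04          .70-1.07 

NOTE. N of cases = 5,679 for nonblacks and 997 for blacks. N of events = 690 for nonblacks and 
152 for blacks. χ² (df) is 455 (40) for nonblacks and 48 (27) for blacks. CI = confidence interval. 
Coefficients for missing data flags are omitted. 
* P < .05, two-tailed test. 

Closeness to parents has the expected direction, but is not significant 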
in this model, while perceived parental disapproval of sexual activity 
decreases the relative risk by 20% for each unit change on the five-point 
scale. Adolescents in romantic relationships are more than twice as likely 
to experience intercourse, and exposure through previous dating relation- 
ships increases risk of sexual debut by about 16% for each additional 
previous romantic partner. Emotional commitment in the current rela- 
tionship increases the relative risk by 14% for each unit change on the 
five-point scale, while embeddedness of the relationship is not significant. 
Higher school attachment and higher grades decrease the risk substan- 
tially, as expected, while participation in school sports and self-esteem are 
not significant. Socially unpopular adolescents are less likely to have in- 
tercourse, while adolescents that are more physically mature than their 
peers are more likely to have intercourse. Higher religiosity decreases the 
risk of sexual debut. 

Pledging decreases the risk of intercourse substantially and indepen- 
dently. Pledgers’ relative risk of sexual initiation is estimated to be 34% 
lower than that of nonpledgers, even after controlling for a wide range of personal 
characteristics, religiosity, and other protective factors. This is the main 
pledge effect. It is robust to model specification. And it is observed even 
when the main correlates of pledging (see app. B) are in the model.” We 
discuss the pledge effect further below. 


GENDER INTERACTION EFFECTS 


We first discuss the results for nonblack adolescents (table 5).* In contrast 
to our expectations, family type and normative parental influences are 
not different for boys than for girls. While most interactions have the 


* For black adolescents (table 4) we observe few differentiating factors. The baseline 
time periods are fewer and longer for this sample, due to the smaller number of cases 
and events. The only significant factors here are being in a romantic relationship and 
GPA. The pledge does not make a difference. 

2% Table 5 reports results for main and interaction effects in two separate columns for 
ease of comparison; i.e., we do not estimate a separate model for males and females. 
The first column in table 5 reports main effects, the second column interaction effects 
with gender. Thus, the effect for boys can be seen in col. 1, whereas effects for girls 
are obtained by adding cols. 1 and 2 in each panel. The models reported in table 5 
allow for gender differences in the effect of all independent variables on sexual debut, 
whereas the models reported in table 4 constrained the effects to be the same for boys 
as for girls. Please note that, for reasons of space, we omit the coefficients for the 
baseline rate in tables 5 and 6. 

TABLE 5 
TRANSITION TO FIRST INTERCOURSE: GENDER INTERACTION EFFECTS FOR WHITES, 
ASIANS, AND HISPANICS 

                                      DIRECT EFFECTS               GENDER INTERACTION 
INDEPENDENT VARIABLES             Relative Risk   95% CI        Relative Risk   95% CI 
Female                               14.70*      2.09-100.03 
Hispanic                              1.53*      1.12-2.09          .47*       .31-.71 
Asian                                  .78        .46-1.31          .75        .37-1.53 
Single-parent family                  1.37       1.00-1.87          .89        .60-1.33 
Step- or foster family                1.45*      1.07-1.95          .76        .52-1.12 
SES                                   1.00        .95-1.06          .92*       .86-.99 
Closeness to parents                  1.00        .79-1.26          .86        .75-1.00 
Perceived parental disapproval         .86*       .75-.99           .91        .75-1.10 
In romantic relationship              2.54*      1.51-4.28         1.51        .76-3.01 
N romantic partners                   1.24*      1.11-1.38          .91        .79-1.04 
Emotional commitment                  1.17*      1.03-1.33          .95        .81-1.12 
Embeddedness in peer group             .86        .71-1.05         1.12        .87-1.43 
School attachment                      .74*       .63-.88          1.19        .94-1.50 
Unpopular in school network            .87        .64-1.17          .86        .58-1.27 
Pubertal development                  1.30*      1.08-1.57          .86        .69-1.08 
Add Health verbal test score          1.00        .99-1.01         1.00        .99-1.01 
GPA                                    .86*       .72-1.02         1.04        .83-1.30 
School sports participation           1.01        .90-1.13          .86*       .79-.93 a 
Self-esteem                           1.28*      1.11-1.46 a        .75*       .63-.89 
Religiosity                            .81*       .72-.91          1.06        .91-1.24 
Pledge                                 .50*       .31-.80          1.37        .80-2.35 

NOTE. N of cases = 5,679 (3,054 female). N of events = 690 (412 female). χ² (df) is 472 (60). CI = 
confidence interval. Coefficients for baseline rate are omitted. 
a 90% confidence interval. 
* P ≤ .05 (one- or two-tailed test). 


expected direction, they do not reach significance. Higher parents’ SES, 
however, delays intercourse only for girls, not for boys.*! 

Although girls report having more relationships, are more likely to be 
in a relationship, and report higher emotional commitment and embed- 
dedness in the relationships they are in, romantic involvement has the 
same effect on both sexes. Participation in school sports is significant only 
for girls. Self-esteem has a positive effect on boys’ sexual debut, whereas 
for girls, the positive effect of self-esteem is reduced to zero by the negative 
interaction effect. 
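
Because main and interaction coefficients add on the log scale, the relative risk for girls is the product of the two relative-risk columns. As a small worked check, using the self-esteem figures as they can best be read from table 5 (1.28 for the main effect and .75 for the interaction); the function name is hypothetical.

```python
import math


def combined_relative_risk(main_rr, interaction_rr):
    """Relative risk for the interacted group.

    Adding the two coefficients on the log scale is the same as
    multiplying the two relative risks.
    """
    return math.exp(math.log(main_rr) + math.log(interaction_rr))


# Self-esteem: effect for boys (main effect) and implied effect for girls.
boys = 1.28
girls = combined_relative_risk(1.28, 0.75)
print(boys, round(girls, 2))
```

The product is close to 1, which is the sense in which the positive effect of self-esteem is reduced to zero for girls.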

For black respondents (table 6), the positive effect of pubertal devel- 
opment is greatly reduced for girls. Sports participation increases the 


* For girls, higher SES may indicate a more protective environment; for boys, higher 
SES may provide them with useful resources for attracting adolescent females, e.g., 
cars (table 5). 
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TABLE 6 
TRANSITION TO FIRST INTERCOURSE. GENDER INTERACTION EFFECTS FOR BLACKS 











Direct EFFECTS GENDER INTERACTION 
INDEPENDENT VARIABLES Relative Risk 95% CI Relative Risk 95% CI 
Female 2... ouesse 0 oo cono nn 5.56 08-393.92 
ASAD ira ias e i ; Tate 
Single-parent family . . ........... 1.27 65-2 49 1.02 442.34 
Step- or foster family ... .......... 141 .66-3 03 96 36-253 
O r Naa 1 04 91-1.19 91 .78-1.07 
Closeness to parents ..... o... .. 119 62-2.29 64 31-1 33 
Percetved perental disapproval ... 94 .69-1.28 1.01 .65—1 59 
In romantic relationship . ...... . 8.80” 2.48-31.15 32 .07-1.46 
N romantic partners ........ .... 85 56-1.30 134 842.15 
Emotional commitment . ......... 72” 52-.99 143° 1 17-1.75* 
Embeddedness in peer group ..... 1.03 .64—1 65 91 51-1.63 
School attachment . ........... 3 104 67-1.62 1.03 .60-1.75 
Unpopular in school network ..... 1.03 56-1.92 56 .26-1.20 
Pubertal development ... ...... 1.42° 1 15-1.74" 66" 52-84" 
Add Health verbal test score ..... 98° .96—1 00 1.02 .99-—1.04 
GPA, rin aa 82 55-121 85 51-1 41 
School sports participation . ...... 1.377 1.07-1.76 69" 50-95 
Self-esteem . 0.0. cece ooo rencor. o 75 40-1 44 1.53 69-3.38 
Religiosity ...................... 1.06 .80-1.41 .78 .55-1.11
Pledge ........................... .77 .35-1.73 1.42 .56-3.59


NOTE.—N of cases = 997 (610 female); N of events = 152 (97 female); χ² (df) is 58 (45). CI = confidence interval.


likelihood of sexual debut for boys but not for girls. The most striking 
result is the gender difference in terms of black adolescents’ emotional 
commitment (see figs. 1 and 2). In contrast to all other groups, emotional 
commitment within a relationship decreases the transition rate for black 
male adolescents. One interpretation of the impact of emotional com- 
mitment on black males’ sexual behavior may be a “double standard” 
common in populations with a large excess of females over males: that 
is, one does not love the women one has sex with and vice versa (Guttentag 
and Secord 1983). This finding has implications for the pledge move- 
ment, which assumes that sex is the expression of romantic love. That 


* We find the same result when analyzing a more inclusive sample, starting the
observation when retrospective data on relationships begin, namely 18 months
before the first in-home interview. However, for this sample we cannot include a large
part of the protective factors included in the model reported in tables 2 and 3 since
measurement occurred after we observe these relationships.
FIG. 1.—Estimated effect of emotional commitment on black males (based on results
reported in table 6 below; other covariates are evaluated at means/modes). 


this is not true for black males suggests one of the reasons why we cannot 
observe a pledge effect in our sample of that group. 

In contrast to our hypotheses (see table 1), we find little evidence for 
gender differences in the effect of social control. Most important for this 
article, there are no significant gender interactions with respect to the 
pledge, which works for white, Hispanic, and Asian adolescent males and 
females in the same way. The key finding for blacks is the inversion of 
the role of emotional closeness in relationships observed for black males. 
We return to this below. 


THE PLEDGE EFFECT IN CONTEXT 


It is important to examine the mediating role of school context in shaping 
the impact of the pledge. We first report a new baseline model in table 
7 (model 1) for nonblacks and table 8 (model 1) for blacks, which contains 
only gender interaction effects that turned out to be significant in the 
models reported in tables 5 and 6.* We then add the interaction terms 


* In these analyses, we include only significant gender interaction effects. Excluding
nonsignificant interaction terms from the model does not reduce model fit. Including
all interaction effects introduces collinearity (note that the SEs in the model for blacks
are already affected by collinearity, as can be seen by the huge confidence interval for
FIG. 2.—Estimated effect of emotional commitment on black females (based on results
reported in table 6; other covariates are evaluated at means/modes). 


described below in the second model reported in tables 7 and 8. We first 
distinguish schools with respect to their providing the focus for adolescent 
friendship and romantic relationships. For all groups, we can observe a 
significant three-way interaction between pledging, social closure in the 
school, and %pledgers of the same sex in the school. We focus the dis- 
cussion on the results for nonblack adolescents (table 7, model 2). The 
results for blacks (table 8) are similar to those discussed below but more 
difficult to ascertain since the interaction terms are significant only at the
.10 level. The effect is complex.

In socially open schools—that is, schools where less than 83% of friendship
and romantic ties to the opposite sex are sent within the school


the coefficient for female). For reasons of space and ease of interpretation, we report 
only the estimates for the reduced model specification. 

* Introducing only interaction effect terms for pledging and percent same-sex pledgers
results in nonsignificant coefficients. The context effect of pledging thus depends on
whether social relations are largely confined within the school. This result is not substantively
sensitive to the cutoff point chosen for social closure, but the estimates get
smaller when using a lower cutoff and greater when using a higher cutoff. We present
the full interactional model in tables 7 and 8, including the estimates for nonpledgers.
When restricting the model to the context effect for pledgers only, the interaction effects
yield a significant improvement of the model fit (χ² = 8; df = 3; P = 0.04). Excluding
the nonsignificant parameters does not affect the estimates for the significant interaction
effect parameters.
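The improvement in fit quoted in this note can be checked by hand. The following is a minimal sketch, assuming the test is an ordinary likelihood-ratio chi-square with 3 degrees of freedom; the function name is ours, and the closed-form expression used here holds only for df = 3.

```python
import math


def chi2_sf_df3(x: float) -> float:
    """Upper-tail probability of a chi-square variate with 3 degrees of freedom.

    For df = 3 the survival function has the closed form
    erfc(sqrt(x/2)) + sqrt(2x/pi) * exp(-x/2).
    """
    if x <= 0:
        return 1.0
    return math.erfc(math.sqrt(x / 2.0)) + math.sqrt(2.0 * x / math.pi) * math.exp(-x / 2.0)


# Test statistic as printed in the note (rounded to a whole number there).
p_value = chi2_sf_df3(8.0)
print(f"P = {p_value:.3f}")
```

With the statistic rounded to 8, the tail probability comes out a little under .05, which is in line with the significance reported in the note; the exact printed P value would depend on the unrounded statistic.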
TABLE 7
TRANSITION TO FIRST INTERCOURSE: WHITE, ASIAN, AND HISPANIC RESPONDENTS

                                   MODEL 1                     MODEL 2
                                   Relative Risk   95% CI      Relative Risk   95% CI
Age (years):
12.5 up to 14 ..... Se. aa 01” = .000-03 01” 00- 03 
14 up to 14.5 2... oo cece eee 027 01-.07 027 .01-.06 
14.5 up to 15 moco ee o3” 01-.09 027 01- 08 
15 up to 155 ........ 027 01-.07 02" O1-.06 
15 5 upto 16 2. ck cece eee 037 01-09 02" .01- 08 
16 up to 16.5 ..... 037 01-10 037 01-09 
165 up to 17... wc oo eee 02" 01-08 02” 01-07 
17 up to175 2... ... hae «ra 03” .01-.11 03” 01- 10 
AN A So 047 01-.13 047 01-.13 
18 up to 18.5 . aes oo co... 047 01-.14 04” 01-13 
18.5 and older ....... TORTNI 047 01-13 047 01-,13 
Female coo its ets 4.927 183-1326 5067 1.87-13.67 
Hispanic a... ao o essers oo encino 1.427  105-1.92 144° 1.06-1.95 
Hispanic female ....... 537 36-.77 517 35-76 
Asta oo... cc. canon OS 667 46-96 66" 46—.96 
Single-perent family ....... o. reres 1267 103-153 1.277 1.041,54 
Step- or foster family ... ........ .... . 1.227  1.00-1.48 121° .99-1.47 
A G-dzebsetenscad wo oe dia 1.00 951.05 1.00 95~1.05 
SES x female 0... ....... 937 .87- 99 937 .87-.99 
Closeness to parents ...... o... o 0.0... .90 78—1.03 89° 83.95" 
Perceived parental disapprovel 82” .74—.90 82” .14—.90 
In romantic relationship 326° 233-457 328° 2344.60 
N romantic partners .. .......... 116” 1,00-124 1.16" = 1.09-1.23 
Emotional commitment ...... . ........ 1.147 1.05-1.23 1.147 = 1,05~1.23 
Embeddedness in peer group -93 82-1.04 -93 821.04 
School attachment .... ...... 837 74-93 83” .74-.93 
Unpopular in school network ....... 30” 65.99 81” 66-99 
Pubertal development .......... 0. aeree 116” 105—1.29 1.167 105-129 
Add Health verbal test score .... . 99 .99—1 00 1.00 99-1.00 
GPA ........ dodo és 887 79-98 88" 79-.98 
School sports participation . 99 .89-1.11 99 .89-1.11 
Sports x female . ........ dana 88 751.03 88" 81-.95* 
Self-esteem 00... 0... ereere 1.257 .97-1.60 1.25" 1.10-1.42* 
Self-esteem x female .. ....... .... 79 59-1 06 .78° 67-91" 
Religiosity .. 84” 78-91 347 78-91 
Pledge 0.02. eee 647 51-81 1.00 591.68 
Socially closed school 1.09 .82-1.44
Pledge in socially closed school 397 17-90 
%pledgers (same sex) ............ 1.00 .99-1.02
%pledgers x closed school ...... 1.00 .98-1.02
Pledge x %pledgers . 98° .96—.99* 
Pledge x %pledgers x closed school 1057 1 01-1.10 


NOTE.—N of cases = 5,679; N of events = 690; χ² (df) is 464 (44) for model 1; for model 2, 468 (50).
CI = confidence interval.
* 90% confidence interval.
* P < .05, one-tailed test.
** P < .05, two-tailed test.




TABLE 8 
TRANSITION TO FIRST INTERCOURSE: BLACK RESPONDENTS 











MODEL 1 MODEL 2 
Relative Relative 
Risk 95% CI Risk 95% CI 
Age: 
12.5 up to 14.5 essees o ceeee ee oe 047 00- 36 047 .00—.40 
14.5 up to 15.5 0.0.0... esere eee 057 01-.41 05” 01-46 
15.5 up to 16.5 0... esses o ereere 06” 01-51 06” 01-55 
16.5 and older ..... ccc... o ceceeeeee ees 087 .01-.67 07” .01-.68 
Female a anien neda dress dci 1.98 537.40 214 553 34 
Single-parent family ....... . o.0moomooo. 1.27 .87-1.87 1.28 .87-1.89 
Step- or foster family... . .omcommocoo...- 1.30 -82-2.05 1.28 80-2.02 
SRS ii deal deco aa aaa 98 .91-1.05 97 91—1.04 
Closeness to parents . .......... 5 cee eee 87 65-1.16 86 .64-1.16 
Perceived parental disapproval ......... .94 .75-1.18 .93 .75-1.17
In romantic relationship 3.84” 1.93-7.61 3827 1.92-7.63 
N romantic partners .... . nases > 1 06 89-1.27 1.05 88-127 
Emotional commitment ..........o o. 0... 81° 73-91" 81 73-91" 
Emotional commitment x female ....... 1.207 1.00-1 44 120” 1.00-1 44 
Embeddedness in peer group ........ - 98 75-1.29 99 .75-1 30 
School attachment ...... o 00cooooooo conoce 1.06 83-1.35 1.04 81-1.33 
Unpopular in school network ...... . ... 12° 59— 88* TP 58-.86* 
Pubertal development . .....0oooooccccooo. 131 89-1 94 1.34 .90-2.00 
Pubertal development x female .. ...... 73 AT-1 14 70 45-1 11 
Add Health verbal test score .......... 99 .98-1.01 99 .98-1.01 
GPA aone e sete decd caches 17” .60- 99 To .60— 99 
School sports participation .. 1317 1.03-1.66 1317 1.03-1.66 
Sports x female . ... 2.2.0... coo erer 147 55-101 76° 65-89" 
Self-esteem o. csecscvsessesdesseceucesens 96 .66-1.39 96 66-1 39 
Religiosity ............................ .89 .76-1.05 .90 .76-1.06
Pledge ................................. 1.02 .69-1.52 1.51 .57-4.00
Socially closed school ................. 1.79 .74-4.32
Pledge in socially closed school ......... 21° 09-51" 
%pledgers (same sex) ................... 1.01 .98-1.05
%pledgers x closed school ... ....... ... 94° .88-1.01 
Pledge x %pledgers . 98 .92-1.04 
Pledge x %pledgers x closed school . 1.10° 1.05—1.16*


NOTE.—N of cases = 997; N of events = 152; χ² (df) for model 1 is 52 (30); for model 2, 55 (36).

* 90% confidence interval.
* P < .05, one-tailed test.
** P < .05, two-tailed test.
FIG. 3.—Transition rate for pledgers and context, by school type. (Horizontal axis: pledgers of respondent's sex in school, %.)


itself—the delay effect of pledging on an individual's transition rate depends
on how many other students of the same sex have pledged. Where
no other pledgers were present at the time of the first in-home interview,
pledgers transition at the same rate as others. In short, there is no delay
effect. The more other pledgers are present in the school, however, the
later pledgers initiate intercourse. Specifically, for nonblack adolescents
(table 7, model 2), for every additional percentage point of same-sex pledgers, the rate of
transition to first intercourse is reduced by 2%. In socially open schools,
pledging delays intercourse only if there is an interacting community of 
pledgers. 
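The arithmetic behind this statement can be sketched as follows. This is an illustration only: it assumes the relative risks combine multiplicatively, as in a hazard model with exponentiated coefficients, and it uses the rounded values printed in table 7, model 2 (pledge = 1.00; pledge x %pledgers = .98). The function and constant names are ours.

```python
# Relative risks as printed in table 7, model 2 (rounded values).
RR_PLEDGE = 1.00          # pledge main effect
RR_PLEDGE_X_PCT = 0.98    # pledge x %pledgers, per percentage point of same-sex pledgers


def pledger_rr_open(pct_pledgers: float) -> float:
    """Transition rate of a pledger relative to a nonpledger in the same
    socially open school, given the percentage of same-sex pledgers.

    Terms shared by pledgers and nonpledgers (e.g., the %pledgers main effect)
    cancel in this ratio and are therefore left out.
    """
    return RR_PLEDGE * RR_PLEDGE_X_PCT ** pct_pledgers


for pct in (0, 10, 20):
    print(f"{pct:>2}% same-sex pledgers: relative rate {pledger_rr_open(pct):.2f}")
```

At 0% the relative rate is 1, that is, no delay effect; each additional percentage point multiplies the pledger's rate by .98, so the reduction compounds rather than adding up linearly.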

We call schools where the majority of cross-sex friendship and romantic 
relationship ties are sent to other students in the school “socially closed.” 
For socially closed schools (30% of the schools in our sample), where social 
relations are largely confined within the school, the opposite obtains. 
Where no other pledgers are present, pledgers are much less likely than 
others to experience sexual debut. Where many other pledgers are present 
in these schools, pledgers’ transition rate is higher than that of pledgers 
in schools with few pledgers. These findings are illustrated in figure 3, 
which reports the estimated relative effect of %pledgers on the transition 
rate for nonblack pledgers in both school types. 

For adolescents in schools with more than 30% pledgers, the relative
change in the baseline rate turns from negative to positive, indicating
a threshold for pledging’s effectiveness.* Once the pledge becomes normative,
it ceases to have an effect. The pledge identity is meaningful,
consequently, only if it is a minority identity, a common situation for
identity movements.* In socially open schools, all identities can be in the
minority.
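The location of this threshold can be approximated from the printed coefficients. The sketch below is an assumption-laden illustration, not the authors' calculation: it combines the rounded relative risks from table 7, model 2 multiplicatively (pledge = 1.00; pledge in socially closed school = .39; pledge x %pledgers = .98; pledge x %pledgers x closed school = 1.05), and all names are ours.

```python
import math

# Relative risks as printed in table 7, model 2 (rounded values).
RR_PLEDGE = 1.00
RR_PLEDGE_CLOSED = 0.39          # pledge in socially closed school
RR_PLEDGE_X_PCT = 0.98           # pledge x %pledgers
RR_PLEDGE_X_PCT_CLOSED = 1.05    # pledge x %pledgers x closed school


def pledger_rr_closed(pct_pledgers: float) -> float:
    """Rate of a pledger relative to a nonpledger in the same socially closed school."""
    per_point = RR_PLEDGE_X_PCT * RR_PLEDGE_X_PCT_CLOSED
    return RR_PLEDGE * RR_PLEDGE_CLOSED * per_point ** pct_pledgers


def crossover_pct() -> float:
    """Percentage of same-sex pledgers at which the relative rate reaches 1."""
    per_point = RR_PLEDGE_X_PCT * RR_PLEDGE_X_PCT_CLOSED
    return -math.log(RR_PLEDGE * RR_PLEDGE_CLOSED) / math.log(per_point)


print(f"relative rate with no other pledgers: {pledger_rr_closed(0):.2f}")
print(f"crossover at about {crossover_pct():.0f}% same-sex pledgers")
```

Under these rounded inputs the relative rate starts well below 1 and crosses 1 a little above 30% pledgers, which agrees with the threshold described in the text.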

The pledge movement as an identity movement and the pledge effect 
as an identity effect would be consistent with an empirical observation 
that the effect of pledging approaches zero with increasing proportion of 
pledgers in a school. Instead, we observe a reversal of the effect, that is, 
a positive effect of pledging on the hazard rate. To interpret this reversal,
recall that the rate is a “temporally local” dimension.* Recall that the
same dynamic appears earlier in this article as well. Early gender differences
and later convergence in the survivor functions for the transition
to first sex (fig. A1) necessarily translate into different estimates of the
gender effect on the baseline rate depending on which ages we look at.
When looking at a sample that includes early transitioners, females are 
expected to have a lower transition rate than males. When looking at 
later ages, the rate for females is higher than for males—females have to 
“catch up” with males in order for the proportions to converge, as they 
do in figure A1. In the same way, pledgers have to “catch up” with non- 
pledgers, resulting in a transition rate that is higher than that of 
nonpledgers.* 

Consider by analogy two tubs of water that both have to become full
(i.e., the eventual transition to first sex). If one faucet is running at full
blast, its tub fills up quickly. The other faucet runs more slowly, so at a certain
time it is less full than the first. But, in order for both tubs to be full at 
roughly the same time, the second faucet needs to release water more 
quickly at some time. This faster release of water to achieve parity is the 


** There are 14 schools in our sample with 30% and more pledgers (i.e., about 10% of
all schools contained in this sample). These 14 schools, eight of which are socially
closed schools, contribute 348 adolescents to the sample. This results in 141 adolescents
in socially closed schools with 30% and more pledgers in them. Of these 141 persons,
85 are pledgers. Socially open schools with 30% and more pledgers contribute
206 adolescents, 136 of whom are pledgers.

* The lack of a similar threshold for socially open schools may simply mean that we
capture the relevant social context more effectively in socially closed schools, since the measurement of the
context effect is more precise when social relations are largely taking place within the
school.

* The classic illustration of this property of a rate is the comparison between the
volume of water in a container (i.e., the proportion of units experiencing some event)
and the speed at which the water filling the container leaves the faucet (the transition
rate).

* In the next section, we test for such age dependencies of the effect of pledging and
other covariates on the transition rate. 
positive effect on the baseline rate. Substantively, pledgers in schools with
many pledgers would not transition to sex more quickly if they were not 
already delayed. 
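The tub analogy can be made concrete with a small numerical sketch. The hazard values below are hypothetical and chosen only for illustration (they are not estimates from this article); the point is that a group with a lower early rate must show a higher later rate if both groups are to end up with the same proportion having experienced the event.

```python
import math


def survivor(hazards, widths):
    """Proportion still event-free after a sequence of intervals with
    piecewise-constant hazards: S = exp(-sum(hazard * width))."""
    cumulative_hazard = sum(h * w for h, w in zip(hazards, widths))
    return math.exp(-cumulative_hazard)


# Two intervals of three years each (hypothetical).
WIDTHS = (3.0, 3.0)
FAST = (0.20, 0.20)   # "faucet at full blast": constant rate
SLOW = (0.10, 0.30)   # delayed group: low rate early, higher rate later

# After the first interval the delayed group is clearly behind ...
print(f"event-free after interval 1: fast {survivor(FAST[:1], WIDTHS[:1]):.2f}, "
      f"slow {survivor(SLOW[:1], WIDTHS[:1]):.2f}")
# ... but both tubs are equally full at the end.
print(f"event-free after interval 2: fast {survivor(FAST, WIDTHS):.2f}, "
      f"slow {survivor(SLOW, WIDTHS):.2f}")
# The rate ratio (slow relative to fast) flips from below 1 to above 1.
print(f"rate ratio early {SLOW[0] / FAST[0]:.1f}, late {SLOW[1] / FAST[1]:.1f}")
```

A rate ratio above 1 in the later interval therefore does not mean the delayed group ends up ahead; it only reflects catching up from a lower starting level.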

Our data are limited in a specific way: we do not observe the process 
of pledging in schools over time, and the data are right censored and left 
truncated. The best we can do with the data is to estimate a proportion 
of pledgers at a specific point in time and observe what happens after 
that. If we could operationalize context in a time-varying fashion we
would most certainly observe a flattening-out of the context effect, because
we could observe individuals and the movement as they grow together.*
In the data that we have, we observe teens in socially closed schools with
many pledgers on the average in a phase in which they are catching up
with nonpledgers.* Note, however, that figure 3 assumes a proportional
hazard rate. What we can do is let the context effect vary across
different ages. When we relax the proportional hazard assumption, we 
do see a decline of the school type interaction effect (see table 9).* 


* We tested for nonlinearity in the context effect by breaking up the distribution into
sets of dummy variables (e.g., 0 up to 15%, 15 up to 30%, 30% up to 40% pledgers,
above 40%). We tried out different cutoffs for the groups but there was no indication 
that the effect turns around. The model fit did not increase, and the dummy variables 
indicating high percentages of pledgers were significant in interaction with school type. 
* AJS reviewers also suggested we use “social closure” as a continuous variable rather 
than as a dummy variable with an arbitrary cutoff point and to represent the resulting 
three-way interaction between pledging, context, and school closure as a three-dimensional
plot. Unfortunately, the social closure measure (proportion
of opposite-sex friendship ties within school) is not normally distributed (most teens
are in schools with very few outside ties). In addition, the use of a continuous variable
for the three-way interaction implies a linear association of social closure with the
interaction effect of pledging and proportion pledgers, an assumption that does not
hold in these data (see n. 22 above for the choice of a cutoff point). It was also suggested
to employ a “more naturally” dichotomous school characteristic such as urban or 
regional characteristics as a substitute for the social closure measure. There is no one- 
to-one correspondence between rural settings or location and social closure, however, 
and it seemed preferable to use a concept that has direct bearing on the nature of the 
underlying sociological mechanism rather than a categorical proxy that introduces 
both measurement error (since there are socially open schools even in nonurban settings 
and in the South, for example) and conceptual ambiguity. Although characterizations
of the environment such as urbanicity and region are associated with the outcomes 
studied in this article, in the multivariate context of the analysis shown they do not 
contribute to any of the models, most likely because we control for a great number of 
intervening variables such as individual and family characteristics. 

* This is the closest that we can come to modeling the process. Reviewers suggested
using the estimates of the odds of pledging obtained in the logistic regression reported
in appendix B instead of the observed pledge as a covariate in the hazard models,
even though these estimates would be based on a different sample. The idea is to rule
out the possibility of correlated errors resulting from the fact that pledging in schools
where pledging is common is in part a measure of being in such a school. In other
words, the proportion of pledgers in the school would be an indicator of the extent of 
For adolescents who have not taken a pledge, in contrast, the school
context effects are not significant. That is, their transition rate is not 
affected by the presence of pledgers in the school. This suggests that the 
alternative interpretation of the threshold effect we identify in socially 
closed schools—that once there are too many pledgers in a school, others 
concentrate their romantic efforts on trying to seduce pledgers since there 
are too few alternative outlets (a sort of opportunity argument)—cannot 
be accurate. If this were true, we would expect to observe an effect of 
the proportion of pledgers on nonpledgers. The coefficient for the effect 
of the prevalence of pledging on nonpledgers, however, not only lacks 
significance but is also very small. 

The mechanisms by which the pledge operates are revealed through 
this contextual effect. If there are no pledgers, there is no pledge effect. 


“unobserved heterogeneity” among pledgers, but only in some schools and not in others.
It is a well-established finding that as a movement grows, members will be more
heterogeneous with respect to their backgrounds and orientations. For the unobserved
heterogeneity interpretation to be consistent with the reported results, however, social 
closure has to be interpreted as an indicator for heterogeneity of pledgers (albeit only 
when many pledgers are around). Hence, we also need to find some property of the 
school that accounts for both widespread pledging of heterogeneous students and a 
tendency for friendship ties to be organized within the school. We have tried to rule 
out the presence of any such property we could think of in terms of characteristics of 
students, parents, and schools. For example, small schools may have both more het- 
erogeneous students (in spite of residential segregation) and be socially closed. Con- 
versely, large schools may have a more heterogeneous but larger student body and 
thus could have a critical mass of homogenous pledgers and be socially open. However, 
socially closed schools are on the average slightly bigger than socially open schools.
In addition, the school type-context interaction effect cannot be reproduced/erased 
with a school size-context interaction effect. But beyond this, the proposed strategy 
cannot work for our analyses. The reason why we used observed rather than predicted 
pledging in the sexual debut models is a time-ordering issue. Recall that the strongest 
predictors of pledging are characteristics that vary over time (being in a romantic 
relationship, number of partners, and, most important, proportion of pledgers in the 
school). Because we have no information on the timing of pledges, we cannot use these
predictors for estimating the odds of pledging for adolescents in the sample used for
predicting the timing of sexual debut because we do not know their values. Imagine
further that one used the current values (e.g., romantic involvement, proportion of
pledgers) of the determinants of pledging to calculate the predicted log odds of pledging
for our debut models. Since all the variables predicting intercourse are also in the
pledge prediction equation, by definition one throws away the unique contribution
pledging makes because the predicted score would consist only of the contribution the
other covariates make to pledging. Since we control for all of these in the sexual debut
model, the predicted odds of pledging should not have an effect by definition. Unobserved
heterogeneity of pledgers, in a way, is exactly what makes them pledgers and
different from others. Of course, in the debut models, this is not technically unobserved
since we measure it as pledge. But it is unobserved in the pledge model: it is exactly
the part we would throw away if we used the predicted odds. This would not be the
case if we had measures of the predictors of pledging before pledging occurs. But our 
data do not allow us to do this. 
TABLE 9
TRANSITION TO FIRST INTERCOURSE: NONBLACK RESPONDENTS, MODEL WITH PERIOD-SPECIFIC EFFECTS

                                      RELATIVE RISKS            EFFECT SIZE
                                   Early   Middle   Late        TENDENCY

Female co... o... ... es 1.07 3.83" 15.967 Increasing with age" 
Hispanic 22. oe. eeeeeeee 99 1.30 2.02" Increasing with age 
Hispanic female . . .............. 1.19 547 33” Increasing with age 
Asian ...... .. eh som deohcesteer ted 85 307 1.06 Middle 
Single-parent family 1.37 1.08 134 NS 
Step- or foster family... ......... 118 1.29" 1.06 Stronger early 
SES: iii Litas 93 99 1.05 NS 
SES x female ..... 97 95 887” Late 
Closeness to parents ...... 677 105 87 Early 
Perceived parental disapproval 787 847 817 Always 
In romantic relationship ........... 2.09 3.18" 4.17° Increasing 
N romantic partners .. 6. was 131% 1.18” 1.09 Decreasing 
Emotional commitment 1.08 1207 111 Middle 
Embeddedness in peer group 1.01 85" 102 Middle 
School attachment . 80° 807 87 Early and middle 
Unpopular in school network 91 88 77 Increasing 
Pubertal development .. 1417 1.12 1.10 Decreasing, early 
Add Health verbal test score ...... 100 99 1.00 NS 
GPA A oft Reed canes 75% 91 98 Decreasing, early 
School sports participation . . ........ 100 107 91 NS 
Sports x female ........ o ae aen 80 81° 1.05 Middle 
Self-esteem oe wee 1.17 1.05 166 Late 
Self-esteem x female ............. 1.28 A 557 Late 
Religiosity ..... a aesa rererere 1.00 80” 80” Middle and late 
Pledge AA 3.73 65° 109 Middle 
Socially closed school ... . 1.33 108 94 NS 
Pledge in socially closed school 127 50 37 Early 
%pledgers (same sex) . ............. 99 1.01 1.00 NS 
%pledgers x closed school . ..... .. 99 100 102 NS 
Pledge x %pledgers .. .......... .92% 98 98 Decreasing 
Pledge x %pledgers closed school +. 112" 1.05 1.04 Decreasing 


NOTE.—Stages of adolescence are early, 12.5 years up to 15.5 years; middle, 15.5 years up to 17.5
years; late, 17.5 years and older. N of events = 166 for early, 321 for middle, and 203 for late; N of cases
= 5,679; χ² (df) is 490 (96). Coefficients for baseline rate are omitted from the table. NS = not significant.

* For white females, this is a “catch up” effect.

* P < .05, one-tailed test.

** P < .05, two-tailed test.
If there are too many pledgers, the pledge effect disappears. Pledging will
work only insofar as it is nonnormative and constitutive of individual 
identity in the larger adolescent subculture. The upper bound on pledg- 
ing’s effectiveness provides a cautionary note to those interested in uni- 
versal social policies. We discuss this further below. 


RELAXING THE PROPORTIONAL HAZARD ASSUMPTION 


The models presented so far assume the effects of explanatory variables 
to be constant over time. There are reasons to assume, however, that some 
factors are more important early on in the process while others figure 
more prominently later on. 

We are especially interested in the effectiveness of the pledge across the 
course of adolescence. In the following analyses, reported in tables 9 and
10, we therefore relax the proportionality assumption and estimate a 
model in which the effect of all the independent variables is allowed to 
vary across time periods. This involves estimating a great number of 
parameters. To ensure that there are enough events and variation in the 
independent variables for each time period, we divide the observed age 
axis into three time periods, which roughly correspond to early (12.5 years 
to 15.5 years), middle (15.5 years to 17.5 years), and late adolescence (17.5 
and older). For black adolescents, we choose slightly different age periods 
that reflect the earlier onset of puberty and sexual behavior. The results 
are reported in table 9 for nonblacks and table 10 for blacks. The results 
for blacks are based on a rather small number of events and should be 
interpreted with care. 
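One common way to estimate period-specific effects of this kind is to split each adolescent's observation window at the period boundaries, so that every resulting spell falls entirely within one period. The article does not describe the estimation software or data layout, so the sketch below is an assumption about how such a setup could look; the function name and record format are hypothetical, and the boundaries are those given above for nonblack adolescents.

```python
# Period boundaries for nonblack adolescents, in years of age.
PERIODS = (
    ("early", 12.5, 15.5),
    ("middle", 15.5, 17.5),
    ("late", 17.5, float("inf")),
)


def split_episode(start_age, end_age, event):
    """Split one observation window into period-specific spells.

    Returns a list of (period, spell_start, spell_end, event_in_spell) tuples.
    The event, if any, is attributed only to the spell in which observation ends;
    earlier spells are treated as censored at the period boundary.
    """
    spells = []
    for name, lower, upper in PERIODS:
        spell_start = max(start_age, lower)
        spell_end = min(end_age, upper)
        if spell_start < spell_end:
            spells.append((name, spell_start, spell_end, bool(event) and spell_end == end_age))
    return spells


# Hypothetical respondent: observed from age 14.0, first intercourse at 16.2.
for spell in split_episode(14.0, 16.2, True):
    print(spell)
```

Each covariate can then be interacted with the period indicator, which is what allows its relative risk to differ across early, middle, and late adolescence.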

The direct gender effect increases with age as girls catch up with boys. 
Hispanic adolescents differ from whites mainly in middle and later ad- 
olescence, Asians in middle adolescence (table 9). Our expectation was 
that parents’ influence on the behavior of teens would be greatest in early 
adolescence and decline later as their children achieve more independence 
and autonomy. For nonblack adolescents, this hypothesis is borne out. 
Closeness to parents has a significant effect only in early adolescence. 
Recall that it was not significant in the simpler model reported in table 
7. With respect to family type, the results are less conclusive although 
there is a tendency for the effect size to be stronger in early and middle 
adolescence. For girls only, parents’ SES has significant effects only in 
late adolescence. Perceived parental disapproval of sex, however, is a 
strongly delaying factor throughout adolescence (table 9). The time- 
variant influence of family type is more pronounced for black adolescents 
(table 10). The coefficients for closeness to parents and parental disap- 
TABLE 10
BLACK RESPONDENTS, MODEL WITH PERIOD-SPECIFIC EFFECTS

                                      RELATIVE RISKS            EFFECT SIZE
                                   Early   Middle   Late        TENDENCY
Female ; ir 180 1.83 2.20 (Decreasing with age) 
Single parent family.» E a E a 107 1.01 Early 
Step- or foster family .... 3367 84 89 Early 
SES... . weer is. de 106 96 99 NS 
Closeness to parents ..... eer $ 85 100 89 NS 
Perceived parental disapproval ...... 83 99 102 (Decreasing) 
In romantic relationship ... ......... 4.987 3267 6187 Increasing 
N romantic partners ......... ccoo...» 1.16 113 76 NS 
Emotional commitment... .. .94 .90 72 (Increasing) 
Emotional commitment x female 1.28 105 1.20 NS 
Embeddedness in peer group... . . 72 98 124 NS 
School attachment . ware "LO? 1.37 66° Late 
Unpopular in school, hewa X 62 .70 1.40 NS 
Pubertal development ........ 183° 99 1.49 Early 
Pubertal development x female ee As 90 75 Early 
Add Health verbal test score . 101 1.00 977 Late 
GPA .. . Kodak .90 .74 85 (Middle) 
School sports participation wae 132 135 1.827 Increasing 
Sports x female .... ae 61 .70 -66 NS 
Self-esteem .........ccccceseeeeeeeee eee 71 91 132 NS 
Religiosity ..................... 1.06 .84 .90 (Middle)
Pledge .. tho gift, creme ae ates 4% . 129 69 97 (Middle ) 


NOTE.—Stages of adolescence differ from nonblack respondents and are early, 12.5 years up to 15
years; middle, 15 years up to 17 years; late, 17 years and older. N of events is 38 for early, 66 for middle,
and 48 for late; N of cases is 997; χ² (df) is 68 (69). Coefficients for baseline rate are omitted from the table.
NS = not significant.

* P < .05, one-tailed test.

** P < .05, two-tailed test.


proval have the expected direction, are greater in early adolescence, but 
are not significant. 

For black adolescents and for others, the effect of being in a romantic
relationship is almost twice as large in middle and late adolescence as
in early adolescence. The effect of number of prior romantic partners
decreases. First sex happens in the context of a romantic relationship. 
Adolescents who date a lot at an early age will also have sex earlier than 
others. For nonblack adolescents, emotional commitment within rela- 
tionships and embeddedness of the relationship in the adolescents’ social 
world have the most pronounced effects in middle adolescence. 

School attachment is a protective factor in early and middle adolescence 
for nonblack adolescents. For blacks, in contrast, it is only significant and 
negative in late adolescence (table 10). The effect of social isolation is, 
compared to that in the model in tables 7 and 8, reduced in size (except 
for late adolescence), and not significant. As expected, pubertal development
is most important for young adolescents; each one-unit change
in the five-point scale is estimated to increase the transition rate by 41%, 
other things equal, while for young black males, it is almost twice as large 
(83%). As in the simpler proportional hazard model reported in tables 7 
and 8, cognitive ability is not related to the timing of first intercourse for 
nonblacks (but it does delay intercourse for older blacks). Both academic 
achievement and sports participation (for nonblack girls only) have the 
strongest delay effect in early and middle adolescence. The gendered pat- 
tern of self-esteem’s influence for nonblacks is pronounced only in late 
adolescence, when girls with high self-esteem are actually less likely to 
experience sexual debut than are girls with low self-esteem. This negative 
effect of self-esteem for girls is consistent with the idea that girls with a 
developed sense of self find it easier to refuse sex when negotiating ro- 
mantic behavior with their male partners (Martin 1996). The change of 
the effect over time and the gender interaction account for the fact that 
many studies do not find any effect of self-esteem on sexual initiation 
(Hofferth 1987; Brown and Theobald 1999). Religious adolescents are 
slower to experience first intercourse only in middle and late adolescence.


PLEDGING CONTEXT AND AGE DEPENDENCY 


The context interaction effect between pledging, %pledgers, and school 
type is extremely strong only in early adolescence (table 9).* In middle 
adolescence, the effect of pledging is “individualized,” while in late ado- 
lescence neither pledging nor the interaction effects are significant. When 
omitting the interaction terms, we obtain an estimated relative risk for 
pledgers of .79 (P = 0.28) in the early period, .49 (P = 0.0002) in the 
middle period, and .72 (P = .11) in the late period: only the coefficient 
for middle adolescence is significantly different from zero. This timing 
pattern is similar in the model for blacks, although none of the coefficients 
are significant. This age dependency further conditions the effectiveness 
of the pledge. It works, but only for a subset of adolescents in specific 
social contexts defined by the proportion of other pledgers. 
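
As a reading aid, the relative risks quoted above can be turned into percentage changes in the transition rate. The short sketch below does only that arithmetic; the function name is hypothetical, and the three relative risks are the ones reported in the text for the model without interaction terms.

```python
def percent_change_in_rate(relative_risk: float) -> float:
    """Percent change in the transition rate implied by a relative risk."""
    return (relative_risk - 1.0) * 100.0


# Relative risks for pledgers reported in the text (interaction terms omitted).
reported_relative_risks = {"early": 0.79, "middle": 0.49, "late": 0.72}

# Rounded to whole percentage points; negative values mean a lower rate.
rate_changes = {
    period: round(percent_change_in_rate(rr))
    for period, rr in reported_relative_risks.items()
}

for period, change in rate_changes.items():
    print(f"{period} adolescence: {change}% change in the transition rate")
```

Only the middle-adolescence estimate is statistically significant in the text, so the early and late figures should be read as point estimates with wide uncertainty.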

Overall, relaxing the assumption of time-constant effects yields different 
conclusions about the factors influencing sexual initiation. Focusing on 
age dependencies provides additional information about the effectiveness 
of various factors across age periods. The idea that individual attitudes 
and personality become more important as adolescents get older is borne 


* Because of convergence problems, we could not test the time dependence of the 
context effect for black adolescents (table 10). 
out by this model. It is noteworthy that parental disapproval of sex is
strong throughout adolescence.* For older adolescents, first sex is more 
likely to happen in the context of a romantic relationship. For nonblacks, 
the effect of pledging is highly context dependent in early adolescence, 
more individualized in middle adolescence, and probably absent or much 
smaller in late adolescence. Thus, while it may be unlikely that most 
pledgers will delay intercourse until marriage, pledging significantly de- 
lays intercourse for the majority of pledgers in early and middle 
adolescence. 


PROMISE KEEPERS AND PROMISE BREAKERS 


Critics of the pledge movement suggest that pledgers who break their 
promise and experience sexual intercourse will suffer negative psycho- 
logical consequences. To assess this criticism, we consider two scales that 
measure psychological well-being, the self-esteem scale used in the tran- 
sition models above, and an 11-item depression scale. For all groups except 
black males, teens who had first intercourse experience a loss of self- 
esteem compared to those who remained virgins (results not shown; tables 
available from the authors). Female pledgers experience an increase in 
self-esteem between waves, and the interaction effect between pledging 
and having intercourse is not significant for any group. We found no 
indication that changes in self-esteem associated with intercourse depend 
on the timing of intercourse or on age at the time of the first wave in- 
terview. Hence, pledgers do not lose more self-esteem when they break 
their promises than is experienced by nonpledgers who have first inter- 
course. The same obtains when using the depression scale, and in a mul- 
tivariate model controlling for relationship with parents, religiosity, school 
integration, academic orientation, network position, and romantic in- 
volvement. Thus, pledging has no negative consequences in terms of psy- 
chological well-being, even for teenagers who break their promise. 

We do find, however, an effect of pledging on the likelihood of using 
contraception at first intercourse. In this analysis, reported in table 11, 
we look only at teens who had first intercourse between waves. We use 
a set of predictors similar to that used in our model of first intercourse.“ 


% Parents’ own report of disapproval of their teens’ having sex is only weakly related
to adolescents’ sexual behavior (results not shown). Most adolescents do not really
know what their parents think. But what they believe their parents think is what
counts.

“Information on romantic relationship is averaged over the observation period (until 
first intercourse occurs). Alternative specifications (last or current relationship) yield 
the same conclusions. 
TABLE 11
LOGISTIC REGRESSION OF CONTRACEPTIVE USE AT FIRST
INTERCOURSE FOR NONBLACKS

                                   Odds
Variable                           Ratio     95% CI
Age .............................  1.05      .91-1.21
Age at first intercourse ........  1.00      .99-1.01
Female ..........................  1.10      .77-1.58
Hispanic ........................   .68*     .46-1.00
Asian ...........................   .34*     .14-.84
Single-parent family ............   .76      .49-1.17
Step- or foster family ..........   .68*     .50-.91
SES .............................   .95      .89-1.02
Parental closeness ..............  1.01      .75-1.34
Parental disapproval ............  1.04      .84-1.28
School attachment ...............  1.09      .87-1.38
Unpopular in school network .....   .88      .59-1.32
Add Health verbal test score ....  1.01      .99-1.02
GPA .............................  1.34*    1.10-1.64
Self-esteem .....................  1.05      .77-1.42
Religiosity .....................   .96      .31-1.14
Pledge ..........................   .65*     .43-.99


NOTE.—All variables were measured at wave 1, in-home interview, except where
noted. N = 1,329. χ² (df) is 56.11 (17) and is significant at the .01 level. SEs are
adjusted for clustering; weighted data.


Table 11 shows the results of a logistic regression of the probability of 
using contraception as a function of SES, relationship to parents, school 
integration, academic achievement, religiosity, and pledge status.“ Neither 
parents’ SES, nor age at first intercourse, nor cognitive ability is significant
in this model. On the other hand, GPA, which is conceptually
closer to academic aspirations, has a strong effect on contraceptive use. 
This suggests that contraceptive use is strongly associated with invest- 
ments adolescents make in school. Hispanics and Asians are less likely 
to use contraception than are whites. We found no indication of gender/ethnicity
interaction effects. The key finding for this article is the effect
of pledge status. The estimated odds for contraceptive use for pledgers 


* We have omitted black respondents from this model because we know the sample 
to be selective. In this sample, blacks are more likely than any other group to use 
contraception at first intercourse—about 75% of all black teens do, and the multivariate 
model used for blacks yields little information about blacks’ contraceptive behavior. 
With the exception of GPA, none of the predictors accounts for differences between 
users of contraception and others for blacks.
are about one-third lower than for others. Pledgers are less likely to be
prepared for an experience that they have promised to forgo. It is hard 
not to imagine the dissonance that would arise if they were contraceptively 
prepared. This suggests that pledgers, like other adolescents, may benefit 
from knowledge about contraception and pregnancy risk, even if it ap- 
pears at the time that they do not need such knowledge. 
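
To make "odds about one-third lower" concrete, the sketch below converts an odds ratio into a change in probability. The odds ratio of .65 is the pledge estimate from table 11; the baseline probability of 60% is purely an illustrative assumption and is not a figure from this article, and the function name is hypothetical.

```python
def apply_odds_ratio(baseline_prob: float, odds_ratio: float) -> float:
    """Probability obtained by multiplying the baseline odds by an odds ratio."""
    baseline_odds = baseline_prob / (1.0 - baseline_prob)
    new_odds = baseline_odds * odds_ratio
    return new_odds / (1.0 + new_odds)


PLEDGE_ODDS_RATIO = 0.65        # from table 11
ASSUMED_BASELINE_PROB = 0.60    # hypothetical nonpledger probability, for illustration only

pledger_prob = apply_odds_ratio(ASSUMED_BASELINE_PROB, PLEDGE_ODDS_RATIO)
print(f"nonpledger: {ASSUMED_BASELINE_PROB:.2f}, pledger: {pledger_prob:.2f}")
```

Because an odds ratio acts on odds rather than on probabilities, the implied gap in probability depends on the baseline chosen; a different assumed baseline would give a different gap.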


DISCUSSION 


Many adults are concerned about adolescent sex, and many adolescents 
are concerned as well. Many adults worry about the negative conse- 
quences arising from early transitions to sexual behavior. They seek ways 
to help adolescents negotiate the complex and contradictory messages they 
get about sex. The pledge movement is organized by adults for adolescents. 
It is designed to help adolescents say no thank you to sex. In general, if 
adolescents do say no thank you, they are better off. They have fewer 
health risks, and they feel better about themselves. Programs that work 
to delay first intercourse make a contribution to adolescent health. Even 
if we did otherwise, and even if some adolescents feel that they ought to 
do otherwise, they are better off waiting. This is especially true for girls, 
who are better able to negotiate safer sex the older they are.

Pledgers come from normative backgrounds. They pledge because they 
do not want to have sex. They do not need to pledge to avoid sex, but 
pledging helps them not have sex. On average, it reduces the baseline
rate of transition to sexual debut by 34%. A pledger with four romantic partners
has the same relative risk of sexual debut as a comparable nonpledger 
with no romantic partners. The pledge effect is not a selection effect. It 
is real and it is substantial. But it is not a panacea for all of our adolescents. 
It works best for younger adolescents. It works only in specific contexts. 
The limits of the pledge’s effectiveness provide insight into the mecha- 
nisms by which it operates. 

The key mechanism is that pledging creates a moral community. Pledg- 
ers participate in this community and take on a pledge identity. Their 
joint participation sustains this identity and makes it meaningful for their 
behavior. Movement organizers are well aware that their movement will 
be successful if it builds a new identity. Pledge paraphernalia available 
on the Internet provides visible markers of identity: pledge rings, pledge
music, pledge books. Pledgers need to be visible to others. This is because 
pledging outside a community of pledgers provides no protection for ad- 
olescents. Pledgers may pledge, but they cannot determine the effect of 
the pledge all by themselves. 

The importance of pledging for the pledge movement itself is significant. 
FIG. 4.—Estimated probability of pledging and school context


As reported in appendix B, the effect of other pledgers among same-sex
adolescents in each school on pledging is curvilinear. Up to a threshold
of around 40% pledgers, all other covariates of pledging considered at
their means and modes, the probability of pledging increases by 1% for
every additional 1% of same-sex pledgers in an adolescent’s school. At higher
levels of pledgers, the curve flattens out and the effect turns negative. 
This relationship is illustrated in figure 4. 
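
A curvilinear effect of this kind arises when a logistic model includes both the percentage of pledgers and its square. The sketch below illustrates the shape only. All three coefficients are hypothetical values chosen so that the curve peaks at 40%; they are not the estimates from table B1, and the function names are illustrative.

```python
import math


def pledge_probability(pct_pledgers, intercept, b_linear, b_squared):
    """Logistic probability with a linear and a squared term in %pledgers."""
    logit = intercept + b_linear * pct_pledgers + b_squared * pct_pledgers ** 2
    return 1.0 / (1.0 + math.exp(-logit))


def turning_point(b_linear, b_squared):
    """Value of %pledgers at which the quadratic log-odds curve peaks."""
    return -b_linear / (2.0 * b_squared)


# Hypothetical coefficients (assumptions for illustration, not estimates).
INTERCEPT, B_LINEAR, B_SQUARED = -4.0, 0.10, -0.00125

peak = turning_point(B_LINEAR, B_SQUARED)
for pct in (0, 20, 40, 60, 80):
    p = pledge_probability(pct, INTERCEPT, B_LINEAR, B_SQUARED)
    print(f"{pct:>2}% pledgers -> probability of pledging {p:.3f}")
```

With a positive linear coefficient and a negative squared coefficient, the probability rises up to the turning point and declines beyond it, which is the pattern the text describes.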

Pledging thus provides fuel for its own growth and success, up to a 
limit. This phenomenon of self-sustained growth, with an associated 
threshold, is common to identity movements. For an identity to be meaningful
(that is, to have a meaningful relationship to behavior), it must
be embedded in and sustained by an interacting population of persons. 
Public pledge events work to encode pledging into a moral community. 
The community is effective only if it has self-conscious recognition, which 
presumes minority status. 

The contextual effects we identify point to the mechanisms through 
which the pledge effect operates. When the pledge works, it works because 
pledgers are embedded in an identity movement. Like other movement 
identities, the pledge identity is relatively fragile and meaningful only in 
contexts where it is at least partially nonnormative. If most adolescents 
were to pledge, there would be no pledge effect on the transition to 
intercourse. 
Not surprisingly, as with most social movements, the pledge movement
has contradictory elements. Pledge rhetoric stresses making your own 
decision and getting away from the crowd. But the movement produces 
as a by-product of its core activity (public pledging) something even better 
than a crowd—specifically, a visible moral community. Because pledging 
generates a public moral community, it consequently provides
both a cognitive foundation for new identity (someone who has made a 
commitment) and the structural context (a community of interacting 
pledgers) in which identity can be articulated and practiced. The moral 
community of pledging is the key to its success and failure. 

Critics of the pledge movement are as concerned about adolescent sex 
as are the supporters of the movement. These critics are both wrong and 
right about the pledge. They are wrong when they think it does not work. 
But they are right when they think that it cannot work as a universal 
strategy. Of course, pledging will not work for adolescents who want to 
have sex. That is always true. But pledging does not work in some social 
contexts even for pledgers who do not want to have sex. If pledgers attend 
a school that is the focal point of their social relationships, and if too 
many other students in their school pledge, the pledge effect disappears. 
In these contexts, about one-third of schools in the Add Health sample, 
pledgers transition to first sex at the same rate as nonpledgers. Beyond 
this, if pledgers do have sex, they are less likely to be contraceptively 
prepared than nonpledgers. 


IMPLICATIONS FOR SOCIAL POLICY 


Individuals decide to pledge or not to pledge. The pledge movement 
attracts adults and adolescents who are pledge supporters. Pledgers do 
not want to have sex when they pledge, but they cannot determine the 
effect of the pledge all by themselves. In some contexts, if there are no 
other pledgers around, the pledge effect disappears. In other contexts, if 
there are too many pledgers around, the pledge effect disappears. The 
contextual effect we identify suggests interesting limits to the applicability 
of a universal, pledge-based policy. Like most other things, pledging works 
in moderation. Universal pledge-based policies will succeed only if they 
fail. The pledge’s effect is rooted in its nature as a minority identity movement.
Movement leaders are aware of this as well. They recognize that pledging 
is not for everyone. They understand the nonlinear effect of other pledgers 
on the probability of pledging. Policy makers should recognize that the 
pledge works because not everyone is pledging. 
APPENDIX A


Sample Selectivity 


As part of the ACASI section of the in-home interviews, adolescents were 
asked whether they had ever had sexual intercourse. Those who reported 
intercourse were asked when they had intercourse for the very first time 
(year and month). Figure A1 shows the survivor function for the transition 
to first intercourse, combining data from both waves,” for the full sample, 
by gender and race (all adolescents with valid data on month and year). 
In contrast to earlier research, among nonblacks we find gender differ- 
ences only at early ages. Later, girls “catch up” and the two functions 
converge.*” For blacks, gender differences are pronounced until late ad- 
olescence, when the survivor functions for girls and boys converge. The 
median transition age for nonblack teenagers is slightly below 18 years;
for blacks, slightly over 16. These figures are comparable to other national
estimates (AGI 1994; Moore, Driscoll, and Lindberg 1998). 
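
For readers unfamiliar with survivor functions, the sketch below shows one standard way to estimate such a curve and read off a median transition age, using the Kaplan-Meier product-limit estimator. This is an illustration under stated assumptions, not the authors' procedure: the toy ages and event indicators are invented, and the function names are hypothetical.

```python
def kaplan_meier(ages, events):
    """Product-limit survivor estimates at each age where an event occurs.

    ages:   age at first intercourse, or age at last interview if censored
    events: 1 if first intercourse was observed, 0 if censored (still a virgin)
    """
    data = sorted(zip(ages, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        age = data[i][0]
        n_events = 0
        n_leaving = 0
        # Collect everyone who has an event or is censored at this age.
        while i < len(data) and data[i][0] == age:
            n_events += data[i][1]
            n_leaving += 1
            i += 1
        if n_events:
            survival *= 1.0 - n_events / at_risk
            curve.append((age, survival))
        at_risk -= n_leaving
    return curve


def median_transition_age(curve):
    """First age at which the survivor function drops to 0.5 or below."""
    for age, survival in curve:
        if survival <= 0.5:
            return age
    return None  # fewer than half of the sample experienced the event


# Invented toy data, for illustration only.
toy_ages = [14, 15, 16, 16, 17, 18, 18, 19]
toy_events = [1, 1, 1, 0, 1, 1, 0, 0]

toy_curve = kaplan_meier(toy_ages, toy_events)
toy_median = median_transition_age(toy_curve)
```

Respondents who had not yet had intercourse when last interviewed are treated as censored: they count toward the risk set up to their last observed age but never contribute an event.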

The great advantage of longitudinal data lies in the ability to
order presumed causes and effects in time. Since most of our explanatory 
factors, including whether adolescents had taken a pledge, are measured 
in the first in-home interview, causal inferences for adolescents who by 
that time already had experienced first sexual intercourse are limited. 
Excluding these adolescents from the sample means that our sample is 
selective for later transitions to first intercourse. We now explore the 
impact of this selectivity. In our analyses, we consider only adolescents 
who were virgins at their first in-home interview.” Forty-four percent of 


“ Adolescents participating in both waves were asked twice about the date they had 
intercourse for the first time. In some of these cases, there is an inconsistency between
the data collected in the two waves. The solution we adopted was to choose the date
given in the first wave on the grounds that recollection of the timing of salient events 
in the life course declines in accuracy over time—the longer the time between the 
survey and the event in question, the more likely are recall errors (Brückner 1995). 
Since in the case of sexual debut underreporting is more likely than overreporting, we 
adopted this strategy also for teens who reported sexual debut in the first wave but 
claimed to be virgins in the second wave. A comparison of the survivor functions 
resulting from choosing either wave 1 or wave 2 data shows that the differences are 
minimal and have no impact whatsoever on the estimates of median transition age or 
the proportion of virgins at any other age. 

* In contrast to other studies, Add Health data do not show gender differences in the
timing of first intercourse for white adolescents. This is likely the result of ACASI 
technology. Because interviewers were not privy to adolescent responses, girls may 
have felt freer to report earlier transitions. Boys may have felt freer to admit to a later 
transition. Turner et al. (1998) report similar changes for abortion arising from ACASI
interviews. 

* We also exclude a few adolescents who report having had intercourse before age
10 (age 9 for black males). 
FIG. A1.—Transition to sexual intercourse by race and gender, full sample


teens in the in-home sample had already experienced sexual debut by the 
time they were interviewed. Although the internal validity of causal in- 
ferences is strengthened by concentrating the analysis on adolescents who 
were still at risk for initiating sexual activity when data on explanatory 
variables were collected, we have to be concerned with the external va- 
lidity of the resulting restricted sample. Specifically, the effects of factors 
influencing the age of sexual initiation can be underestimated in a sample 
from which part of the population at risk has already dropped out. Figure 
A2 shows a comparison between the full Add Health sample and the 
restricted sample that we described above. 

For nonblack respondents, who on average are older at sexual initiation, 
the survivor functions for the full and restricted samples are identical 
until age 15, after which they diverge somewhat. At the end of the ob- 
servation period, the estimated proportion with intercourse is very similar. 
For black adolescents, the restricted sample is far more selective toward 
late transitions. The estimated median transition age from the restricted 
sample is almost one year higher than that estimated from the full sample 
for black adolescents. 

In the intercourse models, adolescent pledge status is the key indepen- 
dent variable we focus on. Adolescents who were not married at the time 
of the first in-home interview were asked whether they had “ever taken 
a public or written pledge to remain a virgin until marriage.” For nonblack 

FIG. A2.—Transition to sexual intercourse by race, comparison of samples. Legend: full sample, whites (N = 15,466); restricted sample, whites (N = 7,123); full sample, blacks (N = 4,181); restricted sample, blacks (N = 1,349).


females, pledgers transitioned to first intercourse on average more than 
three years later than nonpledgers (38 months). Because fewer than half of
nonblack male pledgers had experienced sexual initiation, we compare 
the transition for pledgers and nonpledgers in the lowest quartile. Here, 
pledgers transitioned to first intercourse on average 2.5 years later than 
their nonpledging counterparts. For both black females and males, the 
difference in median transition age between pledgers and others is slightly 
above two years (27 months). For all groups, we observe a substantial 
delay in sexual debut for pledgers. 

For the restricted sample, selected on late intercourse, the difference in 
median age for nonblack girls is still at 1.25 years. For nonblack males, 
pledgers transition to first sex 20 months later than nonpledgers. For both 
male and female blacks, there is no difference in median transition age 
between pledgers and others. Compared to the full sample, pledgers are 
overrepresented in this sample, especially among black girls. Pledgers 
make up 30% of the black female restricted sample compared to 19% of 
the full sample (for black boys, the respective numbers are 9% vs. 17%, 
for nonblack boys 9% vs. 17%, and nonblack females 16% vs. 23%). 
Thus, with our restricted sample for black adolescents we cannot show 
a pledge effect; however, pledging is associated with being in our sample
(since pledgers have intercourse later, their chances to be part of our 
sample are greater). 
Sample selectivity also results from missing data. Like all surveys, Add
Health suffers from missing data. Throughout, we have replaced
missing values with the means for the associated race/sex group. In all
cases we control, with dummy variables flagging the imputed values, for
measurement error that may be introduced by mean imputation. None of
these dummy variables was significant in the models reported below.*
Unfortunately, mean substitution was not always possible.
A key requirement for our models involved timing relationships
with respect to intercourse. We therefore excluded all respondents who 
had missing data either on the occurrence or date of first intercourse or 
on starting and ending times of romantic relationships. In addition, we
lost a few respondents who had either missing data on the pledge question, 
or were not linked to any of the schools in the sample. Of the 8,150 
respondents who were still at risk of experiencing first intercourse at
the time of the first in-school interview and participated in the second, 
we were able to retain 6,776 cases in the event history models reported 
below. Most of these losses are due to missing start and end dates in the 
relationship data. We found no indication that respondents with missing 
dates were different from others in terms of their transition behavior or 
in terms of the distribution of independent variables used in this analysis. 
The results of these analyses are available from the authors. 
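
The imputation strategy described above (group means plus indicator variables for imputed values) can be sketched as follows. This is a minimal illustration, assuming records stored as dictionaries with `race` and `sex` keys; the function name, the field names, and the toy records are all hypothetical and are not Add Health variable names.

```python
from collections import defaultdict


def impute_group_means(rows, var, group_keys=("race", "sex")):
    """Fill missing values of `var` with the group mean and add a missing flag."""
    sums = defaultdict(float)
    counts = defaultdict(int)
    for row in rows:
        if row[var] is not None:
            key = tuple(row[k] for k in group_keys)
            sums[key] += row[var]
            counts[key] += 1

    imputed = []
    for row in rows:
        key = tuple(row[k] for k in group_keys)
        new_row = dict(row)  # leave the input records untouched
        is_missing = row[var] is None
        # The flag enters the model as a dummy variable alongside `var`.
        new_row[var + "_missing"] = int(is_missing)
        if is_missing and counts[key] > 0:
            new_row[var] = sums[key] / counts[key]
        imputed.append(new_row)
    return imputed


# Invented toy records, for illustration only.
toy_rows = [
    {"race": "nonblack", "sex": "F", "gpa": 3.0},
    {"race": "nonblack", "sex": "F", "gpa": None},
    {"race": "nonblack", "sex": "F", "gpa": 2.0},
    {"race": "black", "sex": "M", "gpa": 3.5},
]
imputed_rows = impute_group_means(toy_rows, "gpa")
```

Including the flag in the regression lets the model absorb any systematic difference between respondents with observed and imputed values, which is the check the text reports as nonsignificant.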

Problems of selectivity are certainly present for black adolescents in 
our sample. For blacks, selectivity on the dependent variable all but wipes 
out the effect of our key independent variable, pledging. Given these 
problems, we cannot draw inferences about race differences from our 
sample. Rather, we focus on gender differences among black teenagers, 
asking the reader to keep in mind that the results presented below for 
black adolescents should not be taken to be representative for the pop- 
ulation. For white adolescents, a delay of sexual debut associated with 
pledging observed in the full sample is somewhat reduced but still sub- 
stantial.* In sum, we are more likely to under- rather than overestimate 
a delaying effect of pledging on sexual debut. 


” Missing data problems are especially great for popularity. For the popularity measure, 
we need full information on the entire friendship network in the school. In the 24 
schools where the response rate was too low, this measure was set to missing. We
tested models both by including only cases with valid data on these variables and by 
excluding these two variables and found that the results are stable across all 
specifications. 

* We explored the extent of sample selectivity by running a multivariate hazard model
for both samples and comparing the estimated effects across samples, including only
explanatory variables that do not change over time and are not affected by sexual
initiation. With the exception of the early gender difference (see fig. A1) that we cannot
show with the restricted sample, the models for nonblack teenagers look very similar
in terms of effect sizes and direction (results not shown; tables available on request).
Again, for black teenagers, the restricted sample is clearly selective. 
APPENDIX B


Who Pledges? 


In this section, we present a multivariate model of pledging with the goal 
of further exploring what differentiates pledgers from others. Because the 
differences we observe might be due to selection, we model the probability 
of pledging between the two waves of in-home interviews. This neces- 
sitates the use of a different sample than the sample we use in our tran- 
sition models. First, we need to exclude all adolescents who said they had 
taken a pledge in the first wave. Adolescents who had already experienced 
sexual initiation at Wave 1 are excluded from the present analysis as well 
since adolescents who have had sex are very unlikely to pledge. This 
yields a sample of 6,849 persons, of whom 699 (9.41%) report that they
have taken a pledge between waves.** Compared to the wave 1 pledgers 
included in subsequent analyses of sexual debut, the pledgers in this sam- 
ple have joined the pledge movement later in its history. 

We estimate a logistic regression model with the same covariates we 
use in the transition rate models, including age as an additional covariate.” 
Relationship information cannot be treated as a time-varying covariate in
this type of model. We therefore include the relationship status as mea- 
sured at the time of the first in-home interview, the number of partners 
an adolescent had up to that date, and the mean characteristics of those 
relationships. The results are reported in table B1.” 

While pledgers are more likely to be female, once we control for other
factors the gender effect disappears. All variables associated with parents
and family background have virtually no effect on pledging. Although 
we find a bivariate difference with respect to parental disapproval of 
sexual initiation and parental closeness, these differences disappear in the 
multivariate model, when controlling for religiosity and school context. 
School integration, as well as academic achievement and sports participation,
are also not associated with a greater propensity to pledge. Pubertal
development and cognitive ability, however, both decrease the likelihood 


s We lose fewer cases due to missing data here because we do not need to know the 
exact dates of relationship start and end. Adolescents were asked whether a relationship 
was still ongoing, and that information was used to extract the relationship status at
the time of the first in-home interview. Cases where the relationship status was unclear
were included, together with a variable flagging them. 

£ Questions on attitudes about sex were only asked of respondents who were either
older than 14 or reported that they had experienced sexual debut. For this reason, we 
could not use these data in the models presented in this article. 

* Because pledging may affect some of the variables described in table 2, e.g., self-esteem
and one’s relationship to parents, and because we look at a different sample,
the means of independent variables for both groups are reported in cols. 1 and 2 of
table B1. 
TABLE B1
DETERMINANTS OF PLEDGING

                                             MEAN
                                    -----------------------    ODDS
Variable                            Pledgers    Nonpledgers    RATIO    95% CI
Age ..............................   15.43        15.43          .96     .90-1.02
Female ...........................     .60ᵇ         .49          .98     .71-1.25
Black ............................     .18          .16          .85     .64-1.13
Hispanic .........................     .18          .16          .85     .60-1.22
Asian ............................     .12ᵇ         .07         1.71†   1.09-2.69
Single-parent family .............     .21          .21         1.09     .81-1.49
Step- or foster family ...........     .17          .17         1.15     .86-1.54
SES ..............................    5.89         6.15          .97     .93-1.01
Closeness to parent ..............    4.47ᵇ        4.40         1.12     .89-1.43
Perceived parental disapproval ...    3.67ᵇ        3.52         1.08     .84-1.39
In romantic relationship .........     .19          .22         1.51†   1.09-2.10
N romantic partners ..............     .54ᵇ         .72          .78†    .64-.96
Mean emotional commitment ........    1.85         2.22         1.08     .97-1.19
Mean embeddedness ................     .83ᵇ        1.05          .84     .67-1.07
School attachment ................    2.87         2.78         1.09     .90-1.33
Unpopular in school network ......     .27          .24         1.18     .91-1.55
Pubertal development .............    2.85         2.96          .83*    .70-.99
Add Health verbal test score .....   97.12ᵇ      102.30          .98*    .97-.99
GPA ..............................    2.93         2.88         1.14     .97-1.34
Sports participation .............    1.16         1.15         1.08     .98-1.19
Self-esteem ......................    3.07         3.06          .89     .70-1.14
Religiosity ......................    2.28ᵇ        1.81         1.51†   1.31-1.74
%same-sex pledgers ...............   16.77         1.90         1.12†   1.09-1.15
%same-sex pledgers² ..............  404.17ᵇ      359.02          .99†    .98-.99


NOTE.—N of cases, measured at wave 2 in-home interview, is 699 for pledgers, 6,150 for nonpledgers.
χ² (df) is 602 (37). Coefficients not shown: six denomination categories, eight missing value flags (none
significant at α = .05). SEs adjusted for clustering; weighted data. The coefficient for %same-sex pledgers²
is multiplied by 10.

ᵃ Measured at wave 1, in-home interview.

ᵇ Significant bivariate differences.

ᶜ Measured at time of in-school interview.

* P < .05, one-tailed test.

† P < .05, two-tailed test.


of pledging substantially. Pubertal development here may indicate in- 
creased motivation to engage in sexuality (Udry 1988), which in turn 
decreases motivation for pledging. Cognitively more able adolescents 
probably have access to multiple strategies for saying no thank you to 
sex, and so are less likely to be attracted to the pledge movement. Being 
in a relationship increases the odds of pledging. This is consistent with a 
recruitment strategy of the pledge movement that constructs pledging as 
a token of “true love” between teenagers. There was no interaction effect 
between being in a relationship and number of prior partners. Thus, 
adolescents who have experience in the dating game are less likely to 
pledge, regardless of their current relationship status. The nature of the
relationship does not play a role for pledging.” 

The results of this analysis suggest that pledgers tend to be more re- 
ligious, from more normative backgrounds, and less physically developed 
than their nonpledging peers. We control for these, and other character- 
istics, in all of the intercourse models that assess a pledge effect. 


APPENDIX C 


Items Used in Index Construction 


Items for self-esteem index.—You have a lot of energy. You seldom get 
sick. When you do get sick, you get better quickly. You are well coor- 
dinated. You have a lot of good qualities. You are physically fit. You have 
a lot to be proud of. You like yourself just the way you are. You feel like 
you are doing everything just about right. You feel socially accepted. You 
feel loved and wanted. 

Items for school integration index.—You feel close to people at your 
school. You feel like you are part of your school. You are happy to be at 
your school. The teachers at your school treat students fairly. You feel 
safe in your school. 

Closeness to parents.—Most of the time, your mother (father) is warm 
and loving towards you. You are satisfied with the way your mother 
(father) and you communicate with each other. Overall, you are satisfied 
with your relationship with your mother (father). 

Religiosity.—How often do you go to church? How often do you pray? 
How important is religion to you? 

Social embeddedness of romantic relationship.—You went out together
in a group. You met your partner's parents. You told other people that 
you were a couple. 

Emotional commitment.—You saw less of other friends so you could
spend more time with your partner. You and your partner went out to- 
gether alone. You gave each other presents. You told each other that you 
loved each other. You thought of yourselves as a couple. 
Case of Abortion’
Recent observers have pointed to a growing polarization within the
U.S. public over politicized moral issues—the so-called culture wars. 
DiMaggio, Evans, and Bryson studied trends over the past 25 years 
in American opinion on a number of critical social issues, finding 
little evidence of increased polarization; abortion is the primary ex- 
ception. However, their conclusions are suspect because they treat 
ordinal or nominal scales as interval data. This article proposes new 
methods for studying polarization using ordinal data and uses these 
to model the National Election Study (NES) abortion item. Whereas 
the analysis of this item by DiMaggio et al. points to increasing 
polarization of abortion attitudes between 1972 and 1994, this article’s 
analyses of these data offer little support for this conclusion 
and lend weight to their view that recent concerns over polarization 
are overstated. 


INTRODUCTION 


Is America wracked by “culture wars”—divisive debates in the media 
over issues such as abortion rights, affirmative action, homosexuality, 
public funding for the arts, and so on—that polarize public opinion into 
opposing viewpoints? According to proponents of the “culture wars hy- 
pothesis,” the erosion of a common ground for reasoned ethical debate 
makes it difficult to resolve politicized moral issues and portends dangerous 
escalations in levels of social conflict (Hunter 1991, 1994; Bennet 
1992; Gitlin 1995; Hochschild 1995). This same concern is echoed in the 
lay literature on contemporary politics (e.g., Shor 1986; Bolton 1992; Ja- 
coby 1994; Bloom 1992; Gates 1992; Hartman 1991), as well as in the 
press. The Washington Post reported that between 1993 and 1996 there 
were approximately 1,500 articles in the media referring to contemporary 
American “culture wars.” Apparently, the public also believes that Amer- 
icans are culturally fragmented, with 86% of the respondents to a 1995 
Newsweek poll agreeing that Americans share fewer common values now 
than in the past. 

Others claim the concerns above are overblown, arguing that public 
opinion data indicate that the majority of people do not hold “extreme” 
views on divisive social issues (Brint 1992; Dillon 1996). For example, 
most national surveys about abortion show that the majority of Americans 
think abortion should be neither legal nor illegal in all circumstances 
(Blake and Del Pinal 1981; Gallup Poll News Service 1998). However, 
this does not address the basic question posed by the culture wars hy- 
pothesis: while public debate in America has always been contentious, is 
there something special about the contemporary “culture wars” that makes 
Americans more polarized now on important social issues than they were 
in the past? To address this hypothesis, it is not sufficient to measure 
polarization at a single point in time—we need to model trends in po- 
larization over time. 

In an important recent article, DiMaggio, Evans, and Bryson (1996, p. 
693) argued that polarization as a state “refers to the extent to which 
opinions on an issue are opposed in relation to some theoretical maxi- 
mum,” while polarization as a process “refers to the increase in such 
opposition over time.” To investigate polarization in the latter sense, they 
examine the change in variance and kurtosis of opinion items over time, 
with increasing variance and declining kurtosis taken to evidence polar- 
ization. Using time series of items from the General Social Survey (GSS) 
and National Election Study (NES), spanning the period 1972-94, 
DiMaggio et al. find no evidence for polarization of attitudes concerning 
crime and justice, gender, race, and sexual morality. They do find that 
attitudes toward abortion polarized over this period. However, their con- 
clusions are suspect because they apply statistical methods appropriate 
for interval data to data that are nominal or ordinal. For example, to 
study trends in the binary GSS abortion items, they create a scale by 
summing the scores for each item and treating the resulting scale as in- 
terval (in general, scales created in this fashion are not even ordinal). To 
analyze the four-category ordinal NES abortion item, they score the response 
categories 1, 2, 3, 4, and treat the resulting data as interval (see 
app. B, table B1). However, it is misleading to treat ordinal data as interval 
data because the scores assigned to the categories are arbitrary—only the 
relative ranking of the categories is known. 


2 Washington Post, February 23, 1997, final ed., p. C5. 

Our article makes several contributions. We develop appropriate meth- 
ods for studying polarization in ordinal variables, using these to model 
trends in attitudes toward abortion and to ascertain whether or not po- 
larization is increasing. We proceed as follows. First, we discuss the rel- 
evance of the abortion issue to the literature on culture wars and review 
previous work on trends in abortion attitudes. In the data and methods 
section, we describe the ordinal NES abortion item analyzed by DiMaggio 
et al. (1996) and several problems with their analysis, subsequently ex- 
positing (in context) the generalization of the cumulative probit model we 
developed to make inferences about trends in latent attitudes toward 
abortion. (A general discussion of the model and its implementation—in 
the Stata software package—is given in app. A.) We treat the observed 
NES item as an ordinal variable, making the usual type of assumption 
that responses to this item are the result of quantizing a normally dis- 
tributed latent attitude, and we develop models that are consistent with 
this assumption. Using the models, we find no evidence for an increase 
in polarization between 1972 and 1980. While several other models offer 
some support for the view that polarization increased between 1980 and 
1994, our best-fitting models, derived from theoretical considerations, do 
not support this view; a number of other considerations also suggest the 
superiority of these models over those that give evidence of polarization. 
Thus, as abortion is the exemplar issue in much of the culture wars 
literature, our findings suggest that the concerns expressed therein are 
overstated. 


SOCIAL CONFLICT OVER ABORTION RIGHTS: CULTURE WARS? 


“Culture wars” may be characterized as conflicts over issues that are 
rooted in nonnegotiable conceptions of cultural and moral order (Hunter 
1991, 1996; Wuthnow 1996). According to Wuthnow and Hunter, these 
conflicts stem from a breakdown of the old denominational religious loy- 
alties in America, with the traditional denominations splitting along a 
crosscutting conservative/liberal or orthodox/progressive divide that 
threatens the normative consensus (Davis and Robinson 1996). Hunter 
(1991) argues that the orthodox side stresses scriptural inerrancy and the 
belief that God is the ultimate arbiter of right and wrong, while progressives 
claim that morality is a human construct that is constantly evolving. 
Disagreement based upon fundamentally different moral frameworks 
is especially difficult to resolve given the lack of common ground between 
opposing viewpoints. Instead of a substantial number of moderates who 
can work toward compromise, the spectrum of opinion is increasingly 
polarized into irreconcilable viewpoints. Unable to negotiate between 
these competing moral views, debate thereby ceases to be an attempt to 
convince the opposition with reason and becomes instead a struggle for 
the power to impose a particular moral order. For these reasons, the new 
moral divisions in U.S. society create polarizing tendencies (Hunter 1991, 
p. 43). Furthermore, even if the majority of the population is “middle of 
the road,” it is argued that people will adopt more extreme views over 
time (Nolan 1996). According to Hunter (1991), this sort of discord is 
especially dangerous because it is antidemocratic—each side’s position is 
seen as a fundamental right that transcends the democratic process. He 
argues that the metaphor “culture wars” should be taken literally: “Vio- 
lence? The suggestion that violence can occur is not made lightly: culture 
wars always precede shooting wars. It is culture, after all, that justifies 
the use of violence. We are truly in the midst of a culture war of great 
social and historical significance” (1993, p. 30). 

Moreover, the divisions between competing moral visions are so deep 
“there is no resolution to the culture war in America” (Hunter 1996, p. 
253). If culture wars lead to increasing opinion polarization, then the 
decline in the proportion of people with moderate views makes it difficult 
to resolve politicized moral issues and may increase the likelihood that 
advocacy groups will resort to violence to impose their views. 

The dispute over abortion rights appears to epitomize the foregoing 
arguments (Hunter 1994), as witnessed by a number of recent books whose 
titles call attention to the presumed irreconcilable viewpoints of the parties 
to this conflict: Abortion: The Clash of Absolutes (Tribe 1992), No Neutral 
Ground: Abortion Politics in the Age of Absolutes (O'Connor 1996), The 
Abortion Dilemma (Claire 1995), and Between Two Absolutes: Public 
Opinion and the Politics of Abortion (Cook, Jelen, and Wilcox 1992). In 
an ethnographic study of abortion activists, Luker (1984, p. 2) argues that 
pro-choice activist women “share almost no common premises and very 
little common language” with antiabortion activist women; in particular, 
the worldviews and conceptions of motherhood held by the two different 
groups of women are antithetical. Hunter (1994) argues that activist organizations 
further contribute to opinion polarization by conducting “symbolic 
warfare” with their opponents. According to him, activists on both 
sides of the abortion dispute attempt to coerce consent by relying on 
rhetorical distortions and emotional appeals, with the modern media exacerbating 
matters by encouraging superficial and confrontational discourse. 
Thus, the debate over abortion is reduced to a struggle over inflammatory 
images that limits the possibilities for compromise: the coat 
hanger for pro-choice groups and grisly pictures of aborted fetuses for 
pro-life groups. A pro-life activist interviewed by Hunter (1994, p. 67) 
states, “This is a visual war we’re fighting here [because of] the 30-second 
sound-bite society we live in.” Hunter concludes that the outcome of this 
sort of discourse is a “thinning out” of the middle ground, reducing the 
chances for democratic compromise. 

Others do not view the abortion dispute as a clash between irrecon- 
cilable views. Ginsburg’s (1989) study suggests that although pro-life and 
pro-choice activists come to hold different opinions on abortion, there are 
also significant commonalities in their views, with both types of activists 
emphasizing the importance of nurturance, for example. Dillon (1996) 
argues that the public opinion literature indicates that Americans have 
complex and nuanced views about abortion that are not readily under- 
stood using a simple orthodox/progressive dichotomy. She also argues, 
from her work on the language of the abortion debate, that the level of 
rhetorical complexity featured here is similar to the level encountered in 
political discussions of other issues. According to her, activists on both 
sides of the fence use arguments that attempt to engage those with op- 
posing views, as opposed to summarily dismissing the contrary view out 
of hand. If this is the case, then the abortion dispute, while heated, does 
not differ from other forms of reasoned democratic discourse, implying 
that the concerns of writers on “culture wars” (at least on the abortion 
issue) are misdirected. 

Nonetheless, Hunter’s concern that increasing polarization portends 
outright civil violence should not be taken lightly. As table 1 shows, 
abortion violence and protest appear to have escalated in the first half of 
the 1990s, culminating in the murder of five abortion providers in 1993-94. 
These acts of violence and social protest, widely reported in the media, 
constitute the public face of the conflict over abortion and provide prima 
facie evidence for the concerns of Hunter and others who worry that the 
divide in U.S. society does not permit a peaceful resolution. However, 
some caution is in order. First, the level of violence associated with the 
abortion issue seems “small,” perhaps reflecting the actions of isolated 
individuals or fringe groups. Second, much of the increase in abortion 
protest appears to be picketing, and this type of activity cannot be viewed 
as antidemocratic. Thus, to use the type of evidence in table 1 to suggest 
that polarization in the U.S. public is spiraling out of control, as some 
sociologists and journalists have done, seems unwarranted. Further, even 
were the prima facie evidence stronger, it has been argued (Ginsburg 1998; 
Risen and Thomas 1998) that in the wake of this violence both pro-life 
and pro-choice groups adopted more moderate positions in an attempt to 
distance themselves from extremists. Therefore, it is conceivable that recent 
violence will generate a decline in polarization. Finally, and most 
important, no matter how tempting it is to do so, it is simply incorrect 
to infer the existence of increasing opinion polarization from the very 
outcomes such polarization is held to predict; social psychologists inter- 
ested in the relationship between attitudes and behavior learned a similar 
lesson many years ago. 

In short, it seems clear that in order to ascertain whether or not opinions 
are becoming more polarized, as argued by Hunter and other writers who 
speak of “culture wars,” it is necessary to directly examine opinion data. 
Using this type of evidence, Brint (1992) and Dillon (1996) argued that 
the proportion of persons holding extreme views on the abortion issue is 
not large enough to evidence polarization. However, there are several 
problems with these conclusions. Most important, these authors do not 
examine time trends, as would be required to ascertain whether or not 
levels of polarization are increasing. Second, even if interest resided in 
polarization at a single point in time, the definition of an extreme response 
is subjective, and these authors offer no criteria for assessing the extent 
of polarization. Having concluded that it is necessary to examine trends 
in abortion attitudes to assess the validity of the culture wars hypothesis, 
we now examine previous work on temporal trends in abortion attitudes. 


TRENDS IN ABORTION ATTITUDES 


Gauging public opinion about abortion is difficult given the complexity 
of the moral and legal issues involved; in addition, responses to opinion 
items depend on question wording and order (Jaffe, Lindheim, and Lee 
1981; Adamek 1994; Bumpass 1997). Nevertheless, several findings 
emerge consistently from the literature. First, at all points in time and on 
all major surveys, a substantial proportion of the population does not 
think that abortion should be strictly legal or illegal in all circumstances 
(see Cook et al. 1992; Hunter 1994). According to a 1998 Gallup poll, this 
percentage was 54% in 1975, 48% in 1992, and again 58% in 1998.’ In 
this sense, it seems true that a large proportion of the population holds 
“middle of the road” views about abortion. Second, support for abortion 
varies widely according to the specific circumstances of the abortion. A 
large majority of individuals support abortion when the life of the mother 
is in danger or in the case of rape or incest. In contrast, there is consid- 
erably less support for abortion for social reasons, such as not being able 
to afford the child (Gallup Poll News Service 1979; Granberg and Granberg 
1980; Cook et al. 1992). Third, the level of support for abortion has 
been fairly stable since the Roe v. Wade decision. While the level of support 
increased substantially from the early 1960s to the early 1970s, there was 
little change throughout the rest of the 1970s and a slight decline in the 
early 1980s (de Boer 1977; Glenn 1987; Gallup Poll News Service 1979; 
Granberg and Granberg 1980; Gillespie, Vergert, and Kingma 1987; Page 
and Shapiro 1992). By 1985, the net result was that the level of public 
support for abortion in the 1972 and 1985 GSS data was almost identical 
(Glenn 1987). Recent opinion data show some fluctuation in the support 
for abortion. According to Gallup survey data, the percentage of people 
who thought that abortion should be legal in all circumstances rose from 
21% in 1975 to 29% in 1989 and 34% in 1992, but had declined to 23% by 
1998 (Gallup Poll News Service 1998). 
The remarkable thing about this trend is that, despite these fluctuations, 
support for legal abortion is about the same as it was 20 years ago. 


3 These data were mentioned in an article entitled “Fluctuations, but No Major Change 
in Views on Abortion” that was posted at CNN Interactive 
(http://www.cnn.com/US/9801/20/abortion.poll/index.html) in 1998. See also Gallup 
Poll News Service (1998) and Hall (1998). 

However, as Hunter (1994) and DiMaggio et al. (1996) note, aggregate 
measures of support for a particular issue—that is, the “average” opin- 
ion—fail to measure attitude polarization. To see this, simply imagine the 
case where the “average opinion” is temporally stable, with individuals 
on both sides of the center moving steadily toward the extremes. While 
the average opinion is unchanged, the variance is increasing; substan- 
tively, increased variability may make compromise between opposing 
views more difficult to obtain. 
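
The point can be made concrete with a small numerical sketch. The two response distributions below are invented for illustration (they are not NES or GSS figures), and the categories are scored 1-4 only to make the arithmetic visible; the scoring itself is the kind of interval assumption questioned elsewhere in this article.

```python
# Hypothetical proportions over four response categories (not survey data).
SCORES = (1, 2, 3, 4)


def mean_and_variance(props, scores=SCORES):
    """Return the mean and variance implied by a set of category proportions."""
    mean = sum(p * s for p, s in zip(props, scores))
    var = sum(p * (s - mean) ** 2 for p, s in zip(props, scores))
    return mean, var


moderate = (0.10, 0.40, 0.40, 0.10)   # most respondents near the center
polarized = (0.40, 0.10, 0.10, 0.40)  # most respondents at the extremes

mean_mod, var_mod = mean_and_variance(moderate)
mean_pol, var_pol = mean_and_variance(polarized)

print(f"moderate:  mean = {mean_mod:.2f}, variance = {var_mod:.2f}")
print(f"polarized: mean = {mean_pol:.2f}, variance = {var_pol:.2f}")
```

Both distributions have the same average, so a time series of means alone would register no change between them, while the spread differs by a wide margin.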

To address the issue of opinion polarization in abortion attitudes, 
DiMaggio et al. (1996) used two sources, the NES and the GSS. The NES 
is a biennial survey in which respondents were asked, beginning in 1972, 
to express their opinion about abortion on a four-point ordinal scale. 
The abortion item was not asked in 1974, but was asked in 1976, 
1978, ..., 1994. DiMaggio et al. scored the categories 1-4 and treated 
the resulting data as interval data, calculating (at each time point) the 
sample mean, variance, and kurtosis of the item responses; they then 
regressed the series of means, variances, and kurtoses against time. In 
1980, the NES response codes were changed, but a subset of 1,320 of the 
respondents in that year were asked to respond using both the old and 
new response codes; for this year, they used only these respondents, de- 
fining the respondent’s score as the average of the responses to the old 
and new items. Their analysis suggests that, over the study period, the 
variability in opinions increased and the kurtosis decreased, evidencing 
increasing polarization. 

There are two major problems with DiMaggio et al.’s analysis of the 
NES data. First, because the NES item is ordinal, it is not appropriate 
to calculate means, variances, and the kurtosis of the item. Second, 
DiMaggio et al. (1996) pieced the NES data into a single series combining 
the old NES item (1972-78), the average of the old and new items (1980), 
and the new NES item (1982-94). We discuss these problems further in 
the next section and present appropriate statistical methods for assessing 
polarization in ordinal opinion items. 
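
To illustrate the first problem, the following sketch compares the variance of two invented response distributions under two order-preserving scorings. The proportions and the alternative scoring (1, 2, 3, 6) are hypothetical choices made only for this example; neither comes from the NES data or from DiMaggio et al.

```python
def variance(props, scores):
    """Variance of a scored categorical variable with the given proportions."""
    mean = sum(p * s for p, s in zip(props, scores))
    return sum(p * (s - mean) ** 2 for p, s in zip(props, scores))


# Hypothetical category proportions for two survey years (not NES data).
year_a = (0.40, 0.20, 0.30, 0.10)
year_b = (0.10, 0.40, 0.30, 0.20)

# Two scorings that both respect the ordering of the four categories.
equal_spacing = (1, 2, 3, 4)
stretched_top = (1, 2, 3, 6)

for label, scores in (("1,2,3,4", equal_spacing), ("1,2,3,6", stretched_top)):
    var_a = variance(year_a, scores)
    var_b = variance(year_b, scores)
    more_dispersed = "A" if var_a > var_b else "B"
    print(f"scores {label}: var A = {var_a:.2f}, var B = {var_b:.2f}, "
          f"more dispersed year: {more_dispersed}")
```

Because only the ranking of the categories is known, both scorings are equally defensible, yet they disagree about which year shows more dispersion. A conclusion about trends in variance can therefore depend on an arbitrary choice of scores.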

Using six binary items on abortion from the GSS, DiMaggio et al. also 
constructed a time series (with 14 points) spanning 1977-94. Each item 
was initially scored 1 or 2, and the item scores were added, resulting in 
a scale value for each respondent. The resulting scale was treated as 
interval data, and the data were analyzed as above. According to the 
authors, opinion was already sharply divided at the onset of the series in 
1977, and polarization increased over the study period, as evidenced by 
increasing dispersion and declining kurtosis, with the decrease in kurtosis 
leveling off by the mid-1980s. This analysis is also problematic, as scales 
constructed in such a fashion need not even be ordinal, and even if the 
items are “unidimensional,” that does not mean that a scale constructed 
in the manner above is interval. To analyze these data properly, it would 
be necessary either to examine the trends in each item separately or to 
construct a probabilistic model of the response process with an underlying 
latent attitude toward abortion. 

Because the methods we present for analyzing polarization using an 
ordinal item are not necessarily appropriate for analyzing the GSS items, 
these data are not analyzed in this article. Although we do not accept the 
conclusions DiMaggio et al. reached using these data, the reader should 
note that it is possible that the GSS data, were they analyzed properly, 
might still suggest that attitudes toward abortion polarized. We return to 
that issue in the discussion. 


DATA AND METHODS 


The NES is conducted biennially by the Institute for Social Research at 
the University of Michigan. The NES sample that we use is a nationally 
representative probability sample of all households in the United States. 
In 1992 and 1994, some of the NES respondents are panel respondents 
in ongoing longitudinal studies. For each year, the nonpanel respondents 
constitute a nationally representative sample, but the panel respondents 
do not; thus, only the nonpanel respondents are used in the analysis. Table 
2 displays the time series of data for both old and new questions. 
Comparison of the response codes in table B1 reveals that the new item 
emphasizes the legality of abortion, while the old one does not, suggesting 
the two items do not measure the same latent attitude toward abortion. 
If this is the case, it was inappropriate for DiMaggio et al. to combine 
the two items to form a single series and then examine this series for 
trends. 


TABLE 2 
RESPONSES TO ABORTION QUESTIONNAIRE 

          Never    Life in    Clearly        Always 
Year      Legal    Danger     Established    Legal     Total    N 
Old: 
  1972    11.2     46.9       17.3           24.5      100      2,619 
  1976    11.3     45.7       16.3           26.8      100      1,822 
  1978    10.9     44.6       16.8           27.6      100      2,187 
  1980     9.8     44.7       18.0           27.5      100      1,362 
New: 
  1980    11.5     32.7       19.0           36.8      100      1,547 
  1982    13.3     30.8       19.6           36.4      100      1,359 
  1984    13.3     30.3       20.0           36.4      100      2,171 
  1986    13.4     29.0       18.4           39.3      100      2,120 
  1988    12.7     33.2       18.5           35.6      100      1,996 
  1990    12.2     33.2       14.4           40.2      100      1,932 
  1992    10.8     26.2       13.8           49.2      100      1,087 
  1994    12.6     30.8       14.7           42.0      100        994 

NOTE.—Data are from the National Election Study. For 1992-94, sample excludes panel study 
members. See table B1 for a description of the categories. 


To address this important issue further, we use the 1,320 NES 
respondents who answered both the old and new abortion items in 1980. 
Table 3 contains the percentages, means, and variances for both sets of 
questions. Table 4 contains the cross-classification of responses to both 
sets of questions. If these items measure the same latent attitude and the 
relationship between the latent attitude and observed responses is the 
same for both items, the marginal distributions (see table 3) of the two 
items will be identical. If the two marginal distributions are not identical, 
either (1) the two items do not measure the same latent variable, in which 
case it is not appropriate to combine the years 1972—78 with the years 
1980—94, or (2) the relationship between the latent attitude and the ob- 
served response is not identical for the two items, in which case assigning 
identical sets of scores to the two items and examining trends over the 
full period 1972-94 almost certainly confounds the item change in 1980 
with any possible true attitudinal change. Inspection of table 3 suggests 
that the 1980-94 question elicits a lower level of response in category 2 
and a higher level of response in category 4; further, the null hypothe- 
sis—that the two marginal distributions are identical—is rejected.* There- 


* To ascertain whether or not the distributions of the old and new NES items are 
identical, we test for marginal homogeneity in the cross-classification of old by new 
responses for the 1,320 respondents in 1980 who answered both items. The data are 
given in table 4. Because the log-linear model of quasisymmetry (QS) fits these data 
adequately ($L^2[QS] = 7.21$; df = 3; $P = .07$), where $L^2$ denotes the likelihood ratio $\chi^2$ 




TABLE 3 
OLD AND NEW ABORTION QUESTIONS IN 1980, 
PERCENTAGES, MEANS, AND VARIANCES 

Question/Category    Old      New 
1                      9.9     12.0 
2                     44.2     32.1 
3                     18.2     18.7 
4                     27.7     37.2 
Total                100.0    100.0 
Mean                  2.64     2.81 
Variance              .983     1.14 

NOTE.—N = 1,320. 


fore, the two items should not be combined into a single time series, as 
was the case in DiMaggio et al.* 
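
The symmetry statistic used in the marginal homogeneity test can be recomputed directly from the cell counts in table 4. The sketch below does so in plain Python; the quasisymmetry statistic is not refitted but taken as reported (7.21), and the helper names are ours. The closed-form tail probability applies only to a chi-square variate with 3 degrees of freedom.

```python
import math

# Table 4: rows = old item category, columns = new item category (1980, N = 1,320).
TABLE_4 = [
    [75, 40, 7, 8],
    [73, 334, 120, 57],
    [7, 36, 90, 107],
    [3, 14, 30, 319],
]


def symmetry_l2(table):
    """Likelihood ratio statistic for the symmetry model (expected n_ij = n_ji)."""
    size = len(table)
    l2 = 0.0
    for i in range(size):
        for j in range(size):
            if i != j and table[i][j] > 0:
                fitted = (table[i][j] + table[j][i]) / 2
                l2 += 2 * table[i][j] * math.log(table[i][j] / fitted)
    return l2


def chi2_sf_df3(x):
    """Upper-tail probability of a chi-square variate with 3 degrees of freedom."""
    return math.erfc(math.sqrt(x / 2)) + math.sqrt(2 * x / math.pi) * math.exp(-x / 2)


L2_QS = 7.21  # quasisymmetry statistic as reported in the text (not recomputed here)

l2_s = symmetry_l2(TABLE_4)
conditional = l2_s - L2_QS  # conditional test of marginal homogeneity, df = 3

print(f"L2[S] = {l2_s:.2f}")
print(f"P-value for L2[QS] on 3 df = {chi2_sf_df3(L2_QS):.3f}")
print(f"L2[S] - L2[QS] = {conditional:.2f}, P = {chi2_sf_df3(conditional):.2e}")
```

The off-diagonal asymmetry is concentrated in the cells that move respondents from lower categories on the old item to higher categories on the new item, which is what drives the large symmetry statistic.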

To understand the effect of the change in question wording on the 
conclusions drawn by DiMaggio et al., we need to compare their analysis 
(described previously) with an analysis that adjusts the series of means, 
variances, and kurtoses for this change. To perform these adjustments, 
we used the 1980 data on the 1,320 respondents who answered 
both the old and new NES items (see table 3), treating these data as interval 
data, as was done in DiMaggio et al. (1996). The means of the old and 
new items are, respectively, 2.638 and 2.811, which yields an estimate of 
the effect (on the mean) due to the change in wording of 2.811 - 
2.638 = .173; we therefore added .173 to the means for the pre-1980 data. 
For the data from 1980-94, we used the mean calculated from the new 
item. The variance series and the kurtosis series were adjusted similarly, 
using .983 for the variance of the old item and 1.140 for the variance of 
the new item, and -1.198 for the kurtosis of the old item and -1.319 for 
the kurtosis of the new item. 
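
The quantities used in this adjustment can be approximated from the rounded percentages in table 3. The sketch below is illustrative only: it works from the published percentages rather than the individual-level data, so its results agree with the figures in the text only up to rounding, and the function and variable names are ours.

```python
SCORES = (1, 2, 3, 4)
OLD_PCT = (9.9, 44.2, 18.2, 27.7)   # table 3, old item, 1980
NEW_PCT = (12.0, 32.1, 18.7, 37.2)  # table 3, new item, 1980


def moments(pct, scores=SCORES):
    """Mean, variance, and excess kurtosis when the scores are treated as interval."""
    total = sum(pct)
    props = [p / total for p in pct]
    mean = sum(p * s for p, s in zip(props, scores))
    m2 = sum(p * (s - mean) ** 2 for p, s in zip(props, scores))
    m4 = sum(p * (s - mean) ** 4 for p, s in zip(props, scores))
    return mean, m2, m4 / m2 ** 2 - 3


old = moments(OLD_PCT)
new = moments(NEW_PCT)

# Estimated wording effect on each moment: new item minus old item.
# These shifts would be added to the pre-1980 values of each series.
shifts = tuple(n - o for o, n in zip(old, new))

for name, o, n, d in zip(("mean", "variance", "kurtosis"), old, new, shifts):
    print(f"{name:8s} old = {o:7.3f}  new = {n:7.3f}  shift = {d:+.3f}")
```

Note that this reproduces the interval-level treatment only to show how the adjustment works; it does not make that treatment appropriate for an ordinal item.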

Without adjusting the data for changes due to wording, DiMaggio et 
al. (p. 716) regressed their series of means, variances, and kurtoses on 
time, leading them to conclude that, between 1972 and 1994, attitudes toward 
abortion became more liberal, but also more polarized, as evidenced by 
increasing variability and decreasing kurtosis. Using the adjusted mean 
series, the estimated slope is .008 (SE = .003; P = .011), slightly smaller 
than the estimated slope in DiMaggio et al. For the variance series, the 


statistic and P denotes the P-value (based on $L^2$), marginal homogeneity is equivalent 
to symmetry (S) (Agresti 1990, p. 390); therefore, the null hypothesis may be tested 
using the conditional likelihood ratio test ($L^2[S] - L^2[QS] = 133.69 - 7.21 = 126.48$; 
df = 3; P < .001). Clearly, this leads to rejection at any conventional level of significance. 
+ One could overcome this problem if plausible adjustments were made to compensate 
for the effect of the change in wording. 


TABLE 4 
CROSS-CLASSIFICATION OF RESPONSES TO THE OLD AND NEW 
ABORTION QUESTIONS, 1980 

                           NEW QUESTION/CATEGORY 
OLD QUESTION/CATEGORY      1      2      3      4      TOTAL 
1                          75     40      7      8       130 
2                          73    334    120     57       584 
3                           7     36     90    107       240 
4                           3     14     30    319       366 
Total                     158    424    247    491     1,320 

NOTE.—Source: National Election Study, 1980. 


estimated slope is .003 (SE = .0009; P = .009); although the estimated 
slope is now substantially smaller than that reported by DiMaggio et al., 
note that it is still significantly different from zero at the .05 level, indicative 
of increasing polarization. For the kurtosis, the estimate is -.004 
(SE = .003; P = .163); in DiMaggio et al., the slope is -.013, significantly 
different from zero at the .05 level. 

Our calculations suggest that when the ordinal NES item is adjusted 
for the change in item wording, but still inappropriately treated as interval 
data, there is still evidence (although not as strong as before) of increasing 
polarization. Similarly, our calculations suggest that attitudes became 
more liberal, but not to the extent suggested by DiMaggio et al. However, 
these calculations do not bear on the implicit assumption that the old and 
new NES items measure the same attitude. 

Because the old NES item makes no reference to the legality of 
abortion—and therefore seems to tap personal beliefs, while the new NES 
item refers to legality and therefore seems to tap something other than 
purely personal beliefs—we do not believe it is reasonable to think the 
old and new items measure the same attitude. Therefore, in this article, 
we model the old NES abortion item using the data from 1972-78 and 
the respondents who answered the old item in 1980, and we separately 
model the new NES abortion item using the data from 1980-94. We ask 
whether polarization increased between 1972 and 1980 (using the old item) 
and whether or not polarization increased between 1980 and 1994 (using 
the new item). 


The Model 


We now discuss a model for ordinal dependent variables that allows us 
to more appropriately assess whether or not attitudes toward abortion 
have polarized in the past 25 years. The model we use to analyze the 
data in table 2 is a cumulative probit model with heteroscedasticity and 
variable cutpoints. A general discussion of the model and its estimation 
is given in appendix A; here we exposit the model within the context of 
a four-point ordinal scale. In the usual cumulative probit model for an 
ordinal scale Y with 4 categories, a latent variable indexing the unobserved 
attitude of respondent $i$ at time $t$ (for convenience of exposition, we index 
the time points $t = 1, \ldots, T$) is postulated to follow a regression model: 


$$Y^*_{it} = \mu_t + \varepsilon_{it}, \tag{1}$$ 


where $\mu_t$ is the mean response for time $t$, and $\varepsilon_{it}$ is a normally distributed 
error with mean zero and variance $\sigma^2$. The relationship between $Y^*_{it}$ and 
the observed variable $Y_{it}$ (the response on the four-point scale) is assumed 
to be of the form $Y_{it} = 1$ if $Y^*_{it} \le \tau_1$, $Y_{it} = 2$ if $\tau_1 < Y^*_{it} \le \tau_2$, $Y_{it} = 3$ if 
$\tau_2 < Y^*_{it} \le \tau_3$, and $Y_{it} = 4$ if $Y^*_{it} > \tau_3$, where $\tau_1$, $\tau_2$, and $\tau_3$ are the category 
cutpoints. As is well known, not all the parameters ($\tau_1, \tau_2, \tau_3, \sigma^2, \mu_t$, $t = 1, \ldots, T$) 
of the cumulative probit model are identified, and two restrictions 
must be imposed to set the location and scale of the latent attitude $Y^*$. 
Typically, the restrictions $\tau_1 = 0$ and $\sigma^2 = 1$ are used, although other 
identifying rules may be more useful in particular applications, as herein. 
A graphical interpretation of the model can also be given (see fig. 1). 
Respondents with fewer than $\tau_1 = -1$ units of the latent attitude $Y^*$ answer 
the NES item using response category 1, while respondents between 
$\tau_1 = -1$ and $\tau_2 = 0$ units choose response category 2, and so on. 
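
As a rough sketch of how the model maps the latent attitude into response probabilities, the code below computes the four category probabilities for one time point and then checks the identification problem numerically. The values of the mean, standard deviation, and third cutpoint are assumptions chosen for illustration; only the first two cutpoints follow the values used in the fig. 1 illustration.

```python
from statistics import NormalDist


def category_probs(mu, sigma, cutpoints):
    """P(Y = 1), ..., P(Y = K) when Y* ~ N(mu, sigma^2) is cut at the given thresholds."""
    latent = NormalDist(mu, sigma)
    cum = [latent.cdf(t) for t in cutpoints] + [1.0]
    return [hi - lo for lo, hi in zip([0.0] + cum[:-1], cum)]


# tau_1 = -1 and tau_2 = 0 as in the fig. 1 illustration; tau_3 = 1 is an assumed value.
CUTPOINTS = (-1.0, 0.0, 1.0)
MU, SIGMA = 0.3, 1.2  # hypothetical mean and standard deviation for one time point

probs = category_probs(MU, SIGMA, CUTPOINTS)
print("category probabilities:", [round(p, 4) for p in probs])

# Identification: shifting every parameter by a and rescaling by c > 0
# produces exactly the same probabilities, so location and scale must be fixed.
a, c = 2.0, 3.0
transformed = category_probs(a + c * MU, c * SIGMA, tuple(a + c * t for t in CUTPOINTS))
print("after shift and rescale:", [round(p, 4) for p in transformed])
```

Because the observed data constrain only the probabilities, any two parameter sets related by such a shift and rescaling fit equally well, which is why two restrictions are needed.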

The usual cumulative probit model cannot be used to model opinion 
polarization because the variance of the latent attitude is constrained to 
be the same at all time points in this model. We remove this deficiency, 
allowing the variance of the latent attitude to depend on time (see Sobel 
[1998] for closely related material). In addition, we allow for the possibility 
that one or more of the cutpoints may change over time; this feature is 
demonstrated in our empirical analysis. In previous work, Terza (1985) 
allowed for variable cutpoints in the cumulative probit model, and 
McCullagh and Nelder (1989) discussed extensions of the proportional 
odds model where the variance is allowed to depend on independent 
variables. We also note that the kurtosis of any normal distribution is 
zero. Were we, following DiMaggio et al. (1996), to argue that declining 
kurtosis evidences bimodality and is therefore also necessary for estab- 
lishing increasing polarization and were our model to fit the data, the 
null hypothesis of no increase in polarization would be accepted (even if 
the variance was increasing). There are two problems with such an ar- 
gument. First, kurtosis does not indicate bimodality (Balanda and 
MacGillivray 1988). Second, in our view, bimodality does not necessarily 
evidence polarization; just consider the case where the modes are “very
close.” However, in our view, it is intuitively clear that an increase in the
variance of a normally distributed random variable evidences increasing
polarization.

Fig. 1. Hypothesized effect of the 1989 Webster decision on the category 3
cutpoint, $\tau_3$. (Horizontal axis: $Y^*$, the latent variable measuring
abortion attitude, with the cutpoints marked, beginning at $\tau_1 = -1$ and
$\tau_2 = 0$; Y = observed ordinal variable.)

Under our model, the parameters of substantive interest are $\mu_t$ (the
mean at time t) and $\sigma^2_t$ (the variance at time t), t = 1, ..., T. To
identify the model, we set $\tau_1$ and $\tau_2$ to −1 and 0, respectively
(for all time periods). In a number of the models considered later, the
cutpoint $\tau_3$ is allowed to change over time. Using these identification
rules, a mean $\mu_t > 0$ indicates that more than half the respondents at
time t respond using category 3 or category 4, while fewer than half the
respondents use these categories if $\mu_t < 0$. The variances are not
identified in an absolute sense, but fixing the cutpoints sets the scale of
the latent variable. As a result, the absolute magnitude of the variance is
not meaningful herein, but the relative sizes of the variances are
identified; that is, the variances can be compared over time. We test the
culture wars hypothesis that polarization is increasing over time by looking
for trends in the variance of the latent variable.
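
For reference, the response probabilities implied by this setup can be written out explicitly. The display below is only a sketch in the notation of this section: $\Phi$ denotes the standard normal distribution function, and the conventions $\tau_0 = -\infty$ and $\tau_4 = +\infty$ are added here for compactness rather than taken from the text. Appendix A remains the authoritative statement of the likelihood.

```latex
\Pr(Y_{it} = k)
  = \Phi\!\left(\frac{\tau_k - \mu_t}{\sigma_t}\right)
  - \Phi\!\left(\frac{\tau_{k-1} - \mu_t}{\sigma_t}\right),
  \qquad k = 1, \dots, 4,
  \qquad \tau_0 = -\infty,\; \tau_4 = +\infty .
```

With the first two cutpoints fixed, a larger $\sigma_t$ spreads probability toward categories 1 and 4, which is why trends in the variance are read as trends in polarization.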

The method of maximum likelihood is used to estimate the model. 
Under the usual types of regularity conditions (met here), the parameter 
estimates are consistent and asymptotically normally distributed in large 
samples. The computations were performed using the software package 
Stata (StataCorp 1997; for further details, such as the log likelihood and 
the first and second derivatives, see app. A). 


FINDINGS 
Old NES Item, 1972-80 


We begin by analyzing the old NES item for the years 1972-80 (see table
2). For comparative purposes, we first note that DiMaggio et al. would
have found no trend in the variance (slope = .003; SE = .004) of the
old NES question between 1972 and 1980 by inappropriately treating the
data in the upper part of table 2 as interval data and regressing the series
of variances on time.* Table 5 presents model evaluation statistics for a
number of the models we fit to these data (models $H_1$ to $H_5$). The
Bayesian information criterion, or BIC (Raftery 1995), is calculated here as
$L^2 - \mathrm{df} \times \ln(N)$, where N is the number of cases. Lower
levels of BIC indicate preferred models. In this article, we use $L^2$ to
assess the model fit, and we use BIC as a secondary selection criterion.
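
As a minimal sketch of how the BIC column of table 5 relates to $L^2$ and the degrees of freedom, the Python snippet below back-solves ln(N) from one row of the table and uses it to reproduce another. The sample size N is not reported in this section, so it is recovered from the table rather than assumed; the function names are illustrative only.

```python
def bic(l2, df, log_n):
    """BIC as defined in the text: L^2 - df * ln(N)."""
    return l2 - df * log_n


def implied_log_n(l2, df, bic_value):
    """Back-solve ln(N) from a reported (L^2, df, BIC) triple."""
    return (l2 - bic_value) / df


# Table 5, 1972-80 data: H1 has L^2 = 0.72, BIC = -26.24, df = 3.
log_n = implied_log_n(0.72, 3, -26.24)

# Reuse that ln(N) to reproduce the BIC reported for H2 (L^2 = 3.72, df = 6).
bic_h2 = bic(3.72, 6, log_n)
print(round(log_n, 3), round(bic_h2, 2))
```

Because every model fitted to the same data shares the same N, this kind of check is a quick way to spot transcription errors in a table of fit statistics.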

First, we considered model $H_1$. In this model, the cutpoint $\tau_3$ is
constant across years, but the means and variances are allowed to differ in
each year. To ensure that the cutpoint $\tau_3 > 0$, we parameterized the
model using $\ln(\tau_3)$; similarly, we estimated $\ln(\sigma^2_t)$. To
obtain the residual degrees of freedom for this model, note that nine
parameters are estimated: one cutpoint, $\tau_3$, and two parameters for each
of the four time points (the mean $\mu_t$ of the latent variable and the
variance $\sigma^2_t$). In addition, at each time, the response proportions
must add to one, imposing four additional constraints. Thus, the number of
constraints on the 16 proportions is 13; hence there are 3 df. The degrees of
freedom for the models subsequently considered can be determined analogously.
It is evident that $H_1$ fits the data adequately; the P-value associated
with $L^2(H_1)$ is .87. That is, our latent variable model accounts for the
distribution of the observed NES item.
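
The degrees-of-freedom count just described can be sketched in a few lines of Python. The helper name `residual_df` is hypothetical; it simply subtracts the number of estimated parameters from the number of free response proportions (one fewer than the number of categories in each survey year).

```python
def residual_df(n_years, n_categories, n_parameters):
    """Free response proportions minus estimated parameters."""
    free_proportions = n_years * (n_categories - 1)
    return free_proportions - n_parameters


# Model H1 for 1972-80: four survey years, four response categories,
# one cutpoint plus a mean and a variance for each year.
h1_parameters = 1 + 2 * 4
print(residual_df(4, 4, h1_parameters))

# Constant-variance variant: one cutpoint, four means, one variance.
h2_parameters = 1 + 4 + 1
print(residual_df(4, 4, h2_parameters))
```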

We now ask whether or not the variances increased over the study
period, considering model $H_2$, in which the variances (but not the means)


*In addition, for the mean series, the estimated slope is .010; SE = .001; for the 
kurtosis series, the estimated slope is —.015; SE = .004; P = .073. 
are equal in all years; this model also fits the data adequately (P = .71)
and $H_1$ does not fit the data better ($L^2[H_2] - L^2[H_1] = 3.00$; df = 3;
P = .391). Thus, we conclude (for the old NES item) that there is no evidence
of increasing polarization between 1972 and 1980.

We also ask if there is a trend in the means between 1972 and 1980.
Model $H_3$ is the special case of $H_1$ in which the means are constant
between 1972 and 1980; comparing $H_3$ with $H_1$ ($L^2[H_3] - L^2[H_1] =
7.52$; df = 3; P = .057) suggests the means may vary over this period.
Similarly, model $H_4$ is the special case of $H_2$ in which the means are
constant between 1972 and 1980, and comparison of $H_4$ with $H_2$ again
points to variability in the means ($L^2[H_4] - L^2[H_2] = 7.17$; df = 3;
P = .067). Therefore, we fit model $H_5$ with constant variance and a linear
trend in the mean; this model fits the data almost as well as $H_2$, but with
two fewer parameters. In addition, $H_5$ is clearly preferred over $H_4$ on
the basis of the likelihood ratio test ($L^2[H_4] - L^2[H_5] = 7.07$; df = 1;
P = .008); further, although the BIC statistic would suggest favoring $H_4$
over $H_5$, the difference between the two values is trivial.
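
The conditional likelihood-ratio comparisons above can be reproduced from the $L^2$ and df columns of table 5. The sketch below is not the authors' Stata code; it is a self-contained Python illustration in which `chi2_sf` is a hand-written chi-square tail probability for integer degrees of freedom (the standard finite-series formulas), and the dictionary `fit` simply transcribes the 1972-80 rows of the table.

```python
import math


def chi2_sf(x, df):
    """Upper-tail probability of a chi-square variate with integer df."""
    if x <= 0:
        return 1.0
    if df % 2 == 0:
        # Even df: exp(-x/2) * sum_{r=0}^{df/2-1} (x/2)^r / r!
        term, total = 1.0, 1.0
        for r in range(1, df // 2):
            term *= x / (2 * r)
            total += term
        return math.exp(-x / 2) * total
    # Odd df: two-sided normal tail plus a finite series in sqrt(x).
    root = math.sqrt(x)
    density = math.exp(-x / 2) / math.sqrt(2 * math.pi)
    term, total = root, 0.0
    for r in range(1, (df - 1) // 2 + 1):
        if r > 1:
            term *= x / (2 * r - 1)
        total += term
    return math.erfc(root / math.sqrt(2)) + 2 * density * total


# (L^2, df) from table 5, 1972-80 data.
fit = {"H1": (0.72, 3), "H2": (3.72, 6), "H3": (8.24, 6),
       "H4": (10.89, 9), "H5": (3.82, 8)}


def lr_test(restricted, general):
    """Conditional likelihood-ratio test of a nested pair of models."""
    l2_r, df_r = fit[restricted]
    l2_g, df_g = fit[general]
    diff, ddf = l2_r - l2_g, df_r - df_g
    return diff, ddf, chi2_sf(diff, ddf)


for pair in [("H2", "H1"), ("H3", "H1"), ("H4", "H5")]:
    diff, ddf, p = lr_test(*pair)
    print(pair, round(diff, 2), ddf, round(p, 3))
```

Each call compares a restricted model with the more general model it is nested in, so the difference in $L^2$ is referred to a chi-square distribution whose degrees of freedom equal the difference in df.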


New NES Item, 1980-94 


Next, we model the new NES item between 1980 and 1994. If DiMaggio 
et al. (1996) had analyzed this series by inappropriately treating the 
1980-94 data in table 2 as interval and regressing the series of means, 
the natural logarithm of the variances, and the kurtosis series on time, 
they would have concluded that there is strong evidence that attitudinal 
variability increased between 1980 and 1994 (slope .004; SE = .001), and 
somewhat weaker evidence suggesting that attitudes may have liberalized 
over this period (slope .009; SE = .005). For the kurtosis series, the slope
is −.0013 with standard error .004.*

Table 5 presents model evaluation statistics for a number of the models
we considered. Under model $M_1$, the means and variances are allowed to
vary across survey years; however, the cutpoint $\tau_3$ does not vary by
year. It is evident that this model does not fit the data adequately
(P < .001). However, before concluding that the data are incompatible with
the hypothesis of a normally distributed latent attitude toward legalized
abortion, we reexamine the assumption in model $M_1$ that the cutpoints are
constant from 1980 to 1994. It is important to note that by allowing all
the cutpoints to vary freely over time, the proportions in table 2 can be
reproduced exactly. That is, a model with variable cutpoints can always
be obtained that will fit these data, whether or not the model correctly
describes the underlying process generating the data. Consequently, we


* For the kurtosis series, the slope is −.0013; SE = .004.
TABLE 5
MODEL EVALUATION STATISTICS FOR MODELS OF ATTITUDES TOWARD
LEGALIZED ABORTION

Model      Description                                         $L^2$      BIC    df
1972-80 data:
$H_1$      Fit all means and variances and $\tau_3$              .72   −26.24     3
$H_2$      $H_1$ with constant variance                         3.72   −50.20     6
$H_3$      $H_1$ with constant means                            8.24   −45.68     6
$H_4$      $H_2$ with constant means                           10.89   −69.98     9
$H_5$      $H_2$ with linear trend in means                     3.82   −68.07     8
1980-94 data:
$M_1$      Fit all means and variances and $\tau_3$            35.28   −31.14     7
$M_2$      $M_1$ with $\tau_3$ change for 1990-94               6.14   −50.79     6
$M_3$      $M_2$ with constant variance                        19.91  −103.43    13
$M_4$      $M_2$ with linear trend in ln(variance)             19.58   −94.28    12
$M_5$      $M_2$ with different variance in 1986               13.16   −99.70    12
$M_6$      $M_3$ with constant means                           46.88  −142.88    20
$M_7$      $M_3$ with linear trend in means                    44.94  −135.33    19
$M_8$      $M_6$ with different mean in 1992                   24.25  −156.03    19
$M_9$      $M_1$ + linear trend in ln(variance)                49.03   −74.32    13
$M_{10}$   $M_1$ + constant variance with different            42.72   −71.14    12
           variance in 1986 and 1990-94
$M_{11}$   $M_{10}$ + constant means with different            68.91  −101.88    18
           mean in 1992





believe that models with variable cutpoints are useful only when theo- 
retical considerations strongly suggest their use, and we would further 
argue that hypotheses about cutpoint changes should be empirically 
testable. 

To begin our reexamination, note that for categories 1 and 2 in table 
B1, the language in the NES abortion item clearly specifies the circum- 
stances under which the law should permit abortion, but this is not the 
case for category 3. Whereas the phrase “permit abortion for reasons other 
than rape, incest, or danger to the woman’s life” demarcates response 
category 3 from category 2, the circumstances that constitute the clear 
establishment of a need for an abortion are not fixed by the item, leaving 
respondents to demarcate the choice between categories 3 and 4. Signif- 
icantly, the item also does not say who establishes the “need” for an 
abortion. Is it the woman and her doctor, or is it the state? As a result 
of this ambiguity, we believe that there are good reasons to suspect that 
changes in the public discourse on abortion rights caused this point of 
demarcation to change after 1989 as the result of media and activist 
organizations’ response to the 1989 Webster decision. In July 1989, the 
Supreme Court, in Webster v. Reproductive Health Services, rendered the 
most important legal ruling on abortion since Roe v. Wade in 1973. The 
Webster decision ruled that restrictions on access to legal abortions were
not unconstitutional, upholding a Missouri statute prohibiting the use of 
public facilities to perform abortion and requiring physicians to test for 
fetal viability after 20 weeks of pregnancy (Cook 1998). 

The Webster ruling indicated that states could now attempt to restrict 
abortion access (Garrow 1994), and as a result, the abortion issue was 
thrust into the political arena as state legislatures proposed legislation 
restricting public funding for abortions, requiring parental consent, and 
testing for fetal viability (Saletan 1997). Subsequently, the Supreme 
Court’s willingness to approve restrictive legislation in Webster was sup- 
ported by the 1992 Planned Parenthood v. Casey decision, which upheld 
the constitutionality of restrictive legislation in Pennsylvania that required 
a 24-hour waiting period for abortions, parental consent, and reports on 
abortions performed in publicly funded institutions. Currently, 29 states 
have passed legislation requiring parental consent of minors’ abortion 
decisions, 14 require a delay following state-directed counseling, 13 restrict 
private or public insurance coverage, 34 restrict Medicaid coverage, and 
26 restrict postviability abortions.* 

Although most people may not understand the intricacies of the Webster 
decision (see Hunter [1994] for a depiction of the public’s poor under- 
standing of Supreme Court rulings about abortion), we argue that the 
public discourse on abortion after Webster affected respondents’ inter- 
pretation of the NES abortion item. In particular, we argue that the public 
discourse affected the semantics of category 3 of the NES question (see 
app. B), which stresses the legitimacy of abortion under the subjectively 
defined circumstances of “clearly established need,” by associating it with 
increased government control over what is and is not a legitimate abortion. 
Even though the proposed restrictions of the Webster decision may not 
prohibit most women from obtaining abortions, many pro-choice organ- 
izations viewed them as a direct threat to the Roe v. Wade framework. 
The National Organization for Women, for example, has argued that 
although late-term abortions constitute only a tiny fraction of all abortions, 
bans on late-term and partial-birth abortions are worded ambiguously 
and “are intended to restrict access to all abortions.” In a review of the 
politics of abortion rights, Saletan (1997) argued that the Webster case 
galvanized the pro-choice movement to reframe its defense of abortion 
rights. For example, the National Abortion Rights Action League 


* See The Status of Major Abortion-Related Laws and Policies in the States,
December 31, 2000, Alan Guttmacher Institute special report, at
http://www.agi-usa.org/pubs/abort_law_status.html.

* Issue Report: Abortion Procedure Bans, NOW, at
http://www.now.org/issues/abortion/laterep.html; we viewed this document in
1999.
(NARAL) adopted the new slogan “Who Decides—You or Them?” Saletan
(1997, p. 118) argues that the value of this slogan was that it could appeal 
to a broader audience: “Voters with conservative instincts who shied away 
from the outright prohibition of immoral behavior could embrace 
NARAL’s new message as a rejection of big government. In the minds 
of these voters, ‘them’ meant nosy, corrupt politicians and bureaucrats; 
‘you’ meant families and communities.” 

Before the Webster decision, many NES respondents may have assumed 
that establishing the “need” for an abortion was the result of a private 
decision between a woman and her doctor. At the time, support for abor- 
tion in cases where the desire for an abortion was established in consul- 
tation with a doctor was high. For example, before the Webster decision, 
a CBS/New York Times survey in April 1989 indicated that 72% of those 
with opinions said that “if a woman wants to have an abortion and her 
doctor agrees to it,” she should be allowed to have an abortion (Dionne 
1989). However, the abortion discourse post-Webster encourages respon- 
dents to think that state governments will increasingly intervene in the 
doctor-patient relationship by placing additional restrictions on abortions 
that weaken the original legal framework of Roe v. Wade. In other words, 
who will decide when the need for an abortion has been “clearly estab- 
lished”—the woman and her doctor or the state? As a result, respondents 
may now associate the ambiguous language of the third category of the 
NES abortion question with increased state control over what constitutes 
a legitimate need for abortion. We argue that, as a result, the cutpoint 
separating categories 3 and 4 decreased post-Webster. 

The rationale for the foregoing argument can also be seen from figure
1. Respondents whose latent attitude toward abortion ($Y^*$) is less than −1
respond that the law should never permit abortion. Similarly, respondents
with values of $Y^*$ between −1 and 0 respond that the law should permit
abortion only in cases of rape, incest, or danger to the mother’s life. In
both of these cases, the circumstances under which abortion is permissible
are clearly specified by the item, and thus there is no reason to hypothesize
a change in cutpoints pre- and post-Webster. Now consider two hypothetical
respondents. Suppose that while the first respondent would endorse abortion
in circumstances other than rape, incest, or danger to the mother’s life,
he/she also favors restrictive legislation requiring fetal viability
testing, parental consent, or bans on public funds for abortions. In terms
of figure 1, this respondent’s value of $Y^*$ is between 0 and $\tau_{3a}$,
where $\tau_{3a}$ represents the dividing line between those who agree or do
not agree with increased state control over what constitutes a legitimate
abortion. In contrast, suppose the second respondent does not agree with
these proposed restrictions, but also does not think abortion should be
legal in every situation. In terms of figure 1, this respondent’s value of
$Y^*$ is between $\tau_{3a}$ and $\tau_3$. If the NES item analyzed here were
administered to these two respondents prior to 1989, both would answer the
item using response category 3.

However, as argued above, the discourse on abortion rights after 1989
may cause the cutpoint $\tau_3$ to decrease because the phrase “clearly
established need” is now associated with the threat of increased state
control: fetal viability tests, parental consent laws, bans on public
funding, etc. How would this affect individual responses to the NES abortion
question? After the ruling, the first respondent will still answer using
response category 3 despite the shift in the third cutpoint, because he or
she agrees with the proposed restrictions. In contrast, however, the second
respondent is now forced to choose either category 4, requiring him/her
to endorse abortion under a wider range of circumstances than he/she
would ideally want to permit, or to choose category 3, which is now
associated with increased state control in regulating a woman’s “need”
for an abortion. Presumably, if the second respondent’s value of $Y^*$ is
“very close” to $\tau_{3a}$, he/she will choose category 3, and if this value
is “very close” to $\tau_3$, category 4 will be chosen. Thus, it is
reasonable to hypothesize, as herein, that there is a point $\tau_{3b}$
between $\tau_{3a}$ and $\tau_3$ such that respondents with more than
$\tau_{3b}$ units of $Y^*$ choose category 4, while respondents with between
0 and $\tau_{3b}$ units of $Y^*$ choose category 3. $\tau_{3b}$ is not the
same as the hypothetical cutpoint $\tau_{3a}$ because $\tau_{3a}$ represents
the dividing line between respondents based upon the hypothetical division
of category 3 into two distinct categories (resulting in a total of 5
categories). In contrast, $\tau_{3b}$ represents the shift of the cutpoint
dividing categories 3 and 4 in the case where there are only 4 categories.
In other words, after the Webster decision, the third cutpoint shifts to the
left from $\tau_3$ to $\tau_{3b}$.

In light of this hypothesized effect of the abortion discourse after 1989
on the NES item, we modified model $M_1$, allowing the cutpoint separating
categories 3 and 4 to change in 1990 and subsequent years. The resulting
model ($M_2$) fits the data very well (P = .41), and the estimated cutpoints
for the first and second time periods are, respectively, .495 and .364,
consistent with our hypothesis that the cutpoint is smaller in 1990 and
thereafter. To test whether or not there is a significant difference between
the two cutpoints, recall that we estimated the natural logarithms of these
quantities; on this scale, the difference between the two maximum-likelihood
estimates is $\ln(\tau_3) - \ln(\tau_{3b}) = .305$ (SE = .057). Hence a
one-tailed test of the null hypothesis that the cutpoints are identical
versus the alternative that the second is smaller leads to rejecting the
null in favor of the alternative at the .001 level.
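
The one-tailed test just described amounts to comparing the estimated log-scale difference with its standard error. A minimal sketch in Python, assuming the usual large-sample normal approximation for the maximum-likelihood estimates (the helper name is illustrative):

```python
import math


def upper_tail_normal(z):
    """P(Z > z) for a standard normal variate."""
    return 0.5 * math.erfc(z / math.sqrt(2))


# Values reported in the text for the log-cutpoint contrast.
difference, standard_error = 0.305, 0.057
z = difference / standard_error
p_one_tailed = upper_tail_normal(z)
print(round(z, 2), p_one_tailed < 0.001)
```

The ratio is a little over five standard errors, so the one-tailed P-value falls far below the .001 level cited in the text.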

We now ask whether or not the variance of abortion attitudes increased
over the period 1980-94. To address this issue, we consider model $M_3$, in
which the variance is constant and the means are unrestricted. This model
also fits the data adequately (P > .05), consistent with the conclusion of
no trend in the variance. However, the conditional likelihood ratio test
of $M_3$ versus $M_2$ ($L^2[M_3] - L^2[M_2] = 13.77$; df = 7) has a P value
of .055, suggesting possible heterogeneity in the variation across years.
DiMaggio et al. (1996) fit a linear model to the variance estimates in their
data, claiming the variance increases over time. In model $M_4$, the
logarithms of the variances follow a linear model:
$\ln(\sigma^2_t) = \alpha + \beta(t - 1980)$. From table 5 it is clear that
the data offer no grounds for preferring $M_4$ to $M_3$, as there is almost
no improvement in $L^2$ at the cost of one degree of freedom
($L^2[M_3] - L^2[M_4] = 0.33$; df = 1); that is, the null hypothesis of
no linear trend in the variance is not rejected.

Fig. 2. Trends in the mean and variance of abortion attitudes 1980-94 under
model $M_2$ (horizontal axis: year, 1980 to 1994)

Figure 2 plots the estimated means and variances under model $M_2$
against time; recall that because these parameters are identified only up
to location and scale, only comparisons amongst the means and the variances
are of interest. The estimated $\sigma^2_t$ fluctuate irregularly, consistent
with the hypothesis that the variation does not increase between 1980
and 1994. However, although there is no obvious trend in these estimates,
it also appears that the variance may be larger in 1986 or 1992 than in
the other years; similarly, the variance may be smaller in 1990 than in
the other years. Ex post facto, this suggests considering models where the
variance may be smaller (larger) in a given year than in other years, testing
such models against $M_3$. There are eight such models, each adding a single
parameter to $M_3$. If we had a priori substantive reasons for focusing on
prespecified years, then it would be legitimate to estimate a subset of
these models, but as this is not the case, all eight models should be
considered, which raises the issue of overfitting the data. Therefore, to
test the null hypothesis of constant variation against the alternative of
nonconstant variance, we test each model against $M_3$ using a significance
level of .05/8 (Miller 1981). Of the eight models considered, the best
fitting ($M_5$) allows the variance to differ in 1986. However, the
likelihood-ratio test of $M_5$ versus $M_3$ is not significant at the .05/8
level, and the BIC statistic (Raftery 1995) suggests that $M_3$ should be
preferred to $M_5$.
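
The multiple-comparison adjustment described above can be illustrated with the values in table 5. The sketch below is an illustration rather than the authors' computation: it uses the fact that a chi-square variate with 1 df is the square of a standard normal variate, so its tail probability is available from `math.erfc` without any external library.

```python
import math


def chi2_sf_1df(x):
    """Upper-tail probability of a chi-square variate with 1 df."""
    return math.erfc(math.sqrt(x / 2))


n_tests = 8
threshold = 0.05 / n_tests  # adjusted level for eight ex post facto tests

# Table 5: M3 (constant variance) vs. M5 (different variance in 1986).
l2_m3, l2_m5 = 19.91, 13.16
p_value = chi2_sf_1df(l2_m3 - l2_m5)
print(round(p_value, 4), p_value < threshold)
```

The unadjusted P-value is below .05 but above .05/8, which is the sense in which the 1986 variance difference is not significant once all eight candidate years are taken into account.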

To this point, the results suggest selecting either $M_2$ or $M_3$ as the
preferred model; importantly, neither of these suggests that attitudes toward
legalized abortion have polarized over the period 1980-94. Of course,
many models other than those considered here are nested between $M_2$
and $M_3$. However, because we think it unlikely that the conclusions about
polarization one would draw from $M_2$ or $M_3$ would change by choosing
such a model, we limit our selection to these two models, choosing $M_3$
over $M_2$ on the basis of the BIC statistic.

Having established that the variance did not increase between 1980
and 1994, we now turn attention to modeling the means, using $M_3$ as a
baseline model. We first compare model $M_6$, in which the means are
constant, with model $M_3$; the conditional likelihood-ratio test
($L^2[M_6] - L^2[M_3] = 26.97$; df = 7; P < .001) suggests some
heterogeneity. Following DiMaggio et al. (1996), we model the means as a
linear function of time in model $M_7$; the model does not fit the data as
well as $M_3$ ($L^2[M_7] - L^2[M_3] = 25.03$; df = 6; P < .001), and the fit
is virtually identical to that of $M_6$.

Figure 2 suggests the mean is larger in 1992 than in the other years;
therefore model $M_8$ allows the mean to differ in this year. Note that model
$M_3$ does not fit better than $M_8$ ($L^2[M_8] - L^2[M_3] = 4.34$; df = 7;
P = .74) and that if the BIC statistic is used to select between these
models, $M_8$ is preferred. Note also that $M_8$ is preferred to the more
parsimonious $M_6$ on these grounds. Finally, although $M_6$ does not fit the
data unconditionally at the .05 level, the difference
$L^2(M_6) - L^2(M_8) = 22.63$ on one degree of freedom suggests a large
improvement, and even though the contribution from $M_6$ does not have a
chi-square distribution, the chi-square distribution for differences between
$L^2$ values is often approximately valid in such cases; assuming the
validity of this approximation, the P value is less than .001, substantially
less than .05/8.

Our results thus lead to selecting model $M_8$ to describe the data; table
6 reports the estimated variance, the two estimated means ($\mu_1$ is the
mean for all years except 1992), and the two estimated cutpoints. The
estimated cutpoint prior to the Webster decision is .491, while the estimated
cutpoint for 1990 and later is .369. As before, the ordering is consistent
with our a priori expectations, and the difference between the estimates is
significant, on the basis of a one-tailed test, at the .001 level of
significance. Although we cannot explain why attitudes are more favorable to
legalized abortion in 1992 than in the other years, the evidence does not
support the hypothesis that Americans are becoming more favorable to
legalized abortion over the study period. Finally, recalling that the
variance is constant and that the kurtosis is also constant because the
latent attitude is normally distributed, the data offer no support for the
hypothesis of increasing polarization.

Parameter            Estimate     SE
$\ln \tau_3$           −.711     .027
$\ln \tau_{3b}$        −.995     .041
$\mu_1$                 .151     .012
$\mu_{1992}$            .325     .036
$\ln \sigma^2$          .029     .027

$\tau_3 = \exp(-.711) = .491$.
$\tau_{3b} = \exp(-.995) = .369$.
$\sigma^2 = \exp(.029) = 1.03$.

Our substantive conclusions about the 1980-94 period are obtained 
using models for ordinal variables in which the cutpoint separating the 
third and fourth categories of the NES item changes after the Webster 
decision. The models fit the data well, whereas models that do not allow 
this cutpoint change do not fit the data. We have also given good reasons 
to expect such a change and correctly specified the direction of this change. 
Further, the empirical impact of this change seems reasonable; using the
preferred model $M_8$, in 1990, 14.4% of the respondents are predicted to
fall in category 3, 41.5% in category 4. If the pre-Webster cutpoint were
used instead, 4.6% of the respondents would be reallocated from category
4 to category 3. In addition, the best-fitting model with a cutpoint change
occurs at the time point we identified a priori; no model with a cutpoint
change in another year fits the data at the .05 level. For the model with
a cutpoint shift in 1982, $L^2 = 35.3$ (df = 6; P < .001). For 1984,
$L^2 = 33.4$ (df = 6; P < .001), and for 1986, $L^2 = 23.3$ (df = 6;
P < .001). For the model with a shift in 1988, $L^2 = 11.89$ (df = 6;
P = .06); the near fit of this model is explained by its similarity to
$M_2$. For the model with a cutpoint shift in 1992, $L^2 = 28.0$ (df = 6;
P < .001); for the model with a shift in 1994, $L^2 = 31.3$ (df = 6;
P < .001). Despite the strong justifications we have given, a critic could
always attack our conclusions, arguing that the assumptions used to construct
the statistical model are faulty or that our use of $L^2$ both to measure
goodness of fit and to aid in the model selection process is misguided; for
example, obviously one could
argue that instead of a cutpoint change, the latent variable follows some 
unspecified distribution other than the normal after 1988. Similarly, as 
the model parameters are not identified (see eq. 2 in app. A), conclusions 
about the variability of the series can depend on the parameterization of 
the model. Although we believe our parameterization offers the most 
plausible representation of the behavioral process pre- and post-Webster 
respondents might have employed in responding to the new NES item, 
a critic could always reject our line of argument. We cannot address all 
these potential criticisms, nor can we handle all of these within the mod- 
eling framework proposed herein, and even were we to construct a more 
general framework, untestable assumptions used in the model construc- 
tion process would still be a target for criticism. 
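
The predicted 1990 proportions quoted earlier in this discussion (14.4% in category 3, 41.5% in category 4, and a 4.6% reallocation under the pre-Webster cutpoint) can be reproduced from the table 6 estimates. The sketch below assumes the parameterization described in the text, with the first two cutpoints fixed at −1 and 0 and a normal latent attitude; the function names are illustrative and the code is not the authors' own.

```python
import math


def normal_cdf(z):
    """Standard normal distribution function."""
    return 0.5 * math.erfc(-z / math.sqrt(2))


def category_probabilities(mean, variance, cutpoints):
    """P(Y = k) for an ordinal item with a normal latent attitude."""
    sd = math.sqrt(variance)
    cdf = [0.0] + [normal_cdf((c - mean) / sd) for c in cutpoints] + [1.0]
    return [upper - lower for lower, upper in zip(cdf, cdf[1:])]


# Estimates from table 6 (third cutpoint and variance are on the log scale).
variance = math.exp(0.029)
mean_1990 = 0.151  # mean for all years except 1992
tau3_pre, tau3_post = math.exp(-0.711), math.exp(-0.995)

post = category_probabilities(mean_1990, variance, [-1.0, 0.0, tau3_post])
pre = category_probabilities(mean_1990, variance, [-1.0, 0.0, tau3_pre])

# Share that would move from category 4 to category 3 under the old cutpoint.
shift = post[3] - pre[3]
print([round(p, 3) for p in post], round(shift, 3))
```

Only the third cutpoint differs between the two calls, so the shares in categories 1 and 2 are unchanged and the whole difference shows up as movement between categories 3 and 4.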

Although we believe our analysis is compelling, it is simply less defin- 
itive than would be the case were the NES item measured at the interval 
or ratio level. For the sake of completeness and so that the reader may 
judge all the evidence for him or herself, we now report the findings we 
would have obtained had we confined attention to models without a 
cutpoint change. In that vein, recall that under model $M_1$, the means and
variances were allowed to vary over the survey years, but the cutpoint
was the same in all years; we rejected this model because it did not fit
the data at any reasonable significance level, using $L^2$. However, given
the large sample size, one might try to argue that model $M_1$ provides a
reasonable description of the data (this is not an argument we would
endorse, but it is not an uncommon type of argument). Were we to proceed
along such lines, a natural model to consider next is $M_9$; this is a
special case of model $M_1$ in which the logarithms of the variances change
linearly over time. Were we to accept the results from model $M_9$ at face
value, the trend coefficient is significantly different from zero at the .05
level (.023; SE = .005). However, if we use the conditional likelihood-ratio
test, $M_9$ is inferior to $M_1$ ($L^2[M_9] - L^2[M_1] = 13.75$; df = 6;
P = .03), which suggests that a linear trend does not adequately capture the
temporal change in attitudinal variability. In addition, the variances were
modeled using polynomials up to order 3 with no improvement. Next, inspection
of the variances estimated under model $M_1$ suggested model $M_{10}$, in
which there are three distinct variances ($\sigma^2_1$ for 1980, 1984, and
1988; $\sigma^2_2$ for 1986; and $\sigma^2_3$ for 1990, 1992, and 1994).
Under this model, $\sigma^2_3$ is greater than $\sigma^2_1$ (using the .05
level), suggesting (with the exception of 1986) that polarization is higher
in the 1990s than the 1980s. A similar pattern shows up in model $M_{11}$;
this model is obtained from $M_{10}$ by constraining the means (except
for the mean in 1992) to be identical.

The ad hoc models $M_{10}$ and $M_{11}$ do not support the hypothesis that
polarization is monotonically increasing over time, but they do suggest
ex post facto that the variability of abortion attitudes is greater in the
1990s than in the 1980s (save 1986). However, none of the models above
come close to fitting the data at conventionally used significance levels,
and if the BIC is used as a model selection criterion, model $M_8$, in which
the variability of abortion attitudes is constant, is still clearly preferred
to any of those just considered.


DISCUSSION 


Sociologists and journalists have argued that recent “cultural wars” over 
politicized moral issues have resulted in rising levels of attitude polari- 
zation and social conflict. According to these writers, the level of polar- 
ization and fragmentation among the public as a whole is so irreconcilable 
that democratic compromise on these issues is increasingly difficult, posing 
a strong threat to the viability of American political institutions. The 
controversy over abortion exemplifies the concerns of these writers, who 
often use this as the leading example with which to make their case. 
Typically, however, no empirical evidence of increased attitudinal polar- 
ization is presented; rather, the purported increase is indirectly inferred 
from the outcomes with which such polarization is (in theory) associated. 
In an important contribution, DiMaggio et al. (1996) examined temporal 
trends in public opinion data to assess this thesis, finding little evidence 
of increasing polarization, with opinions about abortion constituting the 
major exception. In general, however, their conclusions may be misleading 
because they analyze nominal and ordinal opinion items using methods 
designed for interval measurements. In particular, DiMaggio et al. in- 
appropriately analyze both the GSS data and the NES data on abortion. 

We develop new methods for studying opinion polarization using or- 
dinal data. In our model, a generalization of the cumulative probit model, 
the latent attitude toward abortion follows a normal distribution at each 
time point. In contrast to DiMaggio et al., our results do not suggest an 
increase in polarization. Interestingly, with respect to abortion, one of the 
most controversial public issues, our results suggest support for the more 
general conclusion in DiMaggio et al.: public opinion has not polarized 
much over the last 25 years. Nonetheless, two caveats are in order. First, 
because our model is not appropriate for analyzing the GSS data on 
abortion, our conclusions are based solely on the analysis of the NES 
item; it is conceivable that an appropriate analysis of the GSS data would 
find evidence of increasing polarization. Second, the latent variable in our 
analysis is assumed to follow a normal distribution, and in our analysis 
of the 1980-94 data, we allowed the cutpoint separating categories 3 and 
4 to change after 1988. Although we offered both theoretical and empirical 
support for this analysis, because we are using ordinal data and must
therefore make assumptions that cannot be fully tested, our analysis will 
always be open to criticism by those who want to maintain that polari- 
zation in abortion attitudes has been increasing. However, at this point, 
we believe that our analyses are strong enough to place the burden of 
proof upon the would-be critic. In future work on this topic, we intend 
to extend our model to include distributions other than the normal, and 
we intend to develop appropriate methods for studying polarization when 
multiple items are used to measure the underlying attitude (as is the case 
in the GSS). 

Do the culture wars exist? Proponents of the culture wars hypothesis 
claim that America is becoming increasingly divided over politicized 
moral issues. Abortion is an issue where the debate is particularly divisive. 
Hunter (1991, 1996), for instance, argues that because of the sound-bite 
society we live in, special interest groups and political organizations at- 
tempt to inflame public opinion about abortion and undermine their op- 
ponents rather than participate in genuine debate. This contributes to a 
“hollowing out” of the possibility of achieving consensus and raises the 
specter of violence as a means of imposing one’s beliefs upon others. 
However, our results show that there is no evidence that Americans as a 
whole became more divided over abortion rights between 1972 and 1980 
nor between 1980 and 1994. In the lag between submission and publi- 
cation of this article, new NES data have become available, and we have 
extended the 1980-94 analysis to 2000. In addition to models analogous 
to those considered in this paper, we designed new models to ascertain
whether polarization had increased during the 1990s. Our results indicate
no increase in polarization in abortion attitudes between 1980 and 2000 
(complete results available from authors on request). While the contem- 
porary culture wars have been portrayed as a threat to democratic in- 
stitutions, our evidence suggests that, at least with respect to the debate 
over abortion rights, the American public is no more divided now than 
in the past. 


APPENDIX A 


Estimation of the Model 


Our model extends the usual cumulative probit model to allow for
heteroscedasticity and variable cutpoints. Let $Y$ denote an observed ordinal
variable with response categories $j = 1, \ldots, J$, let $Y^*$ denote a metrical
latent variable, and let $X$ denote a vector of covariates. Under the model
herein, the distribution of $Y^*$, given $X = x$ (denoted $\phi[y^* \mid x]$), is normal,
with mean $\mu(x; \theta)$ and standard deviation $\sigma(x; \theta)$, where $\theta$ is a vector of
parameters to be estimated. The observed variable $Y$ is a quantized version
of the latent variable $Y^*$; for each level $x$ of the covariates, $Y = 1$
if and only if $Y^* \le \tau_1(x; \theta)$, $Y = 2$ if and only if
$\tau_1(x; \theta) < Y^* \le \tau_2(x; \theta)$, ..., and $Y = J$ if and only if
$Y^* > \tau_{J-1}(x; \theta)$. It follows that

$$
\operatorname{prob}(Y = j \mid X = x) =
\Phi\!\left[\frac{\tau_j(x; \theta) - \mu(x; \theta)}{\sigma(x; \theta)}\right] -
\Phi\!\left[\frac{\tau_{j-1}(x; \theta) - \mu(x; \theta)}{\sigma(x; \theta)}\right],
\tag{A1}
$$

where $\Phi$ denotes the distribution function of the standard normal
distribution, and for all $x$, $\tau_0(x; \theta) = -\infty$, $\tau_J(x; \theta) = \infty$. As

$$
\frac{\tau_j(x; \theta) - \mu(x; \theta)}{\sigma(x; \theta)} =
\frac{\kappa_x[\tau_j(x; \theta) + \lambda_x] - \kappa_x[\mu(x; \theta) + \lambda_x]}{\kappa_x\,\sigma(x; \theta)}
\tag{A2}
$$

for any location shift $\lambda_x$ and scale factor $\kappa_x > 0$,
it is evident that at each level of the covariates, the parameters $\mu(x; \theta)$ and
$\sigma(x; \theta)$ are identified only up to location and scale. In the usual cumulative
probit model, it is assumed that neither the cutpoints $\tau_j(x; \theta)$ nor the
variance $\sigma^2(x; \theta)$ depends on $x$. Therefore, only two restrictions need be
imposed to identify the model; the restrictions $\tau_1(x; \theta) = \tau_1 = 0$ and
$\sigma^2(x; \theta) = \sigma^2 = 1$ are conventionally used, though other restrictions could
be used and might be more desirable in certain types of applications.
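
The identification argument can be checked numerically. The following is a minimal Python sketch, not the authors' Stata implementation: the function name `category_probs` and all numeric values are illustrative assumptions. It computes the category probabilities in (A1) and shows that shifting and rescaling the cutpoints, mean, and standard deviation together, as in (A2), leaves those probabilities unchanged.

```python
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal; its cdf plays the role of Phi


def category_probs(cutpoints, mu, sigma):
    """Return [prob(Y = 1), ..., prob(Y = J)] as in (A1).

    cutpoints holds the J - 1 finite, increasing cutpoints; the outer
    cutpoints are taken to be -infinity and +infinity.
    """
    cdf_values = [0.0]
    cdf_values += [STD_NORMAL.cdf((tau - mu) / sigma) for tau in cutpoints]
    cdf_values.append(1.0)
    return [hi - lo for lo, hi in zip(cdf_values, cdf_values[1:])]


# Illustrative values only (assumptions, not estimates from the article).
cutpoints, mu, sigma = [-1.2, -0.15, 0.34], 0.0, 1.0
base = category_probs(cutpoints, mu, sigma)

# Shift every location by lam and rescale by kappa > 0, as in (A2).
lam, kappa = 2.0, 3.0
moved = category_probs(
    [kappa * (tau + lam) for tau in cutpoints],
    kappa * (mu + lam),
    kappa * sigma,
)

print([round(p, 4) for p in base])
# The largest discrepancy is at the level of floating-point rounding.
print(max(abs(a - b) for a, b in zip(base, moved)))
```

Because the two parameter sets imply the same probabilities, the data cannot distinguish them, which is why two restrictions are needed.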

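We now briefly discuss maximum-likelihood estimation of the extended
model. We assume the parameter vector $\theta$ is identified (as is the case for
the models considered in this paper).

Let $(x_i, y_i)$, $i = 1, \ldots, n$, denote the observed data (hereafter $D$), and,
for $j = 1, \ldots, J$, let $y_{ij} = 1$ if $y_i = j$, 0 otherwise. Writing
$z_j(x; \theta) = [\tau_j(x; \theta) - \mu(x; \theta)]/\sigma(x; \theta)$, the log likelihood is

$$
\ln L(\theta; D) = \sum_{i=1}^{n} \sum_{j=1}^{J} y_{ij} \ln \operatorname{prob}(Y_i = j \mid X_i = x_i)
= \sum_{i=1}^{n} \sum_{j=1}^{J} y_{ij} \ln\left(\Phi[z_j(x_i; \theta)] - \Phi[z_{j-1}(x_i; \theta)]\right),
\tag{A3}
$$

and this is to be maximized with respect to $\theta$.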

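Under the usual types of regularity conditions (see, e.g., Amemiya 1985),
a maximum-likelihood estimator $\hat{\theta}$ of $\theta$ is a solution of the likelihood
equations:

$$
\frac{\partial \ln L(\theta; D)}{\partial \theta}
= \sum_{i=1}^{n} \sum_{j=1}^{J} y_{ij}\,
\frac{\partial \ln \operatorname{prob}(Y_i = j \mid X_i = x_i)}{\partial \theta} = 0.
\tag{A4}
$$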
In large samples, $\hat{\theta}$ is approximately normally distributed with mean $\theta$
and covariance matrix $n^{-1}\Sigma(\theta)$. The matrix $\Sigma(\theta)$ can be estimated
consistently by $\left(-(1/n)\,[\partial^2 \ln L(\hat{\theta}; D)/(\partial\theta\,\partial\theta')]\right)^{-1}$; for the model here,


yo Pro, = j[X, =x,) 
To estimate the parameters and their standard errors, a modified Newton-Raphson
algorithm using numerical derivatives for (A4) and (A5), implemented
in the Stata package, was used. For further details, see the description
of the linear form routine in the section on maximum-likelihood
estimation in the Stata reference manual (StataCorp 1997). The
code used to implement the model in Stata is available online from the
authors.

Because our covariate (X) is discrete, we maximized the equivalent
grouped-data log likelihood, which avoids computing the log likelihood
for each individual observation:

$$
\ln L(\theta; D) = \sum_{t} \sum_{j} n_{tj} \ln\left[\operatorname{prob}(Y_t = j \mid X_t = t)\right],
\tag{A7}
$$

where $n_{tj}$ denotes the number of observations at time $t$ in category $j$ of $Y$.
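
The equivalence of the grouped and individual-level log likelihoods can be illustrated with a short Python sketch. This is not the Stata code mentioned above; the function names, the counts, and the parameter values are all hypothetical and serve only to show that (A7) and (A3) give the same number.

```python
import math
from statistics import NormalDist

STD_NORMAL = NormalDist()


def category_probs(cutpoints, mu, sigma):
    """Category probabilities of an ordered probit with J - 1 cutpoints."""
    cdf_values = [0.0]
    cdf_values += [STD_NORMAL.cdf((tau - mu) / sigma) for tau in cutpoints]
    cdf_values.append(1.0)
    return [hi - lo for lo, hi in zip(cdf_values, cdf_values[1:])]


def grouped_loglik(counts, model):
    """Sum over t and j of n_tj * ln prob(Y = j | X = t), as in (A7).

    counts maps each time point t to [n_t1, ..., n_tJ]; model(t) returns
    the (cutpoints, mu, sigma) implied by the parameters at time t.
    """
    total = 0.0
    for t, row in counts.items():
        probs = category_probs(*model(t))
        total += sum(n * math.log(p) for n, p in zip(row, probs) if n > 0)
    return total


def individual_loglik(observations, model):
    """The same quantity computed one respondent at a time, as in (A3)."""
    return sum(
        math.log(category_probs(*model(t))[y - 1]) for t, y in observations
    )


# Hypothetical counts for two time points (not data from the article).
counts = {0: [115, 327, 190, 368], 1: [100, 300, 200, 400]}


def model(t):
    # Hypothetical parameters: the latent mean and SD both drift with t.
    return [-1.2, -0.15, 0.34], 0.1 * t, 1.0 + 0.05 * t


# Expand the table of counts into one (t, y) record per respondent.
observations = [
    (t, j + 1)
    for t, row in counts.items()
    for j, n in enumerate(row)
    for _ in range(n)
]

grouped = grouped_loglik(counts, model)
individual = individual_loglik(observations, model)
print(grouped, individual)  # equal up to floating-point rounding
```

The grouped form evaluates the probabilities once per time point rather than once per respondent, which is the saving described in the text.
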
APPENDIX B
Abortion Questionnaire 


There has been some discussion about abortion during recent years. Which 
one of the opinions on this page best agrees with your view? You can 
just tell me the number of the opinion you choose. 


TABLE B1
COMPARISON OF NEW AND OLD ITEMS

OPINION/QUESTION, BY CATEGORY AND PERIOD

CATEGORY 1
  1972-80: Abortion should never be permitted.
  1980-94: By law, abortion should never be permitted.

CATEGORY 2
  1972-80: Abortion should be permitted only if the life and health of the
           woman is in danger.
  1980-94: The law should permit abortion only in case of rape, incest, or
           when the woman’s life is in danger.

CATEGORY 3
  1972-80: Abortion should be permitted if, due to personal reasons, the
           woman would have difficulty in caring for the child.
  1980-94: The law should permit abortion for reasons other than rape,
           incest, or danger to the woman’s life, but only after the need
           for the abortion has been clearly established.

CATEGORY 4
  1972-80: Abortion should never be forbidden, since one should not require
           a woman to have a child she doesn’t want.
  1980-94: By law, a woman should always be able to obtain an abortion as a
           matter of personal choice.
In our 1996 article “Have Americans’ Social Attitudes Become More Polarized?”
(AJS 102 [3]: 690-755) we addressed the substantive question
of whether or not polarization on 18 morally charged social issues had 
increased over a period of approximately two decades. That article ex- 
amined all of the then-available years of data from the General Social 
Survey (GSS) and the National Election Study (NES). The approach was 
necessarily broad, entailing more than 200 separate analyses of polari- 
zation within the population as a whole, within subgroups of the popu- 
lation and between groups. Moreover, each examination in turn required 
the use of three or four statistical models. 

Our major conclusion was that little polarization had occurred over 
these years and that the American public was by many criteria more 
unified in its views in the 1990s than it had been in the 1970s. The most 
important exception to this generalization, based on analyses of data from 
both the NES and the GSS, was that the population’s attitudes toward 
abortion had been polarizing. The results of Mouw and Sobel’s analysis 
of the NES data suggest that attitudes toward abortion had not been 
polarizing, thus challenging the one qualification we made to our central 
argument that polarization had not increased. Thus, although Mouw and 
Sobel challenge the methods we used in 1996, their analysis supports our 


1 We would like to thank Ted Mouw for sharing the Stata code for his procedure and 
lottesville, Virginia 22904-4766. E-mail: bryson@virginia.edu
Comment on Mouw and Sobel


central finding that widespread polarization of public opinion had not 
occurred.

Other scholars working in parallel have developed analogous methods 
for measuring attitude polarization in the senses of dispersion (Gay, El- 
lison, and Powers 1996) and consolidation (Hout 1999). Hoffmann and 
Miller (1998) expanded on our 1996 article to create a method for adjusting 
variance and kurtosis measures that would control for changes in dem- 
ographic variables. 

According to Mouw and Sobel, we and other scholars who have ex- 
amined polarization have erred in treating ordinal scales as if they were 
interval scales. Statisticians have many concerns about this practice, but 
the major problem, as Mouw and Sobel note, is that we do not know if 
the distance between 1 and 2 on the scale is the same as the distance 
between 3 and 4. Mouw and Sobel offer a solution to this problem for 
one of the four dimensions of polarization we discussed, dispersion, and 
analyze one of the 18 issues we examined, abortion (based on one of the 
two scales we used to examine it). Additionally, they criticize us for com- 
bining two different abortion questions in the NES into one time series, 
and they make the substantive claim that polarization over abortion at- 
titudes in the general population has remained unchanged from 1980 to 
1993, the last year in our time series. We respond to their methodological 
and substantive claims separately. 

To anticipate the essence of our comments, we welcome Mouw and 
Sobel’s approach as a valuable new tool for students of polarization. We 
raise some cautionary points of interpretation, but even in pointing to 
limitations in the application of the method we call attention to and 
provide examples of additional ways in which it can be valuable. At the 
same time, we emphasize that the method only addresses one of our four 
indicators of polarization in attitudes toward abortion. Moreover, using 
Mouw and Sobel’s method and additional years of NES data, we find 
support for our substantive conclusions about even this one measure of 
polarization with evidence indicating that polarization in responses to the 
NES abortion item increased between 1980 and 1998. 


THE METHODOLOGICAL CRITIQUE 


We begin by emphasizing a significant point of agreement: Mouw and 
Sobel have made an important methodological contribution to the study 
of polarization. They propose a method for modeling changes in the var- 
iance of categorical data and argue that their method is more appropriate 
than the ordinary least squares (OLS) method we used. Like many meth- 
odological advances, however, theirs entails both costs and benefits. In 
this section we first address their criticisms and then explore the advantages
and limitations of using an ordered heteroscedastic probit analysis
to measure attitude polarization. In the process, we discover a benefit of
their method that Mouw and Sobel do not discuss: They are able to test
for increasing variance in the overall spread of opinion while controlling
for some forms of bimodality.²


Abortion Question Wording Change 


Before we begin that longer discussion, we will address briefly a less 
complex methodological issue. Mouw and Sobel argue that we should not 
have combined the two abortion questions into one time series. Using an 
appropriate adjustment formula, they reveal that our conclusions about
the change in the mean and variance are unaffected by the change in 
wording, but that the decreasing kurtosis identified in our analysis may 
have been an artifact of the wording change. 

Faced with the 1980 wording change, we compared responses before 
and after the change and found (as Mouw and Sobel did) that slightly 
fewer respondents chose middling responses with the new wording. We 
had two choices. First, we could shorten the time span of the data series 
(as Mouw and Sobel do), thus reducing the chance of finding significant 
results, both by decreasing the number of cases and by reducing the 
chronological scope of our analysis. Or we could combine the questions 
and report in our article that the change in question wording could inflate 
estimates of polarization. We chose the latter course, reporting the limi- 
tations of the data and the risk inherent in our decision in table 1 and 
footnote 13 (pp. 700, 702). If we had been testing polarization on this one 
issue with one population and one data set, we would have taken Mouw 
and Sobel’s approach. But ours was a broad analysis of many issues and, 
because we were skeptical of finding polarization, we tried in every case 
to give the polarization hypothesis the benefit of the doubt. Moreover, we 
were confident that analysis of the longer and better-constructed time 
series on abortion attitudes from the GSS would prevent us from drawing 
the wrong conclusions about polarization in abortion attitudes. We accept 
Mouw and Sobel’s point in the context of their single-issue analysis. To 
test whether there has been polarization over abortion, the best course of 
action is to separate the two questions. 


2 Much of this section is taken from a longer discussion of Mouw and Sobel's method 
(Bryson 2000). 
Is This a Comprehensive Test of Polarization?


Let us now turn to the heart of Mouw and Sobel’s methodological critique 
and contribution. Our 1996 article identified four dimensions of polari- 
zation: dispersion, bimodality, constraint, and consolidation. Mouw and 
Sobel only attempt to measure dispersion (variance). They dismiss the 
value of identifying bimodality (kurtosis) on the grounds that their mea- 
sure of dispersion also taps bimodal trends (see our 1996 article, pp. 
695-96, for a critique of this view). Although they do not mention the 
constraint of opinion across issue domains (Cronbach’s alpha) or the con- 
solidation of opinion parameters with structural parameters (group-level 
differences in means), they do discuss changes in means for one group
over time. Such changes in a single group, however, do not indicate po- 
larization. Finally, although they do not make this argument, Mouw and 
Sobel’s general critique could extend to our method of measuring con- 
solidation by comparing group-level differences in means calculated from 
ordinal data. 

In recent work on polarization between and within religious traditions 
in the United States, Evans (2000) used a method that addresses Mouw 
and Sobel’s implicit critique of the consolidation measure. Instead of 
estimating the mean for each year for each group under comparison, he 
used an ordered logistic regression model, with the abortion scale as the 
dependent variable. A group × year interaction term tested whether
changes in response to the abortion question were significantly different 
for the two groups. This approach also made it possible to control for 
covariates that may shape opinions over time, as advocated by Hoffmann
and Miller (1998). He found that polarization in abortion attitudes (in the 
sense of consolidation) has occurred between mainline and evangelical 
Protestants since 1972. 


The Conflation and Separation of Different Dimensions of Polarization 


As Mouw and Sobel note, their method transforms data from the abortion 
question into a new variable that has more desirable qualities including 
a normal distribution. Because the kurtosis of a normal distribution is 
zero, however, they are unable to model changes in kurtosis, an indicator
of bimodality. This would appear to be a severe limitation on their method, 
particularly because bimodality (the clustering of opinion into distinct 
camps) is likely to be more closely related to political conflict than is 
dispersion. Moreover, by allowing the cutpoints to shift, Mouw and So- 
bel’s method effectively absorbs increasing bimodality. To understand this 
claim, we must delve a bit farther into their method. 

Probit is a regression model designed to work with ordinal or dichotomous
dependent variables by transforming the observed data into a
variable that meets the assumptions of regression. Whereas logistic re- 
gression—the most commonly used solution to this problem—turns the 
dependent variable into the log of the odds of an event, probit regression 
addresses the problem by taking the z-score of the probability to produce 
an unconstrained continuous variable. (Specifically, it calculates the cu- 
mulative normal probability of choosing the next higher category of Y.) 
It happens that a z-score includes the mean and standard deviation of 
the (now continuous) variable in question: (x − mean)/SD. The important
point here is that z-scores contain estimates of variance for the new dependent
variable. This is the reason to choose probit rather than some
other transformation. 

Mouw and Sobel's adapted probit procedure has the useful feature of 
making it possible to ascertain how well the model fits with prescribed 
changes in the variance across time (x). It does not focus our attention, 
as do most probit analyses, on the effect of x on y. Instead, Mouw and 
Sobel assess the fit of a model that posits increasing variance over time, 
and compare the fit of that model to the fit of other models that hold
variance constant. Alvarez and Brehm (1995) use this method (derived 
from Harvey 1976) to demonstrate that respondents who hold conflicting 
values on the role of women and the sanctity of human life vary more 
in their responses to abortion questions than respondents who hold con- 
sistent positions on those issues. 

Mouw and Sobel estimate several of these probit models in order to 
test such assumptions as constant variance, increasing variance, increasing 
mean, and several combinations thereof, on the premise that the model 
with the best fit describes the data most accurately. If the best model were 
one that specified a linear change in variance, Mouw and Sobel would 
conclude that there had been a linear polarizing trend in underlying opin- 
ions. To better understand this point, consider the example of the NES 
abortion scale. 

The observed variable, here called ABLAW, has four possible values: 
1, 2, 3, and 4. The probit transformation begins with the probability of 
choosing each response category (represented by the percentage of re- 
spondents in each category). For example, the observed proportions of 
respondents in each category for 1980 were .115, .327, .190, and .368, 
respectively. To predict responses to multiple categories in one equation, 
the method standardizes the cumulative probability for each category, in 
this case, .115, .442, .632, and 1.00. Because such probabilities cannot 
range below 0 or above 1, probit standardizes the (cumulative) probability 
for each category using a z-score: (x − mean)/SD.

Think of the area under a normal curve as a histogram of sorts. A line 
down the middle of a normal curve would divide the area into two equal 
FIG. 1. A normalized histogram


parts (50% in each category). Adding two more lines would divide the 
distribution into four categories, but equally spaced lines do not fit the 
observed data. The left two categories together only account for 44.2% 
of responses, so the middle line should go to the left of center. Figure 1 
represents the 1980 data as areas under a normal curve (mean = 0; SD 
= 1). We divide the bell into four unequal pieces of 11.5%, 32.7%, 19.0%, 
and 36.8%. To make the proportions match the 1980 data, we must draw 
the lines (hereafter, “cutpoints”) dividing that bell in specific locations that 
can be mathematically derived using the inverse of the cumulative normal distribution function. This
is the sense in which the analysis fits parameters to match the data. Every 
time the procedure “fits” a parameter, we lose a degree of freedom. In 
this example, we specified two parameters (the mean and SD). Those two 
parameters defined the scale and shape of the curve and the other three 
parameters (the cutpoints) emerged to fit the data as proportional divisions 
of the area under the curve. We could, however, choose to specify one 
cutpoint and the mean. Then the other three parameters (two cutpoints 
and the SD) could be identified from the data. This is the crux of Mouw 
and Sobel’s approach. It allows probit to identify a meaningful standard 
deviation from the observed data. Together, the five parameters inform 
the equation that estimates latent opinion on abortion law. 
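
The cutpoint calculation described above can be sketched in a few lines of Python. The 1980 proportions are those quoted in the text; the variable names and the alternative fixed cutpoint value of -1.0 are assumptions chosen only for illustration, not values used by Mouw and Sobel.

```python
from itertools import accumulate
from statistics import NormalDist

# Shares of respondents in categories 1-4 in 1980, as quoted in the text.
shares_1980 = [0.115, 0.327, 0.190, 0.368]

# Cumulative shares .115, .442, .632; the final 1.0 has no finite cutpoint.
cumulative = list(accumulate(shares_1980))[:-1]

# Identification 1: fix mean = 0 and SD = 1, then read off the cutpoints
# from the inverse of the cumulative normal distribution function.
std_normal = NormalDist(mu=0.0, sigma=1.0)
cutpoints = [std_normal.inv_cdf(p) for p in cumulative]
print([round(c, 2) for c in cutpoints])  # approximately -1.20, -0.15, 0.34

# Identification 2: fix mean = 0 and the first cutpoint (hypothetical value),
# then the SD and the other two cutpoints follow from the data.
fixed_first_cutpoint = -1.0
sd = fixed_first_cutpoint / cutpoints[0]
rescaled_cutpoints = [sd * c for c in cutpoints]

# Both parameterizations reproduce the same observed shares.
latent = NormalDist(mu=0.0, sigma=sd)
cdf = [0.0] + [latent.cdf(c) for c in rescaled_cutpoints] + [1.0]
implied_shares = [hi - lo for lo, hi in zip(cdf, cdf[1:])]
print([round(s, 3) for s in implied_shares])
```

In a single year the two identifications are interchangeable. The second becomes informative once the fixed cutpoint is held constant across years, because the estimated SD is then free to vary over time.
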
Shifting Cutpoints Absorb Changes in Bimodality


Probit defines the new variable as normal, forcing its shape to have zero 
skew and zero kurtosis. That transformation is usually acceptable, but it 
is fatal if the goal of an analysis is to identify change in kurtosis over 
time. The transformation is, furthermore, problematic for cases where the 
distribution has large tails or where the action occurs in the endpoints of 
an ordinal scale because the model’s assumptions do not fit the observed 
data as well as they might otherwise. The result would be a poorly fitting 
model that cannot identify changes in the skew or bimodality of a dis- 
tribution across the independent variable. This issue is particularly salient 
for the survey question that Mouw and Sobel reexamine. Figure 2 presents 
a histogram of responses to the abortion question in 1980 and again in 
1994. This is where Mouw and Sobel’s discussion of cutpoints becomes 
important. 

As indicated above, Mouw and Sobel’s figure 1 can be interpreted as 
a bell-shaped histogram divided with three “cutpoints” into four portions 
matching the frequency distribution in the observed variable—11% of 
the area in the first section, 32% in the second, and so on. Mouw and 
Sobel’s cutpoints are not evenly spaced and symmetrical because the 
observed variable does not have a normal distribution. We may think of 
this procedure as, in effect, absorbing bimodality into a normal curve. 

Over the course of the survey period, however, the proportion of re- 
sponses in category 4 increased, making the observed distribution even 
less appropriate for standardization. Mouw and Sobel’s method accom- 
modates this change by allowing the third cutpoint to move toward the 
center of the distribution after 1990—making room in the right tail for 
more pro-choice responses in the later part of the time period and thereby 
absorbing the change that produced decreasing kurtosis (increased bi- 
modality) in our 1996 models. Indeed, the large improvement in fit that 
Mouw and Sobel report as a result of allowing cutpoint 3 to move toward 
the mean constitutes evidence of this polarization. Following the logic 
that an improvement in fit indicates a significant effect of changes in the 
model’s parameters, we conclude that there has been a significant shift 
in abortion opinion from category 3 (a middling opinion) to category 4 
(an endpoint). Given that the leftmost category has not lost ground, we 
argue that the shift toward category 4 constitutes polarization of the sort 
measured by kurtosis. 
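
A small Python sketch may make the mechanics of this argument concrete. It holds the latent mean and SD fixed at 0 and 1, which is a simplification of Mouw and Sobel's model, and it uses hypothetical later-year shares in which five percentage points move from category 3 to category 4; only the 1980 shares come from the text.

```python
from itertools import accumulate
from statistics import NormalDist


def cutpoints_for(shares):
    """Cutpoints on a standard normal latent scale that reproduce shares."""
    cumulative = list(accumulate(shares))[:-1]
    return [NormalDist().inv_cdf(p) for p in cumulative]


shares_1980 = [0.115, 0.327, 0.190, 0.368]   # quoted in the text
shares_later = [0.115, 0.327, 0.140, 0.418]  # hypothetical later year

early = cutpoints_for(shares_1980)
later = cutpoints_for(shares_later)

# Only the third cutpoint moves, and it moves toward the center (leftward),
# making room in the right tail for the larger category 4.
shift = later[2] - early[2]
print([round(c, 2) for c in early])
print([round(c, 2) for c in later])
print(shift < 0)
```

With the latent distribution held fixed, the whole change in responses is carried by the third cutpoint, which is the sense in which a cutpoint shift can absorb a move from the middle to an endpoint.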

When the leftmost or rightmost cutpoint moves toward the center, an 
endpoint of that distribution increases, and polarization may result if the 
opposite pole maintains its strength. If both tails increase, we can be more 
certain that the shift is not a shift toward agreement on a polar position 
(as would be the case for opinion on women’s rights over time; see fig. 3 
in our 1996 article). If both tails increase at the same rate, however, there
may be no need to allow cutpoints to shift. A small or moderate evenly 
balanced shift toward the tails would merely produce increasing variance 
and can be modeled that way with Mouw and Sobel’s method. Unfor- 
tunately, a strong polarizing trend has the potential to empty the middle 
beyond the constraints of normality. That is, there could be so few re- 
sponses in the middle category that 68% could not lie within one standard 
deviation of the mean at the same time that only 95% lie within two 
standard deviations of the mean. Therefore, the result of a strong trend 
toward the tails of a distribution would be a poorly fitting model rather 
than a significant increase in variance. This is not the case for the example 
at hand, but it is an unfortunate characteristic of the method that reduces 
its usefulness as a measure of polarization. 

Although scholars of the pro-choice movement might debate the merits 
of Mouw and Sobel’s historical claim that a new abortion discourse ap- 
peared in the early 1990s, the observed shift in survey responses does 
appear suddenly and in conjunction with political changes rather than in 
a long gradual trend of eroding middle ground. It appears, then, that 
Mouw and Sobel have documented the same shift that we found in our 
1996 article while offering an alternative explanation that fits the evidence 
more closely than does the linear view implicit in the culture wars 
hypothesis. 

On the other hand, we cannot know whether their interpretation is 
correct. Mouw and Sobel state that it would be inappropriate to allow 
for a cutpoint shift unless the researcher is fairly certain that there has 
been a shift (using the current example) in the way respondents interpret 
the question. Their decision to allow a cutpoint shift thus assumes that 
the hypothetical respondents who would answer category 3 in earlier years 
but switch to category 4 after 1992 would do so not because their underlying
opinion had changed but because their interpretation of the question
had changed such that they felt a different category better matched
their unchanging opinion. In short, they argue that the relevant indicators 
changed because people changed the way they expressed a given attitude 
without necessarily changing the attitude itself. 

For our question, however, the theoretical interpretation of that change 
is irrelevant. We did not ask whether interpretations had changed over 
time, we asked whether the pattern of responses had changed, regardless 
of the meaning one might attach to survey questions or their response 
categories. It is entirely possible that political contexts changed the way 
respondents interpreted questions about abortion, producing an “artificial” 
polarization. But it is also possible that people changed their opinions 
and that a deeper moral rift appeared in attitudes toward abortion law. 
The data and methods before us cannot determine whether either of these 
assumptions is true. We can only know that more people chose category
4 in later years than in earlier years and that there had not been any 
erosion in the popularity of the opposite pole. 

We suggest, however, that there is an important unremarked benefit to 
modeling cutpoint shifts and changes in the mean of a dependent variable 
as Mouw and Sobel do. If cutpoint changes can be interpreted as indicators 
of polarization, Mouw and Sobel’s method offers the useful feature of 
making it possible to distinguish the trend in dispersion from the trends 
in both means and kurtosis we observed in our 1996 article. In their 
analysis, an improved fit with changing cutpoints indicates and, in effect, 
controls for the change that produced decreasing kurtosis. From this per- 
spective, then, the comparison of their M, and M, demonstrates that 
defining a linear change in variance does not significantly improve the fit 
of the model, when controlling for the shift from category 3 to category 
4 (i.e., an increase in bimodality). This method, furthermore, allows the
researcher to test for changes in variance under other assumptions such 
as constant, fitted, or linear trends in the mean (another by-product of 
the increased size of category 4). Because the increasing proportion of 
respondents selecting the pro-choice category is clearly specified, tested 
and controlled, the result is a clearer view of changes in the NES measure 
of opinion toward abortion law between 1980 and 1994 than the one we 
presented in “Have Americans’ Social Attitudes Become More Polarized?”


SUBSTANTIVE CRITIQUE OF FINDINGS ON POLARIZATION OVER 
ABORTION 


Beyond their methodological innovation, Mouw and Sobel also offer a 
substantive intervention into debates about polarization in the United 
States, joining a large number of scholars who have worked on this very 
question (DiMaggio et al. 1996; Hunter 1991, 1994; Williams 1997; Evans 
1996, 1997; DiMaggio and Bryson, in press; Hoffmann and Miller 1998; 
Gay et al. 1996; Hout 1999; Davis and Robinson 1996). Although we 
welcome the methodological innovations that lead to this conclusion, and 
although Mouw and Sobel’s no-polarization conclusions, if accurate, 
would strengthen the underlying argument of our 1996 article as a whole, 
we cannot accept that conclusion for several reasons. 

First, as Mouw and Sobel note, their method has not yet been extended 
to deal with additive scales of the kind derived from the GSS. Our 1996 
conclusions about polarization of abortion attitudes relied primarily on 
GSS results, and are therefore in no way undermined by Mouw and 
Sobel’s new empirical findings. 

Mouw and Sobel do imply that our GSS results are incorrect because 
the NES variable is an ordinal measure and the GSS scale may fail to
meet even this standard. The problem of using what appears to be an 
ordinal dependent variable is resolved in the operationalization used in 
our article. Rather than treating response categories as meaningful po- 
sitions “in relation to some theoretical maxim” (p. 693), we asked whether 
the distribution of responses had changed over time. For that purpose, 
meaningful estimates of mean, variance, and kurtosis only require that 
the data be unidimensional. Although we did not consider the meanings 
that survey respondents may have attached to response categories, schol- 
ars who wish to do so would find our methods inappropriate, as Mouw 
and Sobel indicate, which is one of many reasons that we consider their 
approach a useful complement to our own. 

Furthermore, there is a long and well-documented debate over whether 
the Rossi abortion scale in the GSS is bidimensional (i.e., whether three
questions represent a latent attitude toward “hard” cases, and three toward 
“easy” cases of abortion morality decisions; see, e.g., Clogg and Sawyer 
1981; Duncan, Sloane, and Brody 1982; Muthen 1982; Gillespie, Vergert, 
and Kingma 1987, 1988). Whereas in the past many scholars used the six 
questions to create a Guttman scale, more recently researchers have cre- 
ated additive scales as we did (see, e.g., Hout 1999). Adjudicating this 
debate over scaling goes beyond the scope of this comment, but the re- 
search indicates that the Rossi scale is indeed unidimensional and, while 
not a Likert scale, that it is appropriate for use in the conventional manner 
(Gillespie et al. 1987, 1988). In other words, the most important evidence 
that we presented for the polarization of abortion attitudes, evidence from 
the GSS, remains compelling. 

Second, Mouw and Sobel have not challenged our conclusions about 
two of the four dimensions on which we measured (and found) opinion 
polarization over abortion. We have demonstrated, as have other re- 
searchers (Hout 1999), that intergroup differences in attitudes toward 
abortion have increased over time, indicating a form of polarization we 
called consolidation. Similarly, we found a significant increase in ideo- 
logical cohesion among the items used to form the GSS abortion scale, 
indicating an increase in the aspect of polarization we called constraint. 

Third, although we accept Mouw and Sobel’s conclusion that there 
was no linear trend in the dispersion of NES abortion attitudes from 1980 
to 1994, we have applied Mouw and Sobel’s method to the NES abortion 
question from 1980 to 1998, bringing the data series up to date and 
separating the effects of changes in variance from the pro-choice shift 
that produced declining kurtosis in our article. Because Ted Mouw gra- 
ciously shared his Stata code with us, we were able to apply their method 
precisely and find evidence of an increase in variance even when con- 
trolling for the cutpoint shift. 

TABLE 1
MODEL EVALUATION STATISTICS FOR MODELS OF ATTITUDES TOWARD LEGALIZED
ABORTION, NES 1980-98

Model   Description                                      Fit statistic    P     df
EBD1    Fitted means with constant cutpoints and             72.35      .000    18
        constant variance
EBD2    EBD1 with linear trend in variance                   52.63      .000    17
EBD3    EBD1 with different variance in 1992-98              50.66      .000    17
EBD4    Fit all means with τ₁ change for 1990-98,            27.48      .051    17
        with constant variance
EBD5    EBD4 with linear trend in variance                   26.55      .047    16
EBD6    EBD4 with different variance in 1992-98              24.83      .073    16
EBD7    EBD4 with different variance in 1986, 1992-98        17.63      .283    15



We first ran a model (EBD1, reported in table 1) with constant variance, 
constant cutpoints, and fitted means as a baseline comparison. This model 
does not fit the data well. What about change in dispersion (variance) 
without a shifting cutpoint? We fit EBD2, which holds the cutpoints 
constant, but allows the variance to change linearly (their M,). EBD2 fits 
much better than EBD1, but not well enough for further consideration. 
We concur with Mouw and Sobel that there does not appear to be any 
linear increase in variance without also modeling a change in cutpoint. 

Has there been a nonlinear change in variance? One of the important 
innovations in Mouw and Sobel’s method is the ease with which nonlinear 
changes in variance can be modeled. In our article, we hypothesized 
increasing polarization as linear and monotonic not for any theoretical 
reason, but due to limitations of available data and methods. After re- 
jecting the hypothesis of a linear change in variance, Mouw and Sobel 
examined the variance in each year individually in their model 1 (M1),
which suggested that there were three distinct variances: σ² = 1980, 1984,
and 1988; σ²′ = 1986; and σ²″ = 1990, 1992, and 1994. The result was
their Mœ where the variance from 1990 to 1994 differed from the variance 
in the 1980s (excepting 1986). Mouw and Sobel reported that the
coefficient for the difference in variance in the 1990s is highly significant,
suggesting polarization, but they dismissed the model because it fits the
data poorly.

Adding the 1996 and 1998 data to the series changes the perception of
where the distinct variance eras begin and end. Our equivalent to Mouw
and Sobel’s M1 (available upon request) suggested that the discontinuous
change in variance occurred in 1992, not 1990. We ran model EBD3 to
determine whether there was a nonlinear increase in variance when not
modeling the shifting cutpoint. Although the model improved upon EBD1,
it does not fit within conventional standards. Thus, our findings support
Mouw and Sobel’s conclusion that there has been no overall trend in
dispersion without modeling the change in cutpoint.

TABLE 2
MODEL EBD6: ATTITUDES TOWARD THE LEGALITY OF ABORTION,
NES 1980-98

Parameter                Estimate     SE       P
Variance:
  ln(σ²) 1992-98           .087      .054    .105
  Variance constant        .023      .029    .424
Tau:
  ln(τ₁) 1990-98          -.219      .044    .000
  τ constant              -.713      .028    .000
Mean:
  μ1982                   -.024      .041    .557
  μ1984                   -.021      .037    .573
  μ1986                    .021      .037    .570
  μ1988                   -.041      .037    .272
  μ1990                   -.020      .039    .599
  μ1992                    .188      .048    .000
  μ1994                    .022      .048    .643
  μ1996                    .104      .067    .122
  μ1998                    .040      .045    .371
  μ constant               .164      .029    .000


However, another strength of their method is the ability to disentangle 
generalized dispersion from a specific move toward one end of a distri- 
bution. Assuming that Mouw and Sobel are correct that the cutpoint shift 
occurred in 1990 we fit the equivalent of their model M, (EBD4), which 
allows for a change in cutpoint after 1990 but holds variance constant. 
This fit significantly better than EBD1, suggesting that after 1990 there 
was a move from the center to the most liberal category. This finding is 
also consistent with that of Mouw and Sobel. 

To approximate the linear methods that we used in our 1996 article, 
we can also ask whether variance increases in a linear fashion after mod- 
eling, and thus controlling for, the shifting cutpoint. Examining EBD5 
reveals that the answer is no. 

Finally, we ask whether or not there is a nonlinear change in variance,
given the change in cutpoint. We created a model (EBD6) with fitted
means, a cutpoint shift in 1990, and a nonlinear increase in variance from
1992 on. The data fit the model adequately, using the same fit standard
as Mouw and Sobel, and the change in variance is almost significant at
the .10 level we used in our original paper (P = .105) (see table 2). This
is weak evidence of increased dispersion in attitudes over abortion from
1992 to present, assuming a change in cutpoint after 1990.

TABLE 3
MODEL EBD7: ATTITUDES TOWARD THE LEGALITY OF ABORTION,
NES 1980-98

Parameter                Estimate     SE       P
Variance:
  ln(σ²) 1992-98           .109      .054    .044
  ln(σ²) 1986              .155      .058    .008
  σ² constant             -.004      .031    .894
Tau:
  ln(τ) 1990-98           -.232      .044    .000
  τ constant              -.709      .028    .000
Mean:
  μ1982                   -.024      .041    .555
  μ1984                   -.020      .036    .572
  μ1986                    .039      .038    .308
  μ1988                   -.041      .037    .269
  μ1990                   -.023      .038    .547
  μ1992                    .188      .048    .000
  μ1994                    .022      .048    .642
  μ1996                    .104      .067    .122
  μ1998                    .040      .045    .370
  μ constant               .163      .029    .000
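
The P values attached to the variance terms in tables 2 and 3 can be roughly cross-checked from the printed estimates and standard errors. The sketch below assumes that those P values come from two-sided Wald z tests, which the tables do not state, and the function name `wald_p` is ours. Because the printed coefficients are rounded to three decimals, the recomputed values can only be expected to land close to the reported ones.

```python
import math


def wald_p(estimate, se):
    """Two-sided p-value for z = estimate / se under a standard normal."""
    z = estimate / se
    return math.erfc(abs(z) / math.sqrt(2.0))


# (estimate, SE, reported P) for the variance-change terms, as printed.
variance_terms = {
    "EBD6 ln(variance) 1992-98": (0.087, 0.054, 0.105),
    "EBD7 ln(variance) 1992-98": (0.109, 0.054, 0.044),
    "EBD7 ln(variance) 1986": (0.155, 0.058, 0.008),
}

for label, (est, se, reported) in variance_terms.items():
    print(f"{label}: recomputed p={wald_p(est, se):.3f}, reported P={reported:.3f}")
```

Under that assumption, the 1992-98 variance term falls just short of the .10 level in EBD6 and passes the .05 level once 1986 is modeled separately in EBD7.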

In an effort to be as fair as possible to our earlier thesis, Mouw and 
Sobel created a model with the slightly weaker claim that dispersion is 
greater after 1990 than in earlier years except 1986 (Mo). When we control 
for 1986 in EBD6, the result is EBD7 (see table 3). The model fits the 
data quite well: the individual coefficient for the variances after 1992 is 
significant (P < .05) and the conditional likelihood test of difference be- 
tween EBD7 and EBD4 is also significant (P < .01). Thus, if we too make 
concessions to 1986 as an exception in the earlier period, there is evidence 
that attitudes over abortion are more dispersed from 1992 to present than 
in previous years, assuming a change in cutpoint after 1990. This is sug- 
gestive of increased dispersion in addition to the shift toward the pro- 
choice pole that we measured with kurtosis in our earlier paper. In sum, 
even when the change modeled by the shifting cutpoint is controlled, there 
seems to have been increased dispersion in abortion attitudes between 1980 and
1998. 
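
The nested-model comparisons reported above can be reproduced from the fit statistics in table 1. The sketch assumes that those statistics are chi-square distributed, so that the difference between two nested models is itself chi-square distributed with degrees of freedom equal to the difference in df; the helper names are ours. Only the closed forms for 1 and 2 degrees of freedom are implemented, which is all these comparisons need.

```python
import math

# Fit statistic and degrees of freedom as printed in table 1.
FIT = {
    "EBD4": (27.48, 17),
    "EBD6": (24.83, 16),
    "EBD7": (17.63, 15),
}


def chi2_sf(x, df):
    """Upper-tail chi-square probability for df = 1 or 2 (closed forms)."""
    if df == 1:
        return math.erfc(math.sqrt(x / 2.0))
    if df == 2:
        return math.exp(-x / 2.0)
    raise ValueError("only df = 1 or 2 is supported in this sketch")


def nested_test(restricted, general):
    """Return (difference in fit, difference in df, p-value)."""
    stat_r, df_r = FIT[restricted]
    stat_g, df_g = FIT[general]
    diff, ddf = stat_r - stat_g, df_r - df_g
    return diff, ddf, chi2_sf(diff, ddf)


for general in ("EBD6", "EBD7"):
    diff, ddf, p = nested_test("EBD4", general)
    print(f"{general} vs. EBD4: difference={diff:.2f} on {ddf} df, p={p:.4f}")
```

With the printed values, adding both variance terms (EBD7 against EBD4) yields a difference of 9.85 on 2 df, below the .01 level, while adding only the 1992-98 term (EBD6 against EBD4) yields a difference of 2.65 on 1 df, which sits near the .10 level.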




American Journal of Sociology 


CONCLUSION 


In sum, we welcome the advance in understanding that Mouw and Sobel’s 
innovations provide. Using their method to model the shift to one end of 
the distribution captured by the kurtosis statistic in our 1996 article, as 
well as the variance—and extending the data series—we find that there 
has been a demonstrable increase in the dispersion of opinion toward 
abortion law, as well as a shift toward the liberal end of the distribution 
with the other end holding steady. While Mouw and Sobel’s method does 
not test for all four dimensions of polarization, it does improve upon our 
tests of dispersion by (a) solving the problem posed by ordinal data and 
(b) allowing the researcher to avoid the risk of conflating changes in mean, 
variance, and some types of bimodality. Although we highlight some lim- 
itations of their method and question some of their substantive conclu- 
sions, we hail their paper as an important advance toward a generalized 
method for examining polarization. 
Organizational theories, especially ecological perspectives, emphasize
the disruptive effects of change. However, the mechanisms producing
these effects are seldom examined explicitly. This article examines
one such mechanism—employee turnover. Analyzing a
sample of high-technology start-ups, we show that changes in the 
employment models or blueprints embraced by organizational lead- 
ers increase turnover, which in turn adversely affects subsequent 
organizational performance. Turnover associated with organiza- 
tional change appears to be concentrated among the most senior 
employees, suggesting “old guard disenchantment” as the primary 
cause. The results are consistent with the claim of neoinstitutionalist 
scholars that founders impose cultural blueprints on nascent organ- 
izations and with the claim of organizational ecologists that altering 
such blueprints is disruptive and destabilizing. 


INTRODUCTION 


Models of the employment relationship play an important—if not always 
explicit—role in contemporary theories of organizations. Notwithstanding 
their other differences, numerous perspectives affirm the idea that organizations
embrace particular culturally accepted logics or blueprints for
organizing, including a model of how employment relations should be 
structured. These models are claimed, in turn, to guide subsequent or- 
ganizational evolution and to be resistant to change. For instance, in 
discussing organizational inertia, population ecologists have argued that 
survival prospects are enhanced by organizational features that promote 
reliability and accountability, including a coherent system for managing 
employees: “Testing for accountability is especially intense during organ- 
ization building. ... When membership involves an employment relation, 
potential members often want guarantees that careers within the organ- 
ization are managed in some rational way” (Hannan and Freeman 1984, 
p. 153). Among the most important factors in fostering reliability and 
accountability, according to organizational ecologists, are clearly specified 
forms of authority and well-understood bases of exchange between mem- 
bers and the organization. Hence, organizations stand to benefit by de- 
veloping and institutionalizing coherent blueprints for employment re- 
lations that can foster reliability and accountability; once such a blueprint 
gets adopted, it is risky and costly for organizations to alter it. 
Neoinstitutional perspectives also emphasize the importance of nor- 
mative or cultural blueprints in shaping organization building and or- 
ganizational evolution (e.g., Guillén 1994; Fligstein and Byrkjeflot 1996). 
According to these accounts, the designers of organizations draw on cul- 
turally appropriate templates and conceptions of control in crafting struc- 
tures, work roles, and employment relations because this enhances or- 
ganizational legitimacy and because their own prior socialization and 
enculturation presumably preclude doing otherwise (Fligstein 1987, 1990). 
Organizational economists have made similar arguments about the 
value of distinctive and coherent human-resources systems and their inertial
tendencies (Milgrom and Roberts 1995). Organizations benefit from
a coherent, consistent, and well-understood philosophy and set of practices 
governing human resource management because: (1) they benefit from 
technical complementarities among specific personnel practices (Milgrom 
and Roberts 1992, chap. 4)—for instance, investments in training increase 
the value of policies that reduce turnover, and vice versa; (2) learning is 
simplified and accelerated; and (3) the organization can more readily dif- 
ferentiate itself from competitors, helping it attract workers well-suited 
to the kinds of jobs and values the firm has on offer. These benefits are 
particularly likely to be realized when the organization’s model (e.g., an 


consin, the University of Chicago, and the Wharton School, none of whom should be 
blamed for any remaining defects in the finished product. Direct correspondence to 
James N. Baron, Graduate School of Business, Stanford University, Stanford, California
94305-5015. E-mail: baron_james@gsb.stanford.edu
ironclad bureaucracy or a familial clan) resonates with behavioral scripts
or blueprints familiar to prospective and current employees from their 
experiences in other settings (Baron and Kreps 1999, chap. 3). Yet strong 
complementarities and interdependencies among various facets of an or- 
ganization’s model make changes in any particular element more difficult 
and costly. 

These various arguments suggest that organizational models regarding 
employment relations should be resistant to change and that efforts to 
redraw blueprints should be disruptive. Indeed, according to organiza- 
tional ecologists, efforts to alter the premises governing employment re- 
lations should be among the most disruptive types of organizational 
change. Changes in these premises can obsolesce skills and routines that 
employees have learned, alter bases of power and status within the or- 
ganization, and violate the implicit or explicit contracts specifying the 
expectations and entitlements of employees vis-à-vis the firm. Therefore,
efforts by firms to implement change along these dimensions should in- 
crease discord and employee turnover. For instance, consider recent efforts 
by health care organizations to implement performance evaluation and 
reward systems for physicians based on patient volume. These changes 
have proved enormously contentious and sparked unionization efforts and 
rancor within the profession, precisely because they promote behaviors 
and values that conflict with most physicians’ basic premises about their 
role (Greenhouse 1999). 

Yet, the destabilizing effects of fundamental organizational changes 
have been assumed more than tested in organizational research. Numerous 
studies have documented that some type of organizational change (in 
strategy, top leadership, etc.) has deleterious consequences for organiza- 
tional performance or survival (for reviews, see Barnett and Carroll [1995] 
and Carroll and Hannan [2000]), which are often attributed to the internal
disruption, rancor, and turnover of personnel wrought by changes in val- 
ues, routines, working relationships, and requisite skills. In this article, 
we examine empirically whether changes to the organizational model 
embraced by the founder(s) have disruptive effects, which we operation- 
alize by focusing on labor force turnover and organizational performance. 
Turnover is particularly disruptive in the setting we examine—high-tech 
start-ups in Silicon Valley—for at least two reasons: the acute shortage 
of scientific, technical, and engineering talent facing these organizations; 
and the fact that, for many technology start-ups, employee turnover risks 
losing the firm’s most precious asset, its human capital. 

These analyses speak to several important issues for organizational 
scholarship. First, they explore the validity and utility of the notion that 
culturally based scripts, templates, or blueprints are imprinted on organ- 
izations at their founding. In prior research, we have documented the 
existence of distinct organizational blueprints or models—different sets
of premises governing the employment relation—among the founders of 
high-technology start-ups in Silicon Valley. We have demonstrated that 
these blueprints shaped various aspects of organizational evolution, in- 
cluding bureaucratization and administrative intensity, the development 
of the HR function, and even the odds of replacing the founder with a 
new chief executive officer (CEO) and of going public (Baron, Burton, 
and Hannan 1996, 1999; Baron, Hannan, and Burton 1999a, 1999b; Hannan,
Burton, and Baron 1996; Burton 1995). If these blueprints represent
part of the “hard wiring” of an enterprise and shape the expectations that 
employees and firms have of one another, then changing the blueprint 
should have demonstrable consequences, including heightened turnover. 
Second, our analyses address the general claim that fundamental change 
destabilizes organizations. Third, labor force turnover is, of course, an 
inherently important organizational outcome, which has received considerable
attention from social scientists (for reviews, see Price [1977] and
Mobley [1982]). Whatever its relation to performance, turnover surely
affects organizational life (Staw 1980; Pfeffer 1983). Two otherwise iden- 
tical organizations with persistent differences in turnover will evolve very 
different tenure distributions, with implications for stability and change 
in organizational culture (Carroll and Harrison 1998). Finally, we believe 
the models and methods we employ might prove useful to researchers 
interested in depicting organizational founding conditions and assessing 
their enduring effects. 

We begin by describing the sample, the organizational setting, and our 
typology of employment models. We then formulate and test predictions 
relating labor force turnover to changes in the employment blueprint. We 
assess whether the turnover wrought by changing the employment blue- 
print is disruptive for organizations in two ways: (1) by specifying and 
testing hypotheses concerning how that turnover is patterned (i.e., who 
is most likely to depart); and (2) by analyzing the effects of labor force 
turnover on organizational performance (revenue growth). We test these 
predictions with pooled cross-section time-series data describing annual 
turnover rates and various organizational change events for more than 
100 organizations between 1991 and 1995. 


THE STANFORD PROJECT ON EMERGING COMPANIES (SPEC) 


A panel study of young, high-technology firms in California’s Silicon 
Valley, SPEC examines the evolution of employment practices, organi- 
zational designs, and business strategies. The SPEC project seeks to un- 
derstand how human resource systems get established. The focus on firms 
in a single region and sector of economic activity holds constant key labor
market and environmental conditions, as well as some of the institutional 
influences asserted to shape labor force turnover. Within the Silicon Valley 
region, we sought industries containing sufficient numbers of comparable 
firms to allow quantitative comparisons; accordingly, we concentrated on 
firms engaged in computer hardware and/or software, telecommunications 
(including networking equipment), medical and biological technologies, 
and semiconductors. We assumed organizations must reach a minimum 
size before needing formal systems or practices; accordingly, we required 
that firms in our study have at least 10 employees when sampled.² We
also wanted to understand how founding conditions and early decisions 
affect subsequent organizational evolution, which necessitates information 
about the earliest days of the organization. We assumed individuals could 
only reliably recall fairly recent information; consequently, we limited the 
sample to firms no more than 10 years old when first visited in 1994-95 
(the typical firm was six years old).³


DATA COLLECTION 


Survey, interview, and archival methods were used to gather information 
on each firm (see Burton 1995). Trained MBA and doctoral students con- 
ducted semistructured interviews with the current CEO. The CEO was 


2 In 1994, we identified 676 technology firms in Silicon Valley founded within these 
industries during the previous ten years and having more than 10 employees, according 
to two commercial databases on Silicon Valley technology companies: Rich’s Everyday 
Sales Prospecting Guide (1994); and the Technology Resource Guide to Greater Silicon 
Valley (1993/4), published by CorpTech. From that group, 250 firms were selected
according to a stratified sampling plan described in Baron, Burton, and Hannan (1996,
fig. 1). Of the 250 firms to whom we wrote (some of which had gone out of business,
left the area, or been acquired by the time we contacted them), 109 agreed to participate. 
Utilizing the same sampling frame, we contacted 94 additional companies in 1995 (of 
168 that were added to the 1995 edition of the CorpTech directory); 42 agreed to be 
studied. Finally, we supplemented the sample by contacting 32 very young firms (not 
listed in CorpTech), which we identified by tracking the Silicon Valley business press; 
22 of these firms participated. We further enlarged the sample in 1997-98 by soliciting 
participation from very young firms, including enterprises in the newly emerging In- 
ternet sector. However, we were only able to obtain turnover data for several of these 
firms, whose responses conceivably reflect a different time period, labor market, and
business environment than the rest of the firms in our sample confronted. Consequently, 
we did not include those several firms in the analyses reported here. 

3 About 10% of firms proved to be more than 10 years old when we first visited them.
In some cases, for example, our interviews revealed that the inception of the organi- 
zation occurred somewhat earlier than the date of legal incorporation used in con- 
structing our sampling plan. Even employing the most liberal definition of “founding,” 
however, only three firms in the sample analyzed here had existed for more than 12 
years when we first visited them. 
asked to identify the founder (or member of the founding team) best
equipped to provide information regarding the firm’s origins; and the best 
informant regarding human resources (HR) practices in the organization. 
We followed up with these informants about company history and HR 
(respectively) and asked them to return completed surveys to us prior to 
being interviewed. The company history survey solicited details about the 
firm’s founding and subsequent milestones. The HR survey sought in- 
formation about workforce demographics and a variety of employment 
policies and practices. Information from the surveys, when available, was 
used to guide interviews with informants in each company. 


FOUNDERS’ ORGANIZATIONAL MODELS 


As noted above, recent neoinstitutional work invokes the notion of cul- 
turally based logics, blueprints, scripts, or conceptions of control. Yet 
researchers have seldom tried to operationalize such blueprints directly, 
tending instead to infer their existence from other sources of information. 
Testing the proposition that altering founders’ initial blueprints for organizing
and for structuring employment relations is disruptive demands
a method for measuring those blueprints. 

In designing the study, we knew from the extant literature that con- 
ceptions of employment relationships could vary along numerous dimen- 
sions, and we were unsure a priori which dimension(s) would be most 
relevant in our setting. Accordingly, we used open-ended interviews to 
gather information. We asked each founder whether he or she had “an 
organizational model or blueprint in mind when (you) founded the com- 
pany.” (The CEO was asked a parallel question about the period corre- 
sponding to the date of the interview.) We inductively analyzed transcripts 
of interviews with founders and CEOs. Those analyses indicated that 
interviewees’ images regarding how work and employment should be
organized varied along three main dimensions—attachment, coordination/
control, and selection—each characterized by three or four fairly distinct
options or approaches from which organizational architects seemed to be 
selecting. (For descriptions of these different response categories and il- 
lustrative quotes from interview transcripts, see Burton [1999].) Based on 
the interview transcripts, members of the research team coded or classified 
responses of each founder and each CEO on these three dimensions, unless 
missing data precluded this.⁴ In previous work, we have shown that these


4 Two senior researchers on the project independently coded the three dimensions for
all firms, based on transcripts of interviews with founders and with CEOs. Many 
respondents indicated that more than one option for a dimension was relevant to their 
conception of the employment model; for instance, almost all regarded compensation 
three dimensions cohere and can be used to characterize the implicit
organizational model or blueprint of the founder and of the CEO at the 
time our team first visited each firm (for an overview and additional 
details, see Burton [1995] and Baron and Kreps [1999], chap. 19). Here 
we briefly summarize the approach. 

Attachment.—Founders articulated three different bases of employee 
attachment, which we label love, work, and money. Some founders en- 
visioned creating a strong family-like feeling and an intense emotional 
bond with the workforce that would inspire superior effort and increase 
retention of highly sought employees, thereby avoiding the frequent mo- 
bility of key technical personnel that plagues Silicon Valley start-ups. 
What binds the employee to the firm in this model is a sense of personal 
belonging and identification with the company—in a sense, love. Many 
SPEC firms pursue cutting-edge technology, and the primary motivator 
for their employees is the desire to work at the technological frontier. 
Recognizing this, many founders anticipated providing opportunities for 
interesting and challenging work as the basis for attracting, motivating, 
and (perhaps) retaining employees.⁵ Here, employees were not expected
to be loyal to the organization, the supervisor, or even coworkers per se, 
but instead to a project. Finally, other founders’ responses indicated that 
they regarded the employment relationship as a simple exchange of labor 
for money. 

Basis of coordination and control.—A second dimension concerned the 
principal means of coordinating and controlling work. The most common 
conception involved extensive reliance on informal control through peers 


as relevant to retention, even if they regarded, say, exciting work as more important. 
Hence, the coding task sought to select the option on each dimension that the re- 
spondent indicated as dominant in his/her thinking about the dimension. Some re- 
spondents were unable or unwilling to give priority to one option. Therefore, it was 
important that we use explicit default rules for responses that did not fall neatly into 
one of our categories. We used the following defaults: skills for selection, work for
attachment, and peer-based control for coordination/control. In effect, this makes the 
engineering model the default (see below). We interpreted the default rules as follows: 
unless the respondent clearly indicated that some other option was more important 
than the default, we coded the response as the default category. After independently 
coding each firm on all three dimensions for the founder and CEO responses, the two 
researchers compared their two sets of codings. In the large majority of cases, they 
were the same. When they were not, we scrutinized the transcripts looking for ambiguities
that might have led the two coders to disagree. In most cases, we decided
that the difference was due to some opacity in the response and therefore settled on 
the default code.

5 A few founders also spoke about providing unrivaled “opportunity” for prospective
employees. Although opportunity is potentially a conceptually distinct basis of attachment,
it was closely aligned with “challenging work” and there were very few such
cases in our sample. Hence, we treated these cases as instances of attachment based 
on “work.” 
or organizational culture. Other founders intended to rely on professional
control, even if they did not explicitly use this terminology. They took it 
for granted that workers were committed to excellence in their work and 
could perform at high levels because they had been professionally so- 
cialized to do so. (Not surprisingly, this approach tends to be accompanied 
by an emphasis on hiring high-potential individuals from elite institu- 
tions.) Professional control emphasizes autonomy and independence rather 
than enculturation. A third group of founders espoused a more traditional 
view of control as embedded in formal procedures and systems. Finally, 
some founders indicated that they planned to control and coordinate work 
personally, by direct oversight, reminiscent of Edwards’s (1979) descrip- 
tion of the simple-control paradigm that characterized small capitalist 
firms in the late nineteenth and early twentieth century. 

Selection.—The third dimension concerns the primary basis for selecting
employees. Some founders’ responses suggested that they conceived
of the firm as a bundle of tasks and sought employees to carry out
particular tasks effectively. Time and money tended to be the paramount 
concerns here, so the focus was on selecting employees who could be 
brought on board and up to speed as quickly and cheaply as possible. In 
these cases, founders envisioned selecting employees having the skills and 
experience needed to accomplish some immediate task(s). Other founders 
focused less on immediate and well-defined tasks than on a series of 
projects (often not yet even envisioned) through which employees would 
move over time. Accordingly, they focused on long-term potential. Finally, 
some founders focused primarily on values and cultural fit, emphasizing 
how a prospective hire would connect with others in the organization. 

Relationships among the three dimensions.—These blueprints can be
classified into three types of attachment and selection and four types of
control, yielding 3 × 3 × 4 = 36 possible combinations. However, the
observations cluster into a few cells (see Burton 2001), which we will refer 
to as five basic model types for employment relations, summarized in 
table 1. 
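
The classification scheme can be expressed as a small lookup over the three coded dimensions. This is only a sketch: the category labels follow table 1, but the function name `classify` and the label "other" for combinations outside the five basic types are our own illustrative choices, not terms used in the coding protocol.

```python
from itertools import product

ATTACHMENT = ("love", "work", "money")
SELECTION = ("skills", "potential", "fit")
CONTROL = ("peer/cultural", "professional", "formal", "direct")

# The five basic model types, keyed by (attachment, selection, control).
BASIC_TYPES = {
    ("work", "potential", "professional"): "star",
    ("work", "skills", "peer/cultural"): "engineering",
    ("love", "fit", "peer/cultural"): "commitment",
    ("work", "skills", "formal"): "bureaucracy",
    ("money", "skills", "direct"): "autocracy",
}


def classify(attachment, selection, control):
    """Map a coded blueprint to a basic type, or 'other' (hypothetical label)."""
    return BASIC_TYPES.get((attachment, selection, control), "other")


all_cells = list(product(ATTACHMENT, SELECTION, CONTROL))
print(len(all_cells))                                  # number of possible combinations
print(classify("work", "skills", "peer/cultural"))     # the default engineering blueprint
print(sum(classify(*cell) == "other" for cell in all_cells))
```

Enumerating the cells makes the sparsity of the typology explicit: five of the 36 possible combinations carry a basic-type label, and the remaining 31 fall outside it.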

The engineering model involves attachment through challenging work, 
peer group control, and selection based on specific task abilities. This 
model parallels standard descriptions of the default culture among high- 
tech Silicon Valley start-ups (e.g., Saxenian 1994), and it is the modal 
employment blueprint among founders of SPEC firms. The star model 
refers to attachment based on challenging work, reliance on autonomy 
and professional control, and selecting elite personnel based on long-term 
potential. The commitment model entails reliance on emotional-familial 
attachments of employees to the organization, selection based on cultural 
fit, and peer group control. The bureaucracy model involves attachment
based on challenging work and/or opportunities for development, selecting
individuals based on their qualifications for a particular role, and
formalized control (for further discussion of this model type, see Baron,
Hannan, and Burton [1999b], appendix). Finally, the autocracy model
refers to employment premised on monetary motivations, control and
coordination through close personal oversight, and selection of employees
to perform pre-specified tasks.


TABLE 1
Five Basic Employment Model Types

                                   Dimensions
Basic Model Type   Attachment   Selection   Coordination/Control
Star               Work         Potential   Professional
Engineering        Work         Skills      Peer/cultural
Commitment         Love         Fit         Peer/cultural
Bureaucracy        Work         Skills      Formal
Autocracy          Money        Skills      Direct

We refer to these five blueprints as the basic model types. We do so 
not only because they are the most prevalent combinations observed 
within this sample, but because they also display several other important 
properties. First, each of these blueprints exhibits a high degree of co- 
herence or internal consistency among the three dimensions, suggesting 
that they complement one another to form an overarching system. For 
instance, consider a founder intending to emphasize control and coordi- 
nation through organizational norms and seeking emotional bonds to the 
company itself (rather than attachment based on the specific work as- 
signment), perhaps in order to create overarching goals among differen- 
tiated subunits. Here there would be a clear technical complementarity 
with selection mechanisms that screen for values and cultural fit, as is 
found under the commitment model. Second, these types display cultural 
resonance and salience within this population and its setting. When we 
have described these archetypes to Silicon Valley employers, employees, 
and other knowledgeable parties, they understand the distinctions and 
frequently begin classifying organizations with which they have experi- 
ence in these terms. 

Furthermore, the five basic types reflect different logics of organizing
within other institutions that actors in this organizational field have
experienced; indeed, the labels for the types are fairly evocative of the
characteristics. For instance, the star model (particularly prevalent among
firms developing medical technology or pursuing research⁶) resonates
closely with the model that underlies academic science, from which many
of the founders and key scientific personnel sought for these start-ups are
recruited. The commitment model draws instead on familial imagery and
the revered legend of Hewlett-Packard within Silicon Valley. The
engineering model resonates with the socialization that engineers receive in
professional school and suits the Valley’s highly mobile labor force. The
bureaucratic model is readily familiar from encounters with bureaucracies
in numerous contexts. Finally, the austere, no-nonsense autocracy model
communicates a powerful and consistent message that employees certainly
have encountered elsewhere before: “You work [for me, the boss], you get
paid [by me]—nothing more, nothing less.” We make no claim that these
basic model types are generic, or even generalizable outside this
population of organizations. Rather, we simply claim that these basic model
types capture blueprints for organizing that have a systemic quality and
display cultural resonance within this setting.

6 Among SPEC firms in the medical technology or research sectors (including
biotechnology), 42.3% were founded along star model lines, compared to only
16% of firms in other industry sectors.

A significant number of companies differed from one (and only one) of 
the basic model types on only one dimension. We will refer to these as 
near-model types. For instance, about 3% of founders envisioned basing 
attachment on love, selecting based on fit, and utilizing direct control. 
This combination represents a near-commitment blueprint: it differs from 
the basic commitment model firm in terms of control (only), and differs 
substantially (i.e., on two or more dimensions) from the other four model 
types. Such an organization suggests an autocratic cult variant on the 
commitment model. Finally, we will use the terms aberrant or nontype 
to refer to all other blueprints—firms in which the blueprint either (1) 
differs from two or more basic model types on one dimension (and does 
not fall into any of the basic types) or (2) differs along two or more 
dimensions from every basic model type. 
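
The classification rule just described can be written out as a short sketch. This is an illustration only, not the authors' coding procedure: the tuple encoding, the function names, and the returned labels are assumptions introduced here, while the five basic profiles follow table 1.

```python
# Basic model profiles as (attachment, selection, control), following table 1.
BASIC_MODELS = {
    "star": ("work", "potential", "professional"),
    "engineering": ("work", "skills", "peer/cultural"),
    "commitment": ("love", "fit", "peer/cultural"),
    "bureaucracy": ("work", "skills", "formal"),
    "autocracy": ("money", "skills", "direct"),
}


def distance(a, b):
    """Number of dimensions on which two blueprints differ (0 to 3)."""
    return sum(x != y for x, y in zip(a, b))


def classify(blueprint):
    """Return (category, model) for a blueprint.

    category is "basic", "near", or "nontype"; model names the matching
    basic type, or is None for nontypes.
    """
    distances = {name: distance(blueprint, profile)
                 for name, profile in BASIC_MODELS.items()}
    exact = [name for name, d in distances.items() if d == 0]
    if exact:
        return ("basic", exact[0])
    one_away = [name for name, d in distances.items() if d == 1]
    # Near-model type: one dimension away from one and only one basic type.
    if len(one_away) == 1:
        return ("near", one_away[0])
    # Otherwise the blueprint is one step from several basic types, or two or
    # more steps from all of them: an aberrant (nontype) blueprint.
    return ("nontype", None)


print(classify(("love", "fit", "direct")))
```

Under this encoding, the love/fit/direct blueprint from the text comes out as a near-commitment type, because it is one dimension from commitment and at least two dimensions from each of the other four models.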

Methodological concerns. This effort to characterize the organizational
blueprints of entrepreneurs raises a host of conceptual and methodological
issues (see Baron, Hannan, and Burton 1999b). Here we touch
briefly on several concerns. First, our coding effort and our conceptual- 
ization of organizational blueprints sought to measure the premises of 
founders and CEOs. Blueprints might or might not bear a relationship 
to organizational reality (for some evidence that they do in this sample, 
see Baron et al. 1996). In classifying firms on the three dimensions, we
took pains to rely not on what respondents claimed they were actually 
doing, but instead on what they recounted about their underlying organ- 
izational model or conception. 

Second, founders might have selectively reconstructed the past. Al- 
though we cannot definitively rule out retrospection bias, some previous 
results provide reassurance on this score. For instance, Baron et al. (1999a)
reported that the founder’s initial organizational blueprint is strongly and
systematically related to an objective, independent measure of present-day
managerial and administrative intensity (suggesting that the
bureaucratization process was path dependent), whereas the current CEO’s
blueprint was unrelated to present-day administrative intensity. If
respondents were selectively tailoring their stories to match or rationalize
reality, then the responses of present-day CEOs should do a better job of 
predicting present-day organizational arrangements than do founders’ 
recollection of their organization-building premises at the start-up phase. 
Furthermore, some founders acknowledged during interviews that their 
original models were naive or ill conceived. Their ability and willingness 
to be self-critical suggests that they were not simply reporting ex post a 
self-serving conception tailored to actual developments. On the other 
hand, given the retrospective nature of founders’ accounts and other lim- 
itations of the available data, our findings and inferences regarding the 
effect of changing organizational models should be treated as suggestive, 
not definitive. 

Third, we cannot tell for sure when employment models changed (if 
they did), and hence causality could run in the other direction: firms 
experiencing higher turnover might change their employment models in 
an effort to stem that turnover. Though we cannot rule out this competing 
account, several important pieces of evidence argue against it, as we 
discuss below. 

As noted above, previous research on the SPEC firms has documented 
that founders’ initial organizational blueprints shaped not only the evo- 
lution of human resource practices and the HR function, but numerous 
other facets of organizational evolution as well. The fact that founders’ 
models predict how firms develop over time provides some evidence of 
the validity of the typology of basic employment models. We can also 
assess whether the taxonomy captures real and meaningful distinctions 
in founders’ organization-building templates by examining whether model 
change disrupts the enterprise, as manifested in increased turnover. 


HYPOTHESES 
Effects of Organizational Model and Model Change on Turnover 


It is important to distinguish between two potentially competing effects 
of changing the organizational model on turnover. The first concerns what 
Barnett and Carroll (1995) call the process effects of change: the disruptive 
and destabilizing effects of altering deeply embedded organizational prem- 
ises. If our basic model types in fact capture distinctive systems or recipes 
for organizing, then efforts to change the founder’s initial employment 
model should be disruptive. We therefore predict:

HYPOTHESIS 1.—The more that an organization’s blueprint or model
has changed from what the founder initially envisioned, the higher the
rate of employee turnover.⁷

Barnett and Carroll argue, however, that analysts can gain more precise 
and informative results by also taking account of a second set of effects 
associated with organizational change, which they term content effects;
these reflect the potential improvement in consistency (and, presumably,
a concomitant decline in labor force turnover) from abandoning
an initial model that was incoherent and relatively unfamiliar. (For
reviews of the evidence on these two types of effects of organizational
change, see Barnett and Carroll [1995] and Carroll and Hannan [2000],
chap. 16.) In our context, this implies that the potentially disruptive effects
of model change might depend on an interaction between origins (the 
particular blueprint initially espoused by the founder) and destinations 
(the new blueprint). Altering deeply embedded organizational premises is 
likely to be most disruptive for firms that began with a coherent blueprint 
(i.e., one of the five basic model types). Changing the model should be 
less disruptive for firms that began with an aberrant blueprint. In par- 
ticular, for moves from a nontype blueprint to one of the five basic model 
types, the disruptive effects of change might be more than offset by en- 
hanced consistency of premises governing employment relations, serving 
to dampen turnover. Conversely, moves from one aberrant model to an- 
other presumably engender little or no improvement in consistency to 
offset the disruptive effects of change, leading us to expect that such 
transitions provoke particularly high turnover. 

HYPOTHESIS 1a.—Changing the employment model increases turnover
most in organizations that began with one of the basic employment models 
and least in organizations that began with an aberrant (nontype) blueprint. 

HYPOTHESIS 1b.—The effect of model change on turnover is larger 
(smaller) for transitions that increase (decrease) a firm’s distance from one 
of the basic employment models. 

7 All hypotheses assume that all other relevant determinants of turnover are
held constant (i.e., ceteris paribus assumptions apply).

Some transitions among basic model types are likely to be more disruptive
than others. We expect that abandoning the commitment and star
models is particularly destabilizing, especially when the transition is to a
bureaucratic model. Firms founded along commitment or star lines are
more likely to bring in a nonfounder CEO and to do so sooner (Hannan
et al. 1996). Hannan et al. speculate that these two models most strongly
implicate the founders in implicit contracts with early employees: in star
firms, star employees are often recruited to the enterprise by a prior
personal connection to the founder(s); in commitment firms, the founder
represents the central figure in the clan. In contrast, the engineering model
seems to represent the Silicon Valley default (Saxenian 1994) and to have
an affinity with bureaucratic culture (Shenhav 1995), suggesting that the
engineering blueprint might be easier both to reach and to abandon
(especially if it is being abandoned for bureaucracy) than other models.
Accordingly, we predict:

HYPOTHESIS 1c.—Abandoning the commitment or star model is more 
disruptive than abandoning the engineering model, especially for transi- 
tions to the bureaucratic model. 

Among companies that retained their employment blueprint over time, 
it seems reasonable to expect turnover to be particularly low in firms that 
adhered to a commitment model. Conversely, as firms age, grow, and 
become more complex, retaining star or autocracy models might prove 
increasingly contentious—in the former case, due to tension between the 
early stars and the rest of the organization, with whom they are increas- 
ingly interdependent; in the latter case, because autocratic control appears 
increasingly capricious and untenable as enterprises become larger, more 
complex, and more differentiated. Hence, we would expect to see higher 
turnover among firms that have retained a star or autocracy model 
throughout their existence. 

We do not advance predictions about the main effect of founder’s em- 
ployment model on turnover. Gross differences in turnover as a function 
of founder’s employment model might reflect the fact that some blueprints 
are inherently less stable (i.e., less likely to persist over time) than others.
As we shall see below, turnover rates differ significantly among founders’
employment models, even after we control for an extensive array of
organizational and environmental characteristics. However, the pattern is
subtle and, in some ways, counterintuitive; it reflects differences in the 
persistence of the various model types as well as their underlying turnover 
propensities. Therefore, rather than offering specific hypotheses, we un- 
ravel the issue empirically below. To capture both the process and content 
effects of organizational change, we examine the effects of changing the 
organizational blueprint per se, supplemented with more fine-grained 
analyses of how turnover varies as a function of stability and change in 
founder’s model (i.e., taking into account both the origin and destination 
blueprint). 


THE DISRUPTIVE NATURE OF TURNOVER 


Ecological perspectives imply that the turnover occasioned by altering 
the premises on which an organization was built should be disruptive. If 
this disruption reflects changes in skills, values, working relationships,
and routines associated with a change in the organizational blueprint,
there is a clear implication for the observed pattern of turnover: 

HYPOTHESIS 2.—The turnover associated with change in an organi- 
zation's employment model is concentrated disproportionately among 
high-tenure employees. 

Note that there is a plausible alternative hypothesis.⁸ A distinctive and
coherent blueprint helps organizations create and sustain a reputation in 
the labor market (Baron and Kreps 1999, chap. 3). Employees who con- 
sider joining a firm that has a history of espousing a particular model are 
likely to have a good sense of what they will encounter. If that model 
subsequently changes, however, it presumably takes some time before this 
change gets recognized widely within the labor market, especially if the 
new model lacks consistency and distinctiveness. Hence, employees who 
join a firm following a change in its model might be more likely to be 
mismatched to the organization and therefore to depart promptly. Ac- 
cording to this argument, changing the model prompts higher turnover 
among recently hired personnel who come to conclude that they do not 
fit the organization. If this story has merit, then we would expect that 
turnover in firms that have changed their employment models gets con- 
centrated among a mismatched new guard, rather than among a disen- 
chanted old guard. 

Another implication of the ecological perspective is that employee turn- 
over should adversely affect organizational performance (at least in the 
short run), particularly in young, knowledge-intensive, technology com- 
panies. To be sure, turnover can have beneficial organizational conse- 
quences, including enhanced innovation and adaptability (Pfeffer 1983). 
And numerous observers of Silicon Valley (e.g., Saxenian 1994) have em- 
phasized the beneficial effects of abundant labor mobility in fostering 
innovation and entrepreneurial opportunity. But the alleged benefits of 
turnover usually pertain to the industry or regional level, rather than to 
individual firms. Moreover, the assertion that turnover on balance proves 
beneficial seems somewhat at odds with the lengths to which many Silicon 
Valley start-up firms go in trying to bind employees (e.g., stock options, 
noncompete agreements, extensive benefits) and the frequency with which 
we heard senior executives in these companies fret about turnover as “a 
problem.” Accordingly, we also predict that: 

HYPOTHESIS 3.—Employee turnover has a negative effect on organizational
performance.


8 We are indebted to Craig Olson for stimulating our thinking on this point.


OTHER DETERMINANTS OF TURNOVER


Other factors must be controlled in assessing the net effect of change in 
organizational blueprints on turnover. In particular, CEO succession 
prompts change in start-up companies. Organizational researchers have 
demonstrated that changes in top management regimes can have powerful 
effects on employee turnover (Friedman and Saul 1991; Kesner and Dalton 
1994; Virany, Tushman, and Romanelli 1992), although past studies have 
focused almost exclusively on turnover among top management teams 
within large corporations. CEO succession likely has broader effects on
employee turnover within the companies that we examine, in which the 
CEO (and particularly the initial founder-CEO) typically is the architect 
of strategy, the chief spokesperson of the organization’s culture, and often 
the catalyst for recruiting key scientific, technical, marketing, and sales 
personnel into the venture in the first place. Hence, the departure of the 
organization’s leader likely disrupts goals, values, routines, social rela- 
tionships, and implicit contracts regarding the nature of employment. Not 
surprisingly, organizational models change more frequently among SPEC 
companies that have changed CEOs,⁹ so it is important to ensure that
any observed effects of model change do not simply reflect CEO change. 

Although these relatively small, young, high-technology companies 
might be more dependent on their founders and leaders than other types 
of organizations, there are ways in which founders can institutionalize 
their conception of the organization’s employment model so that it per- 
sists, even after they depart. In particular, we expect that the longer an 
organization’s initial leadership regime has been in place, the lower the 
subsequent rate of turnover. This prediction is relatively straightforward 
for companies still led by the initial regime: the longer the regime has 
been in place, the more likely it is to have institutionalized a distinctive 
organizational blueprint and screened out employees who do not fit that 
blueprint. But the duration of the initial regime might influence turnover 
even after that regime has ended. This is because longer-lived founding 
regimes will have been better able to establish and institutionalize a
coherent organizational model and supporting culture that enables the
enterprise to attract and retain employees who suit that setting.¹⁰

9 Among firms in which a founder was still CEO, 64.3% were coded as not having
changed on any of the three dimensions of the employment blueprint; among firms
whose CEO in 1994-95 was not a founder, the corresponding figure was 23.6%.

10 In supplementary analyses, we examined whether any effects of CEO succession
on turnover depend on the tenure of the outgoing CEO and/or characteristics of
the incoming CEO, such as whether he/she was promoted from within and, if so,
was a member of the founding team. However, we did not detect any systematic
interaction effects.

Our analyses hold constant the cumulative number of CEOs a firm has
had at a given point in time. One might imagine that leadership changes
have cumulative disruptive effects, so that turnover increases with the
number of CEOs an organization has had over a given interval of time,
controlling for when the last succession event occurred. Alternatively,
organizations might become habituated to executive succession, so that
the first regime change produces considerably more disruption than do
subsequent ones.¹¹ Given these competing predictions, we do not offer a
specific directional hypothesis.

We control for two key milestone events: procuring venture capital and 
making an initial public offering (IPO) of stock. These events might have 
opposing effects on employee turnover, insofar as they generally involve 
an infusion of resources into the organization while also bringing about 
significant changes in organizational arrangements, routines, and (in many 
cases) leadership. In supplementary analyses, we also include controls for 
industry, occupational composition, and the gender mix in the firm’s first 
year of operations to capture variations in the mix of jobs and workers 
across SPEC companies. 

Organizational size, growth, and age also may influence labor force 
turnover. Turnover is likely to be higher in rapidly growing firms for 
various reasons: (1) rapid growth often strains an organization and its 
members; (2) organizations that must scale up their workforce rapidly are 
more likely to make hiring errors, resulting in short-lived appointments 
that are reflected in high turnover; and (3) if organizational growth is 
correlated with economic growth in Silicon Valley generally (e.g., semi- 
conductor firms are expanding because the cyclical semiconductor in- 
dustry has rebounded), firms might face more labor market competition 
during periods of rapid growth. 

As organizations mature, they systematize employment relations and 
presumably become better informed about the needs and desires of their 
employees. Consequently, it seems reasonable to expect that turnover rates 
will decline with organizational age. Furthermore, older firms seem less 
likely going forward to face the need for major changes in employment 
relationships and employment levels (of the sort that would produce high 
turnover) than firms that are still in their infancy. Nonetheless, we predict 
that turnover actually increases with organizational age in our sample. 
First, growing routinization and bureaucratization as firms age might
impede adaptation to rapidly changing environments (as suggested by
stories of senescence and obsolescence; see Hannan 1998) and encourage
employees to migrate to firms that appear better suited to current
conditions. Second, the relevant technical labor force has a renowned
antipathy to rigid bureaucracy. Moreover, it is widely believed among Silicon
Valley’s labor force that superior financial rewards and technical
challenges come from getting in on the ground floor of a new enterprise,
suggesting that employees might become more likely to depart as their
firm ages.

11 The same negative relationship between cumulative number of CEOs and
turnover could obtain if early generations of leaders confront more fundamental
sources of uncertainty that could influence strategy or if the actions of early
leaders constrain the options of subsequent leaders. For instance, it seems
easier to imagine the first CEO of a firm needing to completely reformulate the
firm’s strategy or shrink the firm’s workforce by 50% than it is to envision the
fifth CEO of a firm facing those same challenges.


DATA, MEASURES, AND METHODS 
Data 


We sent a survey to the executive designated by the CEO as having 
oversight of HR (sometimes, the CEO himself or herself). The survey 
asked the respondent to provide the firm’s annual turnover rate for 1991, 
1992, 1993, and 1994. (For firms that were visited in summer or fall of 
1994, the survey asked about turnover annualized for the first six months 
of 1994, rather than the entire year.) A few firms visited in 1995 provided 
turnover data for the first half of 1995. The survey also requested infor- 
mation on various facets of the company’s HR system and attributes of 
its workforce, including the current tenure distribution and occupational 
and gender composition at the end of the first year of operations and at 
the time of the survey. Of the 173 firms in the study, 101 (59%) returned 
surveys with usable turnover data. We found no evidence of bias asso- 
ciated with the pattern of missing data resulting from some firms not 
returning the HR survey (Baron et al. 1999b, appendix). 

Because some independent variables vary over time, we constructed 
multiple spells for each firm corresponding to the reporting periods for 
which it provided turnover information (calendar years, except for in- 
stances in which firms reported turnover for the first half of the year in 
which they were interviewed). We use pooled cross-section time-series 
techniques to analyze the data (see below). The data set we analyze con- 
tains 327 spells for the 101 firms that provided usable turnover data on 
the HR survey. Of the 101 companies, 8 provided usable information for 
one turnover spell, 18 for two spells, 17 for three spells, and 58 for four 
spells. 
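
The spell counts reported above are internally consistent, as a quick arithmetic check shows. The dictionary layout and variable names are illustrative assumptions; the counts themselves are taken from the preceding paragraph.

```python
# Number of firms that provided k usable turnover spells, as stated in the text.
firms_by_spell_count = {1: 8, 2: 18, 3: 17, 4: 58}

# Total firms, and total firm-spells in the pooled data set.
n_firms = sum(firms_by_spell_count.values())
n_spells = sum(k * n for k, n in firms_by_spell_count.items())

print(n_firms, n_spells)  # 101 327
```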


Measures 


Our dependent variable is the square root of the firm’s annual turnover 
rate. We use the square root transformation to normalize what is otherwise
a skewed distribution (for a variable that can take on zero values) and
because we suspect that most factors that increase turnover (e.g., age,
size) do so at a diminishing rate.¹²

We measure leadership change with an indicator variable denoting
whether a CEO was appointed or left the firm during the spell.¹³ We also
include a measure of the firm’s cumulative number of CEOs as of the
start of a spell.¹⁴ We measure the tenure of the firm’s first leadership
regime as the tenure of the company’s first CEO in the firm, as of the
start of the spell.¹⁵ (This is a time-varying covariate because it increments
in value over time for firms that have only had one CEO.) By definition, 
this measure is missing for firms that had yet to designate a CEO as of 
the start of a spell. (Some Silicon Valley start-ups do not designate a CEO
until this is required by investors, the exigencies of going public, and the
like. This was the case for about 2.5% of the firm-spells in our sample.)
Consequently, in supplementary analyses, we also created an alternative
measure of the duration of the firm’s first leadership regime: first-leader
tenure within the firm as of the start of a spell, for firms that have had
a CEO; or the organization’s age at the start of the spell, if the firm had
not yet appointed a CEO. We reasoned that the first leadership regime
for firms without a CEO involves some sort of shared power arrangement,
in place since the firm’s inception. The findings were unchanged when
we utilized this alternative measure (results available upon request). To
simplify the exposition, we report results only from analyses of spells for
firms that had had one or more CEOs as of the start of the spell. This
has the effect of reducing slightly the number of available cases, thereby
raising our burden in detecting significant effects.

12 The HR survey requested information on the “annual turnover rate” for
specific calendar years. (It was not feasible to gather data on turnover rates by
occupation within these companies.) We were concerned that respondents might
have varied in how they defined this rate. We followed up by telephone with a
sample of respondents, asking how they had defined the turnover rate in filling
out the survey. Most respondents apparently defined turnover as the fraction of
employees present at the beginning of the period who departed by the end of the
period, and a number told us that this is how turnover is generally construed by
Silicon Valley companies. Variations in how respondents defined turnover should
add noise to the data, reducing the likelihood of uncovering significant
statistical relationships. Moreover, in supplementary analyses, we estimated the
effects of time-varying covariates on turnover in fixed-effects specifications
that control for stable characteristics of each firm (including how it defined
turnover). The results were largely unchanged, further suggesting that our
findings are unlikely to reflect differences in how respondents defined the
annual turnover rate.

13 Data on executive succession were obtained from a variety of sources,
including interviews with firm informants, business press articles, annual
reports, other public documents (e.g., 10K filings), and company web sites. When
our data sources indicated that one chief executive departed in a given month
and another person began in that role during the following month, we treated
this as if there had been no interruption between the two. Due to imprecision in
dating the exact founding date of each company, we treated a CEO as having
served since the inception of the firm if, according to our data sources, his/her
tenure as CEO commenced within three months of the company’s founding.
(Otherwise, the data record for that company would indicate that it did not have
a CEO at the beginning of the spell in question.) In the same vein, because we
could not always time venture capital financings or transitions to public status
precisely, we treated a company as having had venture capital or being a
publicly traded entity as of the start of a given spell if our records indicated
that it had received venture capital or went public by the end of the first
month within that spell.

14 The key contrast in the data was between firms being led by their first CEO
vs. a subsequent CEO; accordingly, we dichotomized this measure.

15 Notice that the measure captures how long the first CEO had been in the firm,
not how long he or she had been CEO. This reflects our belief that it is whether
the first CEO was part of the founding team and how much of the firm’s history
he or she witnessed that affects the ability to institutionalize an
organizational blueprint that employees will perceive as legitimate, even if the
first CEO was not appointed to the top job until some time after joining the
company.

We represent founders’ organizational models with binary variables 
corresponding to the typology in table 1. Unless otherwise noted, near-
model types (i.e., firms that differed along only one of the three dimensions
from one and only one of the five basic types) are grouped with their 
corresponding types. This potentially mutes some of the contrasts among 
model categories, but it also increases the number of cases in several 
categories, thereby providing more statistical power for detecting
systematic differences. We measure the magnitude of change in the
organizational model by the number of dimensions (0-3) that changed between
the founder’s blueprint and the blueprint coded from the responses of the 
(then-current) CEO. We also report more fine-grained analyses that allow 
the impact of stability or change in a firm’s model to vary as a function 
of which particular model it started with and/or migrated to. 
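
The magnitude-of-change measure amounts to counting mismatched dimensions between two coded blueprints. The sketch below is a hypothetical illustration of that count; the dictionary representation and the function name are assumptions made here, not the authors' coding software.

```python
DIMENSIONS = ("attachment", "selection", "control")


def dimensions_changed(founder, ceo):
    """Return the names of the dimensions on which two blueprints differ."""
    return [d for d in DIMENSIONS if founder[d] != ceo[d]]


# Hypothetical firm: founded on a commitment blueprint, later coded as
# bureaucracy from the then-current CEO's responses.
founder = {"attachment": "love", "selection": "fit", "control": "peer/cultural"}
ceo = {"attachment": "work", "selection": "skills", "control": "formal"}

changed = dimensions_changed(founder, ceo)
print(len(changed), changed)
```

The length of the returned list is the 0-3 magnitude measure; keeping the names of the changed dimensions as well supports the finer-grained origin and destination analyses.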

We control for how long (in years) the firm had received venture capital 
financing and/or been publicly traded, as of the start of each spell. We 
transform both variables by taking the square root, because we suspect 
that the effect of getting venture capital and/or going public on turnover 
declines sharply with duration, with the major effect capturing the dif- 
ference between firms that had versus had not experienced these events. 

We capture organizational growth by controlling for employment size 
(in the square root metric) at the end of the year preceding each spell and 
at the end of the firm’s first year of operations.¹⁶ Organizational age equals
years since founding, as of the start of each spell. Based on information 


¹⁶ The former measure was obtained from secondary sources (corporate directories,
etc.), interpolated between intervals as necessary. Employment at the end of the first
year of operations was measured based on the HR survey provided by the company. 
(For firms that did not provide an HR survey, this variable was imputed statistically 
from the secondary-source data and other variables related to employment size.) 
provided by founders concerning the timing of various events, we defined
the founding date to be the earliest of three events: legal incorporation, 
hiring the first employee, and start of “normal business operations.” In 
supplementary analyses, we also included controls for time (a linear time 
trend or dummy variables corresponding to the calendar year to which 
each spell corresponded), but these controls were insignificant and did 
not alter the pattern of results. 
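
As a minimal sketch of the age measure, the snippet below takes the earliest of the three founding events and computes age in years at the start of a spell. It assumes, for illustration only, that a spell starts on January 1 of its calendar year and that a missing event date is passed as `None`; both the function names and these conventions are hypothetical.

```python
from datetime import date


def founding_date(incorporation, first_hire, normal_operations):
    """Earliest of the three founding events; unknown dates may be None."""
    known = [d for d in (incorporation, first_hire, normal_operations) if d is not None]
    return min(known)


def age_at_spell_start(founded, spell_year):
    """Organizational age in years as of January 1 of the spell's calendar year.

    The value is negative when the firm was founded during the spell year.
    """
    return (date(spell_year, 1, 1) - founded).days / 365.25


founded = founding_date(date(1990, 3, 1), date(1989, 7, 1), date(1990, 1, 15))
print(founded, round(age_at_spell_start(founded, 1991), 2))
```

Under this convention, a firm founded partway through a spell year has a small negative age at the start of that spell.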

Finally, we represent industry with two dummy variables, one denoting 
firms engaged in manufacturing and the other denoting firms engaged in 
research; the omitted category represents firms in computer hardware or 
software, telecommunications and networking, semiconductors, or med- 
ical devices and biotechnology. (Other industry contrasts were not sig- 
nificant.) In various supplementary analyses, we also controlled for the 
distribution of the firm’s employees in the first year across various broad 
occupational categories (e.g., scientific and engineering; administrative 
and managerial; clerical; sales) and the proportion of the work force that 
was female. These did not alter the basic pattern of effects of change in 
organizational model on turnover; accordingly, their effects are not re- 
ported here (results available on request). 


Estimation 
Our data structure is a pooled cross section and time series. The data are 
unbalanced: the number of observations varies among firms. Recent or- 
ganizational research typically models such data with fixed-effect esti- 
mators, which analyze only the within-firm over-time variation. This 
choice is unappealing in this context because some key independent var- 
iables (e.g., founders’ organizational blueprints and whether the blueprint 
changed) do not vary over time. Instead, we use robust estimators that 
analyze both between-firm and within-firm variation. Specifically, we 
use the method of generalized estimating equations (GEE) developed by 
Liang and Zeger (1986; also see Zeger and Liang 1986). This approach 
generalizes quasi-likelihood estimation to the panel context. Like quasi- 
likelihood, GEE requires specification of only the first and second mo- 
ments of the distribution of the outcome, rather than the full distribution 
as is required for maximum likelihood. Under mild regularity conditions, 
GEE estimators are consistent and asymptotically normal. 

The setup we estimate is the following. The outcome is a firm’s turnover
rate (square root) in a given year. For the $i$th firm, we have $n_i$ observations,

\[ y_i = (y_{i1}, y_{i2}, \ldots, y_{in_i})', \]

and the vector of outcomes can be written as

\[ y = (y_1', y_2', \ldots, y_N')'. \]

The covariates vary over firms and (in some cases) over time for given
firms:

\[ x_{it} = (x_{it1}, x_{it2}, \ldots, x_{itq})', \qquad X_i = (x_{i1}, x_{i2}, \ldots, x_{in_i})'. \]

If we represent the disturbances as

\[ u_i = (u_{i1}, u_{i2}, \ldots, u_{in_i})', \]

then we can write the structural model to be estimated as

\[ y = Xb + u. \]


We expect that the disturbance process will exhibit autocorrelation of 
the usual panel type: observations for the same firm will tend to be cor- 
related due to permanent and gradually changing, unobserved firm prop- 
erties. However, we assume that observations are uncorrelated for dif- 
ferent firms. In particular, we assume that the covariance matrix of 
disturbances has the following form: 


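\[
E(uu') = \phi
\begin{pmatrix}
A_1 & 0 & \cdots & 0 \\
0 & A_2 & \cdots & 0 \\
\vdots & \vdots & \ddots & \vdots \\
0 & 0 & \cdots & A_N
\end{pmatrix},
\]

where $\phi$ is a scale parameter,

\[ A_i = \sigma_i R, \]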


and the matrix R satisfies the properties of a correlation matrix. GEE 
requires a specification of a working correlation matrix. The implemen- 
tation we used—the XTGEE routine within version 6.0 of STATA 
(StataCorp 1999)—allows a menu of choices for the working correlation 
matrix. We experimented with several, including the classic exchangeable 
correlation structure from the standard random-effects setup, as well as 
first- and second-order serial autocorrelation. We found that models fit 
best when we used a completely unstructured working correlation matrix, 
in which each off-diagonal entry is unconstrained and estimated from the 
data. That is, we used as a working correlation matrix: 
\[
R =
\begin{pmatrix}
1 \\
r_{12} & 1 \\
r_{13} & r_{23} & 1 \\
r_{14} & r_{24} & r_{34} & 1 \\
r_{15} & r_{25} & r_{35} & r_{45} & 1
\end{pmatrix},
\]

where the rows and columns correspond to the calendar years represented
in our data set (1991–95).

We conjectured that autocorrelation would decline over time and across 
waves of panels, because the hazard of major shocks that would coun- 
teract autocorrelation in the determinants of turnover is likely to increase 
as firms grow older. Consistent with our conjecture, autocorrelation did 
decline with the temporal distance between spells and across waves of 
panels (e.g., there was stronger autocorrelation between 1991 and 1992 
observations than between the 1991 and 1993 panels or between the 1992 
and 1993 panels), and models permitting this error structure fit consid- 
erably better than models that impose a more constrained error structure. 
We report robust standard errors, using the so-called sandwich estimators 
developed by Huber (1967) and White (1982). 
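
For a linear specification such as this one, the GEE point estimate and its robust covariance take a familiar closed form once a working covariance is fixed. The block below is a standard textbook statement, offered as a sketch rather than a description of the routine's internals. Here $X_i$, $y_i$, and $A_i$ denote firm $i$'s covariates, outcomes, and working covariance block (restricted to the years observed for that firm), and $\hat u_i = y_i - X_i \hat b$ is the residual vector.

```latex
\hat b = B^{-1} \sum_{i} X_i' A_i^{-1} y_i ,
\qquad
B = \sum_{i} X_i' A_i^{-1} X_i ,
\qquad
\widehat{\operatorname{Var}}(\hat b)
  = B^{-1} \Bigl( \sum_{i} X_i' A_i^{-1} \hat u_i \hat u_i' A_i^{-1} X_i \Bigr) B^{-1} .
```

A common scale factor on the $A_i$ cancels from the point estimate. In practice the working correlation is itself estimated from residuals and the two steps are iterated; the "sandwich" form of the covariance is what keeps the standard errors valid when the working correlation is misspecified.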


DESCRIPTIVE STATISTICS 


Table 2 summarizes the pattern of transitions from founder to CEO mod- 
els. Among firms that we classified as having a particular model at found- 
ing, the table reports the fraction that we classified as having each type 
of organizational model in 1994-95, based on our interviews with CEOs. 
Row (1) in the table reports results based on classifying the founder’s 
model as aberrant unless it corresponded perfectly to one of the five basic 
model types in table 1. Row (2) groups near-model-type firms (i.e., those
that differed in only one dimension from only one of the five basic types) 
with their corresponding basic type category. The parenthesized results 
in table 2 pertain to the entire sample of SPEC firms for which we had 
the requisite information to code the founder’s and CEO’s organizational 
model, whereas the results without parentheses are for the subset of firms 
providing valid data on employee turnover (and thus used in our analyses). 

Table 2 provides descriptive background, but it is substantively infor- 
mative in several respects. The diagonal entries in table 2 reveal that the 
commitment and bureaucratic models—which in many respects represent 
polar extremes—are the most persistent over time (i.e., have the smallest
fraction of firms that transitioned to a different model). And, despite the
relatively frequent shifts in the sample toward bureaucracy, no firms
founded along commitment model lines made that transition. Table 2 also 
portrays the engineering model as relatively compatible with other
organizational models: a relatively high fraction of firms founded along
engineering model lines transitioned to a bureaucratic model (and vice 
versa), and the engineering model seems to be a destination that is reached 
with some frequency by firms irrespective of their founding model (except 
for firms founded as autocracies). As we shall see below (table 5), the 
transitions that occur with relatively low (high) frequency in table 2 are 
generally the transitions that occasion relatively more (less) employee turn- 
over. In other words, firms seem less likely to have made the most “turn- 
over-prone” transitions than to undertake the less turnover-prone ones. 
This gives us some confidence that the model types capture distinct or- 
ganizational blueprints and suggests that the architects and leaders of 
firms are mindful of the disruptive consequences of changing organiza- 
tional blueprints. 

Table 3 reports descriptive statistics for all 327 spells containing valid 
turnover data. The annual turnover rate averages approximately 13%, 
though there is obviously substantial variation. (Though not reported in 
table 3, among the 93 firms with two or more turnover spells, 54.9% of 
the variation in turnover is between firms; for the square root of turnover, 
the corresponding figure is 62.6%.) Note that modest differences in turn- 
over, if sustained over time, can have quite dramatic implications for 
organizations. For instance, according to table 3, firms in the sample were 
on average about 4.5 years old at the start of a spell.¹⁷ Consider the
cumulative effect of being one standard deviation above the sample mean 
on turnover (26% vs. 13%). If annual turnover remained constant at 13% 
for a cohort over time, then after four years, 57% of the original cohort 
would remain; after six years, the fraction is 43%. In a firm experiencing 
26% turnover, just under 30% of the original employees would still be 
there after four years, and only 16% after six years. (The picture does not 
change much if we incorporate more reasonable assumptions about turn- 
over declining with tenure.) Such differences in the representation of the 
old guard seem likely to have significant organizational implications. 
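
The cohort arithmetic above can be reproduced with a short sketch. It assumes, as the text does for this illustration, a constant annual turnover rate applied each year to the surviving members of the original cohort; the function name is hypothetical.

```python
def share_remaining(annual_turnover, years):
    """Fraction of an original cohort still employed after `years`,
    assuming a constant annual turnover rate applied to the survivors."""
    return (1.0 - annual_turnover) ** years


for rate in (0.13, 0.26):
    for years in (4, 6):
        share = share_remaining(rate, years)
        print(f"turnover {rate:.0%}, {years} years: {share:.1%} remain")
```

At 13% turnover the sketch gives roughly 57% and 43% remaining after four and six years; at 26% it gives just under 30% and about 16%, matching the figures in the text.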

A change in top leadership occurred during 11.6% of the spells. In 
about 2.5% of the spells, firms had not yet appointed a CEO; for nearly 


¹⁷ Note that the minimum organizational age reported in table 3 is -0.42. For a few
cases (just under 4% of spells), the birth of the organization (based on our criteria for
defining age; see text) occurred sometime during the spell. To be treated as a valid
observation and included in our sample, a firm must have existed for more than half
the year to which the turnover data corresponded, and the company had to have
provided turnover information for the year in question. This was done to avoid cases
in which, for instance, a firm might have reported turnover data for 1991 but our
measure of organizational age indicated that the firm came into existence in November
or December of 1991, so the firm’s turnover report pertained to an extremely short
period and thus was error-prone.
a quarter of the spells, the firm had already experienced two or more
CEOs as of the start of the spell. 

Among founders, the engineering model was the most prevalent (34%). 
We coded roughly 7% of the founders as having an autocratic model, 
13% as commitment, 11% as star, and just under 6% as bureaucratic. 
Thirty percent of founders gave responses that did not fit into any basic 
model type (or one of the near-types); we coded their blueprints as aberrant 
or nontype. The typical firm in our sample experienced change in its 
organizational model along one dimension, based on our classification of 
the blueprints associated with CEOs at the time of our interviews versus 
the models envisioned by founders. (For 44.1% of the firms and 42.4% 
of the spells, the founder and CEO models were identical on all three 
dimensions.) Not surprisingly, model changes were more frequent and 
extensive in firms that had also changed leadership. Among companies 
with valid employee turnover data, 44% were still led by a founder in 
1994-95; of those, 57% were coded as not having changed the blueprint 
on any dimension and only 12% had changed on two or more dimensions. 
Among companies with a nonfounder CEO by 1994-95, only 23% had 
not changed the model on any of the three dimensions, whereas 40% 
changed on two or more dimensions. 


RESULTS 
Effects of Organizational Model and Change in Model on Turnover 


Table 4 reports results from multivariate analyses predicting the square 
root of turnover for each firm-year spell. In a simple bivariate regression 
(not shown in table 4), the gross effect of blueprint change (number of 
dimensions that differed between the founder’s and CEO’s blueprints) is 
0.660 (z = 5.084, P < .001). Thus, relative to firms with stable employment
blueprints, a firm in which the blueprint changed on all three dimensions
is predicted to have a turnover rate that is (3 × .660)² = 3.92 points
higher.¹⁹ This strong positive effect persists after controlling for other
determinants of turnover. Model 1 in table 4, for instance, adds dummy 
variables depicting the founder’s blueprint (with bureaucracy as the omit- 
ted category). Unexpectedly, firms founded on a bureaucracy model have 
lower turnover rates than firms founded on different blueprints, and that 


¹⁸ These percentages are based on combining “neartype” cases with their model-type
counterparts. For “basic model type” firms alone, autocracy is 4.3%, commitment is
6.3%, star is 9.3%, engineering is 33.8%, and bureaucracy is 5.6%.

¹⁹ This bivariate regression (based on 302 observations, 93 firms) has a constant of
2.594 (z = 14.440, P < .001); scale parameter = 2.863; Wald χ² = 25.85 (df = 1,
P < .001).
effect persists in model 2, which controls for a variety of organizational
characteristics. 

Of course, the effects of founders’ models might reflect differences 
among the blueprints in their persistence and in the types of transitions 
they will likely experience, which are not captured simply by the number 
of dimensions that changed. As a first means of examining that possibility, 
model 3 adds dummy variables depicting the CEO’s blueprint (with bu- 
reaucracy as the omitted category). The main contrast among CEO blue- 
prints is between autocracy, which exhibits the highest turnover rate net 
of other controls, and the commitment model, which displays the lowest. 
Accordingly, model 4 in table 4 presents the same specification with the 
commitment blueprint as the reference category for founder’s and CEO’s 
organizational model.²⁰ Model 4 reveals that firms whose CEOs’
blueprints were autocracy or bureaucracy had significantly higher turnover
than otherwise comparable companies whose CEOs had a commitment 
model. According to model 3, firms whose CEO had a bureaucratic model 
also display significantly higher turnover than firms whose CEO model 
was classified as star. 

It is important to acknowledge the causal ambiguity involved in relating 
CEO model and change in the model to turnover. Because we cannot 
date when a firm’s model changed (if it did), turnover might be the cause, 
not a consequence, of the CEO’s present-day model. We return to this 
issue below and present some results that provide reassurance against 
this possibility. 

Models 3 and 4 of table 4 suggest that the effects of stability and change 
in organizational blueprints depend substantially on the specific blue- 
print(s) involved. For instance, relative to a firm that retained a bureau- 
cratic model (the reference category in model 3), a stable autocracy is 
predicted to have experienced a turnover rate that is
(0.365 + 1.459)² = 3.33 points higher. Or consider two firms whose models changed
on two dimensions: firm A changed from engineering to commitment; and 
firm B shifted from star to bureaucracy. Relative to firms with a stable 
bureaucracy blueprint (the reference category in model 3), annual turnover 
is predicted to be only 0.90 higher in firm A, compared to 7.00 higher in 
firm B.²¹ When compared to firms with a stable commitment blueprint


²⁰ Results for model 1 are comparable when reestimated on the same 271 cases as
models 2–4 in table 4.

²¹ For the star to bureaucracy transition, e.g., the predicted effect in model 3 is the
main effect of a star founder’s model (1.600), plus the main effect of a bureaucracy
CEO’s model (0), plus the effect of changing the model on two dimensions
(2 × 0.523), for an overall effect of 2.646 in the square root metric or 7.00 on the
turnover rate.
(the reference category in model 4), firm B is predicted to have an annual
turnover rate that is 5.43 higher, whereas firm A’s is only 1.51 higher. 
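
The predicted contrasts quoted in this section all follow one piece of arithmetic: sum the relevant additive effects in the square-root metric, then square the total to express the contrast in turnover points. The sketch below mirrors that arithmetic using coefficients quoted in the text and footnotes; the function name is hypothetical, and the calculation reproduces the text's convention rather than a general formula for effects at an arbitrary baseline.

```python
def turnover_points(*sqrt_metric_effects):
    """Sum additive effects in the square-root metric, then square the total,
    following the back-transformation used in the text."""
    return sum(sqrt_metric_effects) ** 2


# Bivariate model: blueprint changed on all three dimensions.
three_dimension_change = turnover_points(3 * 0.660)

# Model 3: stable autocracy relative to a stable bureaucracy.
stable_autocracy = turnover_points(0.365, 1.459)

# Model 3: star founder effect + bureaucracy CEO effect (reference, 0)
# + change on two dimensions.
star_to_bureaucracy = turnover_points(1.600, 0.0, 2 * 0.523)

print(round(three_dimension_change, 2),
      round(stable_autocracy, 2),
      round(star_to_bureaucracy, 2))
```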


Changing the Organizational Model: A Closer Look 


The additive specification in table 4 potentially masks interactive effects 
of origins and destinations in determining the disruptive effects of chang- 
ing the organizational blueprint. To differentiate between the process and 
content effects of organizational change, table 5 provides a finer-grained 
portrait of how turnover varies as a function of stability and change in 
founders’ employment blueprints. It reports estimates of specifications 
that incorporate the same set of covariates as in model 2 of table 4, but 
we replace the covariates representing founders’ blueprints and the num- 
ber of dimensions that changed with a vector of dummy variables rep- 
resenting specific combinations of founder and CEO blueprints. The par- 
ticular combinations of origin and destination states incorporated in table 
5 capture the main contrasts that we thought interesting (how close to 
one of the five basic model types; distance between origin and destination 
model; and so forth) and the main transitions observed in our data. (For 
several transitions of particular substantive interest, the number of cases 
was very small, but we have nonetheless reported the detailed results.) 
The coefficients in table 5 represent predicted differences in turnover 
(square-root metric) between firms that experienced a given transition and 
two different reference categories: stable bureaucracies (the first set of 
coefficients in table 5); or all firms that did not change their employment 
blueprint (the second set of coefficients). Table 6 reports significance tests 
on contrasts between specific coefficients (rows) in table 5, rather than 
contrasts vis-a-vis the reference categories. 
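
For readers checking the reported significance levels, the following sketch converts a test statistic into a two-tailed probability. It assumes the reported statistics are asymptotically standard normal z values, which is the usual basis for tests on GEE coefficients; the function names are hypothetical.

```python
import math


def two_sided_p(z):
    """Two-tailed p-value for a standard normal test statistic."""
    return math.erfc(abs(z) / math.sqrt(2.0))


def wald_chi2(z):
    """For a single coefficient (df = 1), the Wald chi-square equals z squared."""
    return z * z


print(round(two_sided_p(1.659), 3))  # contrast reported with P = .097
print(round(wald_chi2(5.084), 2))    # bivariate model's Wald statistic
```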

Table 5 suggests two basic conclusions. First, abandoning a basic model 
type was generally associated with higher turnover, in support of hy- 
pothesis 1a (but see below). Consistent with hypothesis 1c, the disruptive 
effect of abandoning a basic model type in favor of an aberrant blueprint 
seems especially large for firms founded along commitment or star lines 
(see comparison in table 6 between rows 7 and 8).²² Second, consistent
with hypothesis 1b, changing to a basic model type was less disruptive 
than moving to an aberrant blueprint. For instance, among firms that 
abandoned the commitment or star model, those migrating to the engi- 
neering model had somewhat lower turnover than those adopting an 


²² Even moving to the engineering model (a model that seems robust and flexible) from
the commitment or star model seems slightly more disruptive than migrating to the
engineering blueprint from any other pure model (cf. rows 12 and 13), though table 6
reveals that the contrast is not significant (P = .15).
aberrant blueprint (see comparison in table 6 between rows 7 and 12;
P = .097, two-tailed).²³

The bureaucracy and autocracy models do not fit this pattern, however. 
Generally, moving to either of them increased turnover, whereas aban- 
doning either of them reduced turnover. Moving from any other basic 
model to bureaucracy increased turnover considerably, though the ap- 
parent dislocation was considerably smaller among firms that migrated 
to bureaucracy from the engineering model (cf. rows 10 and 11), consistent 
with the claimed compatibility between engineering culture and
bureaucratic culture. This result illustrates that origins and destinations matter:
movement to bureaucracy from either the engineering model or an
aberrant blueprint produced significantly less turnover than did
abandonment of another basic model (especially the star type) for bu- 
reaucracy (row 11). Similarly, in one case abandoning a basic model seems 
to have reduced turnover: movement from bureaucracy to engineering 
(row 13). 

These findings suggest that, in at least some cases, the disruptive effects 
associated with altering an organization’s employment model are more 
than offset by an increase in the attractiveness of the new model. In short, 
the content of organizational changes affects how disruptive they are. 
Dismantling the commitment and star models apparently was most de- 
stabilizing, whereas any disruptive effects associated with dismantling 
bureaucracy seem to have been more than offset by favorable responses 
to the change, resulting in lower turnover. Conversely, in shifts to bu- 
reaucracy or autocracy, the virtues of a clear and consistent model as a 
destination state were offset considerably by employees’ apparent strong 
dislike for these particular models. 

Another result in table 5 suggests that the basic model types represent 
relatively desirable origin and destination states, in terms of minimizing 
labor turnover: companies that replaced one aberrant or nontype blueprint 
with another experienced especially high turnover (row 9). Only firms that 
retained or adopted autocracy or that migrated to bureaucracy from an- 
other basic model other than engineering had higher predicted turnover 
levels (cf. rows 14, 15, 15a, and 15b). 

In other words, if a founder adopted an aberrant HR model, then 
changing to another aberrant model made the organization especially 
vulnerable to turnover. The contrasts among these effects are not highly 
significant, however, for several reasons. First, the transition from an 


²³ All firms that migrated to bureaucracy from a model type other than engineering
were founded along star-model lines (see row 11). This transition was characterized 
by particularly high turnover, because abandoning the star model and migrating to 
bureaucracy both seem to be destabilizing for technology companies. 
TABLE 6
ADDITIONAL CONTRASTS AMONG COEFFICIENTS IN TABLE 5


Row Contrasts          Coefficient        z    prob > |z|
7 versus (2, 3)            1.244       2.161      .031
7 versus 8                 1.118       1.930      .054
7 versus 12                1.598       1.659      .097
9 versus (5, 6)            2.183       3.888      .000
9 versus 8                 1.134       1.771      .077
9 versus 14                1.134       1.583      .114
9 versus 15                 .528        .766      .444
9 versus 15a                .918       1.279      .201
9 versus 15b               -.925      -1.698      .090
10 versus (4, 5, 6)        1.114       2.577      .010
10 versus 11              -1.205      -2.774      .006
10 versus 13               2.134       6.514      .000
12 versus 13               1.252       1.433      .152
14 versus (5, 6)           1.031       1.847      .065
14 versus 15               -.606       -.909      .363
14 versus 15b             -2.073      -3.495      .000
15a versus 15b            -1.843      -2.718      .007
15 versus (5, 6)           1.629       3.432      .001


aberrant blueprint to one of the five basic models compounds two op- 
posing effects—the disruptive effects of altering the employment model, 
and the (presumably) beneficial effect of adopting a consistent model. 
Second, the benefit associated with shifting to a basic model type depends 
on the underlying attractiveness of that model. For instance, relative to 
firms that retained an aberrant blueprint, companies that shifted to one 
of the basic types generally experienced higher turnover, consistent with 
the notion that changing the premises governing employment relations 
disrupts an organization’s equilibrium. 

The magnitude of the effect of change depends significantly on the 
destination (the new blueprint). Consider several examples. For transitions 
to the commitment or star blueprint from an aberrant model (which were 
few in number), predicted turnover was only modestly higher (1.031) than 
among firms that stayed with a particular aberrant blueprint (z =
1.847; P = .065). Moving to the engineering model from a nontype
blueprint occasioned somewhat higher turnover, significantly more than
among firms that retained a particular aberrant blueprint (if row 15a in 
table 5 is contrasted against rows 5 and 6, b = 1.300; z = 2.704; P = 
.007; supplementary analyses not shown in table 6). The transition from 
an aberrant model to bureaucracy also produced significantly higher
turnover than occurred among companies that retained their founder’s
aberrant blueprint.²⁴ Migrating from an aberrant form to autocracy, although
quite rare, seems to have been even more disruptive; firms making this 
transition experienced significantly higher turnover than companies that 
retained an aberrant blueprint (b = 3.193; z = 6.389; P < .001;
supplementary analyses not reported in table 6), and even somewhat higher than
in companies that cycled from one aberrant model to another (see contrast 
between rows 9 and 15b in table 6). 

In short, model consistency and cultural resonance are not virtues in- 
dependent of the model’s content. Rather, the effect of abandoning or 
moving to one of the five basic models seems to depend quite a bit on 
the specific model. The commitment and star models appear to be par- 
ticularly risky to dismantle and less contentious to adopt. In contrast, 
moving from bureaucracy and autocracy entails little disruption, whereas 
moving toward these models seems especially unsettling. 

The fine-grained analyses in tables 5 and 6 cast light on the differences 
in turnover rates as a function of founder’s employment model that were 
reported in table 4. Among firms that retained their original employment 
blueprints, the only model types with significantly higher turnover than
bureaucracy were autocracy and star. According to table 5, autoc- 
racy—whether adopted at the firm’s inception or subsequently (cf. rows 
1 and 15b)—fosters high levels of employee turnover. This is hardly sur- 
prising, given the considerable education, experience, and professionalism 
among Silicon Valley’s labor force; the intense competition among em- 
ployers for key personnel; and employees’ high expectations for autonomy 
and self-actualization at work. The star model is widely perceived in 
Silicon Valley as turnover-prone, due to perceived inequities it can foster, 
the high mobility of scientific and technical stars to whom the model is 
targeted, and the tendency of star-oriented firms to rely more on stock 
options (which either stay underwater or else vest and are exercised, in 
either case making the firm vulnerable to turnover). 

Other differences in turnover rates as a function of the founder’s model 
are not significant (in table 5) after controlling for the specific transitions 
that firms experienced. Although turnover appeared (in table 4) to be 
lower in firms founded along bureaucratic lines than in companies 
founded on commitment, engineering, or hybrid models, tables 5 and 6 
reveal that this is simply because the latter firms were more likely to 
change their employment models—which on its own is destabilizing—and 


²⁴ In supplementary analyses, we subdivided row 10 into firms that transitioned to
bureaucracy from the engineering model versus from an aberrant (nontype) blueprint.
Turnover within the latter group was significantly higher than among firms with a
stable neartype or nontype blueprint (b = 1.735; z = 2.899; P = .004).
to make the kinds of changes that foster higher turnover (i.e., toward
bureaucracy, autocracy, or an aberrant blueprint). 

Yet table 5 indicates that no other employment model, retained over 
time, displays significantly lower turnover than bureaucracy. We find this 
result somewhat surprising in light of the apparent low regard with which 
Silicon Valley employees view bureaucracy, as evidenced by the reductions 
(increases) in turnover that seem to accompany firms’ abandoning (em- 
bracing) a bureaucratic model (see rows 10, 11 and 13 in table 5). We 
speculate that firms founded along bureaucratic lines might have antic- 
ipated long-term growth and hired employees better prepared to handle 
the transition to a more mature organization, whereas that transition 
requires more “churning” of personnel in firms built on other models.²⁵
(This is especially plausible to the extent that founders who embraced a 
bureaucratic model had prior exposure to larger, well-established tech- 
nology companies.) Some evidence consistent with this speculation comes 
from the summary notes provided by a member of our research team 
after interviewing a founder of one of the firms we coded as adhering to 
a stable bureaucratic model. That founder had previously worked in one 
of Silicon Valley’s largest and most prominent companies: 


During her tenure, Z (one of the founders) has put in place systems and 
controls that would be appropriate for a far larger company. Z has prepared 
for the growth (little so far) that the company anticipates and she points 
out that some of the company’s systems have yet to be fully utilized. The 
firm under Z seems to be employee friendly. Z has a reputation for great 
relations with her employees—or her “people,” as she’s been known to call 
them—a fact that is borne out in the firm’s low turnover. She mentioned 
two former employers (large, well-established technology companies) as
influences with regard to organizational design. . . . CEO Z says she is the
person most responsible for HR at the company. HR at the firm is part of
strategic planning. When the company reaches 50 employees . . . a
full-time HR specialist will be hired. Indeed, Z knew exactly where in her file
cabinet to find the company’s employee handbook, developed in the
company’s first year as Z’s first task as CEO. . . . In addition to the employee
handbook, she also instituted in the first year written performance evalu- 
ations, nondisclosure agreements (NDAs), and instituted regular company- 
wide meetings. (ID #141) 


Another company that we classified as a stable bureaucracy seems to 
fit a similar pattern—planning in advance for rapid growth by a founder 


²⁵ Precisely because bureaucracies are rather atypical within Silicon Valley, it is also
possible that firms founded along such lines transmit clearer and more accurate
preemployment expectations for individuals joining the firm, relative to organizations
built on other models (particularly the commitment, star, and autocracy models),
resulting in higher posthiring attrition in the latter.
with prior work experience in large, well-established technology
companies. This firm’s founder-CEO, a Ph.D. scientist who had worked
previously in several large electronics, semiconductor, and defense companies,
anticipated the explosive growth in portable computers and in “smart 
devices” (digital cameras, mobile phones, personal digital assistants) and 
hoped to compete against Intel and Advanced Micro Devices in producing 
flash memory storage devices. He sought to capture early mover advan- 
tage by developing technology, key supplier and customer partnerships, 
and organizational capability in advance of a market that was only just 
emerging when the company was launched: 


Founder. The catalyst for starting the company was that I believed that 
there was going to be a need in the marketplace for the kind of products 
that I had in mind several years after we were going to start a company. 
... I felt I had a solution to a problem that was not yet evident as a problem. 


Interviewer. (Did you have) a model or a blueprint for organizing the com- 
pany at the beginning, whether from your past experience or from other 
companies that you admired? 

Founder. Yeah, it was basically, the first five years we would be a functional
organization, we would have VP’s for the various functions, no separate 
divisions ..., and have very good executive staff, hire a good CFO, a good 
VP of engineering, VP of technology, VP of operations, traditional structure. 
And I did not expect that (to) change in the first five years. I did not look 
beyond the first five years. (ID #19)* 


In sum, differences in turnover across founder models for the most part 
reflect differences in the vulnerabilities of the particular models to sub- 
sequent change and the dislocating effects of such changes. More broadly, 
the results in tables 5 and 6 provide some indirect validation of our 
typology of employment models. For instance, the high turnover char- 
acteristic of firms that moved between aberrant blueprints suggests that 
consistency of the organization’s employment model does indeed reduce 
turnover. Moreover, transitions among particular origin and destination 
models vary in sensible and predictable ways in affecting turnover (e.g., 
the dislocating effects of abandoning the commitment or star model and 
of migrating toward bureaucracy or autocracy). Altering the employment 
blueprint generally destabilizes organizations, especially when the blue- 
print is relatively coherent. However, the content of the changes can 
attenuate or exacerbate the process effect (the tendency for changing the 
employment model to foster turnover). 


* Interestingly, when asked whether he had any specific company in mind as an 
organizational model, the founder mentioned Intel, the largest firm in the industry. 
OTHER DETERMINANTS OF TURNOVER RATES


We briefly summarize the effects of other variables on firms’ turnover
rates in table 4. We find no net effect of change in CEO during a spell 
on turnover. Interestingly, the gross effect (not shown in table 4) is sizable 
and statistically significant (b = 0.687; z = 2.539; P = 0.011), but it becomes
insignificant once we control for change in the founder’s organizational
model.” In other words, executive succession is associated with
turnover primarily because it is associated with model change. The fact 
that the (time-varying) effect of executive succession is mediated by the 
(time-invariant) effect of the number of model dimensions that changed 
is substantively important. It gives us some confidence that change in the 
blueprint operates as a cause, rather than as a consequence or correlate, 
of turnover, even though we cannot be certain that all changes in blue- 
prints preceded the turnover spells they are supposed to predict.™ 

We find only weak evidence of the hypothesized imprinting effect of 
the organization’s first CEO. Controlling for organizational age, the more 
of the firm’s history that was “presided over” by the first CEO, the lower 
the subsequent turnover.” However, this effect is moderated somewhat 
and becomes statistically insignificant when other variables (specifically, 
blueprint change) are controlled for. 

According to model 2 of table 4, firms led by their second or subsequent 
CEO experienced slightly lower turnover.” However, this effect tends to 
be pronounced only in specifications that also control for model change. 


2 In a specification with CEO change (time-varying) and number of dimensions of
model change (time-invariant) as regressors, the effect of CEO change is 0.280 (z =
1.494; P = .135), whereas the effect of model change remains strong and significant
(b = 0.643; z = 4.931; P < .001).
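The significance levels quoted with each coefficient can be checked against the accompanying test statistic. The sketch below assumes that the statistic is a standard-normal z ratio and that the test is two-tailed; the function name is hypothetical and is not part of the original analysis.

```python
from math import erfc, sqrt


def two_tailed_p(z):
    """Two-tailed P value for a standard-normal test statistic z.

    Uses the identity 2 * (1 - Phi(|z|)) = erfc(|z| / sqrt(2)).
    Assumes the reported statistic is asymptotically standard normal.
    """
    return erfc(abs(z) / sqrt(2.0))


# Gross effect of CEO change: the text reports a statistic of 2.539.
p_gross = two_tailed_p(2.539)

# Net effect of CEO change once model change is controlled: 1.494.
p_net = two_tailed_p(1.494)
```

Under these assumptions the two values come out close to the reported P = 0.011 and P = .135, which is consistent with reading the statistic as a z ratio.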

™ We also find a modest positive effect of changing the CEO on turnover in a standard
“fixed effects” model that controls entirely for between-firm variation in turnover and
includes only time-varying independent variables (results available on request). However,
the size and significance of the effects are reduced considerably if a time-varying
measure of the cumulative number of CEOs in the firm is also added to the specification,
which has a significant negative effect on turnover in the fixed-effects model.

” In a specification with just those two covariates, first CEO duration has a strong
negative effect on turnover (b = -0.186; z = -2.345; P = .019).

% In supplementary analyses that included spells for firms that had not yet designated 
a CEO, we also found that companies experienced significantly lower turnover before 
appointing their first CEO (relative to the omitted category, first CEO), even controlling 
for organizational age (results available on request). We suspect that appointing a CEO 
after a firm has functioned for some time without one destabilizes organizations for 
two reasons: (1) it signals a move to a more business-like and hierarchical approach,
which often conflicts with the original vision of the founders; and (2) designating one 
of the founders (or an outsider) as CEO after a period of collective control often causes 
the remaining founders to feel disenfranchised and to depart, sometimes taking other 
employees with them. 
This is because firms with a second or subsequent CEO are significantly
more likely to have experienced a model change, resulting in higher turn- 
over; net of that tendency, however, such firms actually display somewhat 
lower turnover. In other words, if we compare two firms that both altered 
their original blueprint dramatically, the estimates of model 2 of table 4 
imply that a firm still being run by its first CEO will have slightly higher 
turnover than one in which the initial CEO has been replaced. We spec- 
ulate that this result reflects the nature of implicit contracts. Founders 
generally establish the implicit contracts with employees that are embed- 
ded in the organizational blueprint; hence, it might be more contentious 
for a founder-CEO to alter that blueprint and to remain at the helm, as 
a continuing reminder to employees of how the enterprise has strayed 
from its initial model, than it is for a newcomer CEO.” 

Turnover increases with the length of time since a firm first received 
venture capital. Venture capitalists (VCs) often help young organizations 
formalize and fill key managerial and technical roles (Baron et al. 1999b), 
especially as their relationship with the firm develops over time, because: 
(1) they often maintain an initial hands-off approach; (2) if the enterprise 
is proceeding on a path towards going public, VCs want to ensure that 
the firm possesses the management capability and organizational routines 
to handle this transition; and (3) if the firm is doing poorly, VCs will 
frequently insist on changes in management and/or organizational routines 
(using their board positions to bring this about). The effect of venture 
capital in model 2 of table 4 might capture turnover that is induced when 
VCs facilitate bureaucratization of firms and the replacement of early 
employees.” Turnover appears to decrease somewhat the longer a firm 
has been public. This result is consistent with qualitative observations 
from our interviews about disruptive effects that accompany the initial 
public offering (IPO). Respondents noted that going public can disrupt 
organizations in a variety of ways—for instance, requiring new sorts of 
skills (e.g., financial reporting, public and investor relations) not previously 
represented among employees, which might require both adding new 


” Consistent with that conjecture, adding an interaction term between the cumulative 
number of CEOs and the number of dimensions that changed in the blueprint to model 
2 of table 4 produces a negative effect that approaches statistical significance (b =
-0.169; z = -1.634; P = .102, two-tailed), and the main effect for number of CEOs
is no longer significant. Thus, turnover associated with changing the model is greatest
for companies still being led by their first CEO at the beginning of a spell.

* Consistent with that interpretation, note that the effect of venture capital on turnover 
becomes insignificant after controlling for the CEO’s organizational blueprint (model 
3), which VCs presumably are likely to help shift toward bureaucracy. In addition, 
supplementary analyses reveal that firms in which the CEO championed the bureau- 
cratic model were significantly more likely to have venture capital and to have had 
it longer than all other firms. 
kinds of employees and replacing some individuals as well. Moreover,
respondents emphasized that going public often represents a financial and 
symbolic milestone, which, once surpassed, can leave a motivational void. 
One founder-CEO of a company engaged in biotechnology instrumen- 
tation told our research team: 


We worried about the IPO a lot because from the earliest days that was a
clear corporate focal point. Get to the IPO point, get the company public. 
It’s the big payoff for people who have stock. Every person in our company 
is a stockholder. We grant them options when they join. Everyone worked 
very hard for six years to get to that point. Our concern was, after the IPO 
and after the lockups expire (so that) people have the ability to sell stock, 
we were concerned what the motivation levels in the company would look 
like (and) what we could do to influence that motivation level. One thing 
we are working very diligently on right now is identifying what the next 
corporate milestone will be. 25%-30% growth isn't the kind of corporate 
objective or singularity of purpose that gets people riled up. We are looking 
for something a little more specific, like that $100 million benchmark. We’re 
in the process of making a final decision of what that overall, superordinate 
goal is going to be. (ID #23)


The negative relationship between duration as a public company and 
turnover might also reflect vesting of stock options. Among the spells 
involving public companies, the typical duration as a public company 
was only about two years (with more than 85% of spells involving com- 
panies that had been public less than four years). The tendency for turn- 
over to decline with duration as a public company might reflect the in- 
creasing sway that stock options have as employees approach the vesting 
date when those options can be exercised (often three to five years after 
the initial public offering).” 

As predicted, organizational age has a strong positive effect on employee 
turnover, consistent with the notion that younger technology companies 
are perceived as offering greater technological challenges and larger pros- 


* In “spline” specifications that restrict the effect of duration to companies that were 
already public, the effect is still strongly negative (b = -0.567), though only marginally
significant (P = .068). We also experimented with numerous other specifications, including
dummy variables denoting whether these events occurred in a given (or
immediately prior) spell and specifications incorporating a time-invariant effect of having
venture capital or being publicly traded. In some of these models, we found significant 
effects of having received venture capital or gone public in the prior spell, which 
worked in the opposite direction from the effect of duration—that is, turnover declined 
in the spell following VC funding, but increased subsequently; and turnover increased 
in the spell after companies went public, but subsequently declined the longer firms 
had been publicly traded. However, these lagged effects do not persist if employment 
change in the preceding spell is controlled, suggesting that the lagged effects on turnover
of receiving VC financing or going public simply reflect the short-term impact of those 
events on employment growth and decline. 
pects for financial gain through stock option grants. In some specifications,
organizational growth increased turnover, but the magnitude and signif- 
icance of this effect varies depending on the particular specification and 
set of observations involved. Turnover tends to be lower in research and 
manufacturing companies (as contrasted with computer hardware and 
software, medical devices and biotechnology, semiconductors, and tele- 
communications and networking). Finally, in supplementary analyses, we 
controlled for occupational and gender composition at the end of the first 
year of operations for the subset of companies that provided this infor- 
mation. Those controls do not appreciably alter the effects associated with 
founders’ employment blueprints or changing the blueprint (results avail- 
able on request). Hence, our findings do not appear to be artifacts of gross 
differences in occupational composition or gender mix. 

Overall, the effects of model change persist even after we control for 
many other factors that could be expected to influence turnover rates, 
including organizational size and growth, age, industry, occupational and 
gender composition, and the duration of venture capital financing and 
public status. 


WHO IS LEAVING: DISENCHANTED OLD GUARD OR MISMATCHED 
NEW GUARD? 


If altering organizational blueprints changes the skills, values, working 
relationships, and routines that have developed within an enterprise, then 
turnover associated with changing blueprints should be concentrated dis- 
proportionately among more senior employees. Our data do not permit a 
direct test of this prediction. However, companies completing our HR 
survey did provide a tenure distribution for the firm (the fraction of the 
labor force with tenure of six months or less, seven to twelve months, 
etc.) at the time of the interview (1994 or 1995). To examine this issue, 
we conducted exploratory analyses, predicting the percentage of employees
in 1994-95 having six months or less tenure in the firm. These analyses
were limited to the last spell for which we had information within each 
firm (1994 or 1995). We included as predictors the same covariates as in 
model 2 of table 4, as well as contemporaneous and lagged measures of 
turnover. For firms interviewed in 1995, there is some potential ambiguity 
regarding the time frame to which firms’ responses regarding turnover 
rates and tenure distributions pertain. We handled this ambiguity by 
performing the analyses in various ways.“ The key results of interest are 


= The 1995 survey form asked for the firm’s current tenure distribution but inadver- 
tently asked for turnover data pertaining to the first half of 1994 (as we had requested 
for firms interviewed in mid-1994). Some companies interviewed in 1995 provided, on 
stable across specifications and subsamples analyzed and are unaffected
by weighting, so we report (in table 7) only one set of those analyses, 
which are unweighted. 

Table 7 reveals that change in the employment blueprint has a strong 
and significant positive effect on the fraction of a firm’s workforce having 
tenure of six months or less. In other words, the more a firm had changed 
its organizational blueprint by 1994-95, the larger the fraction of recently 
hired employees on its payroll, even controlling for recent turnover ex- 
perience.” Because we have controlled for turnover in the analysis, the 
effect of model change cannot be attributed simply to the higher levels 
of turnover experienced by firms that changed their employment blue- 
print. Rather, a more plausible explanation for this effect holds that firms 
experienced turnover disproportionately from their old guard when em- 
ployment blueprints changed, consistent with hypothesis 2. 

This interpretation is buttressed by parallel analyses in table 7, model 
2, that focus on the size of the most senior tenure cohort in the firm (five 
or more years). Companies that changed their employment blueprint most 
have a significantly smaller proportion of old-guard employees. Effects 
of other variables in the analysis seem quite plausible—for instance, the 


their own initiative, turnover data for all of 1994 and the first half of 1995. In one set
of analyses, we excluded firms that were interviewed in 1995 but that did not provide 
turnover data pertaining to the same time period as the tenure distribution. In other 
analyses, we constructed a new lagged turnover measure for year t - 1, equal to (a)
1993 turnover for firms interviewed in 1994, (b) turnover in 1994 for firms interviewed 
in 1995 that also provided turnover data for 1995, and (c) turnover in the first half of 
1994 for firms interviewed in 1995 that did not furnish any more recent turnover data. 
In another variant, we restricted the analysis to firms interviewed in 1994. These 
analyses produced results comparable to those reported in table 7, which handles the 
ambiguity by: (a) including a dummy variable for whether or not a given firm provided 
contemporaneous turnover data for the same time period as the tenure distribution; 
(b) controlling for 1994 vs. 1995 interview year; and (c) permitting the effect of lagged 
turnover to vary between firms that did versus did not furnish contemporaneous 
turnover data. 
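The three-part rule for the lagged turnover measure in this note can be restated as a small selection function. This is only an illustrative sketch: the function name and the period labels used as dictionary keys ("1993", "1994", "1994H1", "1995H1") are hypothetical conventions, not the coding scheme actually used in the study.

```python
def lagged_turnover(interview_year, rates):
    """Select the year t - 1 turnover rate following rules (a)-(c).

    `rates` maps hypothetical period labels to the turnover rates a firm
    reported; a period the firm did not report is simply absent.
    """
    if interview_year == 1994:
        # (a) firms interviewed in 1994: use 1993 turnover
        return rates["1993"]
    if interview_year == 1995:
        if "1995H1" in rates:
            # (b) firm also supplied 1995 data: use full-year 1994 turnover
            return rates["1994"]
        # (c) no more recent data: fall back on first-half 1994 turnover
        return rates["1994H1"]
    raise ValueError("interview year must be 1994 or 1995")
```

For instance, a firm interviewed in 1995 that reported only first-half 1994 turnover falls under rule (c), so `lagged_turnover(1995, {"1994H1": 0.12})` returns that first-half figure.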


* Turnover in the prior year generally has a negative effect on the size of the “new 
guard,” presumably because firms experiencing high turnover in the previous year 
hired replacements, who had already accumulated more than six months’ seniority 
within the firm. However, model 1 in table 7 reveals an exception: among firms interviewed
in 1995 that did not provide contemporaneous turnover data for 1995, lagged
turnover exhibits a positive relationship with the fraction of employees hired in the
last six months. One possible explanation for this result is that these firms (all studied
in 1995) were (justifiably) confused by the different time frames to which the turnover
and tenure questions pertained, and they unwittingly reported their tenure distribution
for 1994 and/or their recent (1995) turnover experience instead of consistently providing
information pertaining to 1994 or 1995 (i.e., it is because they had a relatively low
tenure workforce in 1994 that they display higher turnover in 1995). This would 
account for the appearance of a significant positive relationship between reported 
turnover and the size of the recently hired contingent. 
fraction of senior employees is larger in older companies, in those that
were larger in their first year of operations, and in those founded along 
commitment lines. Recognizing that not all firms in the sample had been 
in existence for five years, we also replicated these analyses on the subset 
of companies that were five years old or more as of the start of the spell, 
and the results (available on request) were comparable to those reported 
in table 7. 

We cannot make strong inferences on this issue, given the considerable 
limitations of the data available to us. However, these exploratory results 
seem consistent with the view that adopting a new organizational model 
increases turnover principally by dislocating the old guard. 


TURNOVER AND ORGANIZATIONAL PERFORMANCE 


We have argued that the tendency for changes in organizational blueprints 
to increase turnover is interesting for numerous reasons, irrespective of 
the relationship between turnover and firm performance. Nonetheless, we 
wish to test the prediction, derived from ecological perspectives, that the 
disruptions occasioned by turnover destabilize organizations and ad- 
versely affect performance. We report a preliminary analysis of the re- 
lationship between turnover and one compelling indicator of performance: 
revenue growth. Bear in mind that many young technology companies 
are incurring significant set-up costs that might dampen profitability, such 
as conducting basic research, developing distribution channels, building 
infrastructure, and so on. Hence, the ability to accelerate the flow of 
revenues is a reasonable indicator of early success in young technology 
companies, one over which the labor force has some control and one that 
is tracked closely by external constituencies and stakeholders. In addition, 
the data we analyze on revenue growth were provided by the companies 
to independent sources—annual reports to the SEC (for public companies) 
and annual editions of the Technology Resource Guide to Greater Silicon 
Valley (a commercial database distributed by CorpTech)—rather than 
directly to us, reducing the chance of any bias. 

We model performance by relating revenues (in $100,000s) in year 
T + 2 to: (a) revenues in year T; (b) average turnover in years T - 1 and
T; and (c) other controls (described below). We calculated each firm’s 
average turnover rate over the last two spells for which it provided data 
(for most firms, 1993 and the first half of 1994; for a few firms, 1994 and 


* Not surprisingly, with age, size, and growth controlled, the effect of recent turnover 
on the relative size of the most senior tenure cohort is modest.
the first half of 1995).°’ We transform revenues into the square root metric
to deal with nonlinearities and to accommodate firms with zero revenues
at either or both time points. The regressors include firm age at year T,
the time period to which turnover data pertain for the firm (1993-94 or 
1994-95), change in the organizational blueprint (number of dimensions, 
0-3), and employment growth (the ratio of employment in 1994 to 1993). 
The last variable is included to control for the possibility that any observed 
effect of turnover on revenue growth reflects downsizing. 
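The specification just described can be written out compactly. The notation below is introduced only for illustration and does not appear in the original: R denotes revenues (in $100,000s), Q-bar the average turnover rate over years T - 1 and T, G the employment growth ratio, C the number of blueprint dimensions changed (0-3), A firm age, and D the indicator for the period to which the turnover data pertain.

```latex
\sqrt{R_{i,T+2}} \;=\; \beta_0
  + \beta_1 \sqrt{R_{i,T}}
  + \beta_2\, \bar{Q}_{i}
  + \beta_3\, G_{i}
  + \beta_4\, C_{i}
  + \beta_5\, A_{i,T}
  + \beta_6\, D_{i}
  + \varepsilon_{i}
```

Because revenues at year T are held constant, a negative estimate of the turnover coefficient indicates slower revenue growth between T and T + 2 for firms with higher turnover.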

Table 8 reports WLS estimates for the 54 companies with complete 
data.“ The table reveals a significant negative effect of turnover on rev- 
enue growth, consistent with hypothesis 3. This effect of turnover is quite 
robust across different specifications, functional forms, and measures of 
turnover.” Thus, changes in the organizational model not only foster 
turnover, but that turnover appears to have adverse consequences for 
organizational performance, at least in the short run, consistent with the 
ecological perspective. 


ISSUES OF CAUSALITY AND OMITTED VARIABLES 


The available data do not permit definitive conclusions about the causal 
relationships between model change, turnover rates, and organizational 
performance. We cannot tell for sure when employment models changed 
(if they did), and hence causality could run in the other direction: firms 
experiencing higher turnover might change their employment models in 
an effort to stem that turnover. Though we cannot rule out this competing 
account, various pieces of evidence argue against it. First, recall that we 
found that the (time-varying) effects of CEO succession on turnover tend 
to vanish once we control for model change. In other words, our time- 


” Firms generally reported on turnover through the middle of the year in which they
were interviewed (1994 or 1995). Consequently, because the turnover rate reported for
year T pertains only to the first half of that year, whereas the report for year T - 1 pertained
to an entire year, we weighted the rate for T - 1 twice the rate for T in calculating
our measure of average turnover.
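As a minimal sketch of this weighting, the average can be computed as a weighted mean in which the full-year rate counts twice. The note states only the 2:1 weighting; dividing by the sum of the weights, and the function and argument names, are assumptions made here for illustration.

```python
def average_turnover(rate_prev_year, rate_half_year):
    """Weighted mean of two turnover rates with 2:1 weights.

    rate_prev_year: rate reported for year T - 1 (a full year), weight 2
    rate_half_year: rate reported for year T (first half only), weight 1
    Normalising by the total weight (3) is an assumption of this sketch.
    """
    return (2.0 * rate_prev_year + rate_half_year) / 3.0
```

With a full-year rate of 0.30 and a half-year rate of 0.15, for example, the measure is (2 × 0.30 + 0.15) / 3 = 0.25.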

* Not surprisingly, the revenue growth data are heteroscedastic, with larger error 
variance among smaller firms, so we weight observations as a function of 1994 employment.
Other weighting schemes, such as revenues at year T or employment averaged
over several years, produce comparable results, as do analyses that control for
the possibility of non-random missing data (detailed results available on request). 
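To make the weighting step concrete, the following is a generic weighted least squares sketch in plain Python. It is not the estimation code used for table 8: the function name is hypothetical, the weights are assumed to be directly proportional to employment (the note says only "a function of" employment), and the toy values are invented solely to show the mechanics.

```python
from math import sqrt


def wls_fit(X, y, w):
    """Weighted least squares: minimise sum_i w[i] * (y[i] - X[i].beta)**2.

    X is a list of rows (include a leading 1.0 for the intercept),
    y the outcomes, and w the positive observation weights.
    """
    k = len(X[0])
    # Augmented matrix of the weighted normal equations (X'WX) beta = X'Wy.
    A = [[sum(wi * xi[r] * xi[c] for xi, wi in zip(X, w)) for c in range(k)]
         + [sum(wi * xi[r] * yi for xi, yi, wi in zip(X, y, w))]
         for r in range(k)]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(k):
            if r != col:
                f = A[r][col] / A[col][col]
                A[r] = [a - f * b for a, b in zip(A[r], A[col])]
    return [A[r][k] / A[r][r] for r in range(k)]


# Invented toy data: revenues at year T and T + 2, and employment as weights.
revenue_t = [0.0, 4.0, 16.0, 36.0]
revenue_t2 = [1.0, 16.0, 49.0, 100.0]
employment = [10, 25, 40, 80]

X = [[1.0, sqrt(r)] for r in revenue_t]   # intercept and sqrt(revenues at T)
y = [sqrt(r) for r in revenue_t2]         # sqrt(revenues at T + 2)
beta = wls_fit(X, y, employment)
```

Weighting by employment gives the larger firms, whose errors are less variable, more influence on the fit; the square root transform keeps firms with zero revenues in the sample.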

» In supplementary analyses, e.g., we also controlled for other potential determinants 
of organizational performance, such as changes in CEO, the fraction of the firm’s 
workforce in sales occupations (and change in that fraction since founding), and 
whether and for how long the firm had been public and/or received venture capital 
financing. The basic results in table 8 were unchanged (detailed results available on 
request). 
TABLE 8
DETERMINANTS OF REVENUE GROWTH: WLS REGRESSION


Variable                                                          b        B        t        P
Revenues in $100,000s (square root), year T                     1.198     .939   31.426    <.001
Average annual turnover rate, years T - 1 and T                 -.124    -.094   -2.973     .005
Employment growth (ratio of year T to T - 1)                     .989     .087    3.376     .002
Change in organizational blueprint (number of dimensions, 0-3)   .541     .046    1.385     .173
Firm age, year T                                                -.210    -.044   -1.649     .106
Indicator for year T*                                          -1.309    -.030   -1.087     .283
Constant                                                        2.529             1.811     .077


NOTE. Dependent variable is the square root of revenues (in $100,000s) for year T + 2. Observations
(N = 54) weighted by 1994 employment size. Mean (SD) of dependent variable is 12.601 (11.7243), based
on weighted data. R² = .970; adjusted R² = .967.

* 1995 = 1, 1994 = 0.


invariant measure of model change mediates the effects of the time- 
varying measures of CEO succession and cumulative number of CEOs. 
This suggests that executive succession disrupts organizations when it 
entails changes in employment blueprints. Second, study of respondent 
interview transcripts reveals numerous mentions of changes in premises 
(“culture”) as being disruptive and divisive. But, we cannot find any men- 
tions of changes in premises in response to unacceptably high (or low) 
turnover. 

Moreover, several results in table 5 make no sense, in our view, under 
reverse causality. For instance, turnover is particularly high for changes 
from aberrant blueprints to autocracy (row 15b), from one nontype model 
to another (row 9), and from commitment or star to aberrant (row 7). 
Firms facing high turnover seem unlikely to have responded by under- 
taking these particular transitions, whereas it is straightforward to un- 
derstand why these transitions might cause high turnover. 

We also investigated whether the arrival of a new CEO, changes to 
the organizational model, and high turnover might all arise because of 
some unobserved crisis. We found that distinguishing spells in which a 
firm’s employment declined by 10% or more does not alter materially the 
pattern of findings we reported above, suggesting our results do not simply 
capture large-scale downsizings.* We also utilized qualitative information 
from interviews and surveys to code whether the founder reported that 
any of the following events occurred within a given firm-year spell: (1)


* As one might expect, there was a moderate positive effect of current employment
decline (during the spell) on turnover (e.g., in a model only controlling for age, b =
0.499; z = 2.158; P = .031). However, the other covariates in our model capture many
of the factors associated with the likelihood of decline, and controlling for those variables
weakens the effect (b = 0.378; z = 1.451; P = .147).
major downsizings or office closings; (2) general financial or legal turmoil,
such as large losses, cash flow problems, market collapse, and law suits; 
(3) major executive changes; and (4) other executive changes (e.g., de- 
parture of a chief technical officer). All four dislocating events have pos- 
itive effects on turnover. Nonetheless, the effect of changing the organi- 
zational blueprint remains positive and highly significant even when we 
control for these milestone events. Nor were the results appreciably 
changed when we included controls for evidence of a change in business 
strategy within the firm. These supplementary results offer some reas- 
surance that our analyses do not overlook some major factor that produced 
the observed associations between changes in employment blueprints, 
turnover, and organizational performance. 


CONCLUSION 


Organizational theorists, particularly ecologists, have emphasized the dis- 
ruptive effects of fundamental organizational change. Such change is 
thought to destabilize organizations primarily by altering the premises, 
values, and routines that organizational members have come to internalize 
(for a programmatic statement, see Hannan and Freeman [1984]). Accordingly,
we have tried to get closer to the mechanisms at the heart of
theories of organizational inertia, by (1) operationalizing the premises 
(employment models) on which founders built their new organizations, 
(2) measuring changes over time in those premises, (3) relating those 
changes to employee turnover, and (4) exploring the effect of turnover on 
organizational performance. If altering organizational premises dislocates, 
then this should be clearly manifested in turnover, especially among the 
most senior employees within an organization. Turnover seems an espe- 
cially appropriate indicator of the disruptive effects of organizational 
change within the setting we examined—high-technology companies in 
Silicon Valley—because retaining the key human assets in young tech- 
nology firms is often viewed by senior management, investors, and other 
informed parties as a crucial requirement for organizational survival and 
success. 

We found considerable evidence that changing organizational blue- 
prints fuels employee turnover, which is concentrated disproportionately 
among old-guard employees. Turnover, in turn, adversely affects the ability
of young firms to grow their revenues (at least in the short run), a crucial
dimension of performance for emerging technology companies. On bal- 
ance, our results support the claim by neoinstitutionalists and organiza- 
tional ecologists (following Stinchcombe 1965) that cultural blueprints are 
superimposed by founders on nascent organizations, as well as ecologists’
claim that altering such blueprints is disruptive and destabilizing. 

A broad conclusion of this analysis, and of others using the SPEC data, 
is that origins matter. Future research should devote more attention to 
conceptualizing and measuring how cultural blueprints are selected and 
imprinted on organizations during their infancy and how they are sustained,
modified, or discarded over time. Such studies would not only sharpen 
organizational theory but also shed light on some important real-world 
issues, such as the conditions under which firms find it relatively easy or 
difficult to merge or to transplant a particular organizational model into 
a new country or line of business. Moreover, additional work along these 
lines is needed to gauge whether our two main findings—that changes in 
employment blueprints fuel turnover in Silicon Valley high-tech start-ups, 
and that turnover in turn adversely affects organizational perform- 
ance—generalize to other organization-building activities, other kinds of 
enterprises, other environments, and other stages of organizational 
development. 

However, pursuing this research agenda will require some adjustments 
to conventional research methodology. It obviously requires collecting 
observations that pertain to multiple points in time, beginning at the 
origins of the enterprise. It also requires analyzing the resulting panel 
data in non-standard ways. As we noted above, the current standard in 
analyzing panel data on organizations is a fixed-effects approach. How- 
ever, each organization’s initial conditions are constant over its history. 
Thus, origins are swept out of the picture in a fixed-effects analysis. What
seems needed is to employ analytic methods, such as those used in this 
article, that allow flexible specification of models in which both fixed and 
age-varying characteristics affect levels and changes in outcomes of in- 
terest. Further development and refinement of such approaches should 
be high on the agenda for organizational analysis. 
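
The point about fixed effects can be illustrated with a minimal sketch. It assumes a hypothetical two-firm panel in which each yearly observation records a time-invariant founder-blueprint indicator and the firm's age; the firm names, variable layout, and the helper `within_transform` are illustrative assumptions, not the article's data or method. The sketch applies the within (firm-demeaning) transformation that underlies a fixed-effects estimator and shows that the constant origin indicator becomes zero for every observation, leaving nothing to estimate.

```python
from statistics import mean


def within_transform(panel):
    """Subtract each firm's own mean from each of its yearly observations."""
    demeaned = {}
    for firm, rows in panel.items():
        # Column-wise means computed within the firm.
        firm_means = [mean(column) for column in zip(*rows)]
        demeaned[firm] = [
            tuple(value - m for value, m in zip(row, firm_means))
            for row in rows
        ]
    return demeaned


# Hypothetical panel: firm -> yearly rows of (founder_blueprint_dummy, firm_age).
# The blueprint dummy is fixed at founding, so it never varies within a firm.
panel = {
    "firm_a": [(1, 1), (1, 2), (1, 3)],
    "firm_b": [(0, 1), (0, 2), (0, 3)],
}

demeaned = within_transform(panel)
for firm, rows in demeaned.items():
    print(firm, rows)
```

After demeaning, the blueprint column is identically zero in both firms while firm age retains its within-firm variation, which is why models that allow both fixed and age-varying characteristics need a different specification.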

Our detailed analyses of firms’ transitions suggest that the disruptive 
effects of changing organizational blueprints depend significantly on firms’ 
origin and destination states. Abandoning a coherent model proved to be 
particularly destabilizing, especially if the founder embraced the com- 
mitment or star model, but not if the firm migrated away from bureauc- 
racy or autocracy. Similarly, firms experienced markedly different turn- 
over rates in transitioning to particular destination states, such as 
bureaucracy, as a function of their initial blueprint. The engineering 
model, in contrast, appeared to be relatively more flexible and adaptable
(i.e., easier to dismantle and easier to migrate to) than various other mod-
els. Perhaps this offers some insight into its widespread prevalence in
Silicon Valley. Not only does the engineering model seem to be a relatively 
hospitable origin and destination state, but retaining that model over time 
might also expose firms to less severe forms of misalignment than, say,
retaining the commitment or star or autocratic model. Put differently, the 
engineering model might have less upside but also less downside risk, 
compared to more fragile and distinctive models, such as the commitment 
or star blueprint, which entail greater potential returns and risks. Organ-
izational ecologists might profitably examine how different blueprints or
models fare competitively under different environmental circumstances. 

Scholars and practitioners alike extol the virtues of creating and sus- 
taining a coherent and consistent system of practices concerning employ- 
ment relations. Yet our results suggest that complementarity, consistency, 
and salience are not unqualified assets for an organizational model. Those 
very same attributes might help explain, for instance, the high turnover 
that firms experienced when moving to bureaucracy or autocracy: if Sil- 
icon Valley “techies” tend to dislike bureaucracy and autocracy, they dis- 
like them most in their purest, most consistent, and most salient incar- 
nations. We still know remarkably little about the parameters and 
consequences of consistency: what determines the degree to which a sys- 
tem of beliefs or HR practices gets perceived as consistent, and how/when 
organizations specifically benefit from such consistency (for a notable ex-
ception, see Ichniowski, Shaw, and Prennushi [1997]).

Moreover, achieving and sustaining consistency in employment systems 
raises complicated dynamic trade-offs. Some transition paths from foun- 
der’s to CEO’s blueprint seem to be particularly disruptive for organi- 
zations, underscoring the point that organizational design should attend 
to more than just getting the model “right.” It should also balance the
benefits of getting the right model against the costs associated with tran- 
sitioning to that model. As an organization and its environment change, 
the merits of a given model are likely to change as well. However, the 
economic, social, and psychological costs associated with dismantling the 
previous blueprint and implementing a new one might outweigh the con- 
tent advantages offered by the new blueprint. Hence, in some ecologies 
and for some strategies, adhering faithfully to a second-best (or even third- 
best) model might be superior to rapid oscillation among shorter-lived, 
first-best models. This issue of balancing stability versus change—and 
weighing the benefits of altering organizational arrangements against the 
adjustment costs involved in making those changes—has received little 
attention from organizational analysts and those interested in employment 
systems and human resource practices. The strong complementarities 
among elements of a firm’s employment system, coupled with employees’ 
strong emotional attachments to personnel practices, might make these 
trade-offs particularly complex in the domain of human resource man- 
agement. Future research should examine these trade-offs and how they 
vary across types of organizations, environments, and employment
practices. 
Delinquent Peers Revisited: Does Network
This study examines whether structural properties of friendship net-
works condition the association between friends’ delinquency and
an individual’s own delinquent behavior. Data from Add Health
allow a more accurate conceptualization of the peer network and
a more rigorous measurement of peer delinquency than previous 
research. Findings from this study indicate that friends’ delinquency 
is associated with an adolescent’s own delinquency involvement. 
However, characteristics of adolescents’ friendship networks, such
as their density and adolescents’ centrality and popularity, condition
the delinquency-peer association. Network density, in particular, 
emerges as an important component of the delinquency-peer asso- 
ciation, with very cohesive networks containing stronger delin- 
quency-peer associations than those that are less cohesive. These 
findings suggest that it is necessary to consider the underlying struc- 
tural properties of friendship networks in order to understand the 
impact of peer influence on adolescent delinquency. 


INTRODUCTION 


An understanding of the relationship between peers and delinquency is 
at the heart of delinquency research. One of the most consistent and robust 
findings in the literature on adolescent delinquency involves the associ- 
ation between friends’ delinquent behavior and a respondent’s own de-
linquency.² Dating back to the 1930s with Shaw and McKay’s (1931)
discovery that more than 80% of juveniles appearing before court had 
peer accomplices, researchers have noted the strong tendency for offenders 
to commit delinquent acts in the company of others (Akers et al. 1979; 
Elliott, Huizinga, and Ageton 1985; Elliott and Menard 1996; Jensen 1972; 
Kandel 1978; Krohn 1974; Matsueda 1982; Matsueda and Anderson 1998; 
Matsueda and Heimer 1987; Reiss 1986; Short 1957; Voss 1969; Warr 
1996). In fact, many studies find that the relationship of peer delinquency 
to self-report delinquency exceeds that of any other independent variable, 
regardless of whether the focus is on status offenses, minor property 
crimes, violent crimes, or substance use (Akers et al. 1979; Elliott et al. 
1985; Jensen 1972; Matsueda and Heimer 1987; Short and Nye 1958; 
Warr and Stafford 1991). Therefore, causal explanations of delinquency 
that emphasize delinquent peer associations (e.g., differential association, 
social learning, and developmental theories such as Thornberry’s [1987] 
interactional theory) appear justified. 

Although prior delinquency research establishes that adolescents are 
likely to behave in a manner consistent with their friends, it has yet to 
incorporate the network structure of friendship relations into empirical 
models. This omission of network structure is attributable, in part, to 
researchers’ reliance on a one-dimensional conceptualization of peer in- 
fluence as “exposure” to delinquent friends. However, by ignoring the 
underlying social structure of friendship patterns, prior research has failed 
to address whether network structure moderates peer influence. For ex- 
ample, in closely knit (dense) friendship networks, peer behavior may 
more strongly influence members’ behavior than is the case in less cohesive 
networks. 

This study departs from previous research by recognizing the multi- 
faceted nature of peer groups. Most adolescents do not belong to a single, 
densely knit, isolated friendship clique, but instead are affiliated with 
many loosely bounded friendship groups with varying degrees of cohesion 
and permeability (Dunphy 1963). A measure of the number of delinquent 
friends is thus insufficient to truly capture the dynamics involved in the 


2 Adolescent delinquency is typically defined as activities that place youth at risk for 
adjudication, that is, violating the juvenile code (Henggler 1989; Thornberry 1987). 
These activities can vary widely in their seriousness and detrimental effects on others. 
Status offenses are at the low end of the seriousness continuum and reflect those 
activities for which adults would not be arrested or prosecuted (e.g., truancy, alcohol 
use, smoking, running away). Serious offenses are at the other end of the continuum 
and reflect criminal activity, which can have very detrimental consequences for victims 
and other members of the community (e.g., homicide, rape, aggravated assault, bur-
glary, auto theft, and arson). Most criminology studies use an omnibus rather than
a specific measure of delinquency.
peer delinquency association. Since it is generally acknowledged that peers
play an important role in delinquency involvement, it is important to 
examine the factors that may affect the peer-adolescent relationship. This 
study incorporates the friendship networks in which delinquent and non- 
delinquent adolescents are enmeshed, in order to address the question: 
Do structural characteristics of the adolescent’s friendship network con- 
dition the association between friends’ behavior and delinquency 
involvement? 

Data from the National Longitudinal Study of Adolescent Health (here- 
after, Add Health) enable this question to be addressed in a novel manner, 
through the application of a social network perspective and methods.³
Because these data contain detailed social network information on high 
school-age adolescents, more valid measures of peer delinquency are pos- 
sible. In contrast to previous work, this study uses measures of peer de- 
linquency that are based on responses from the peers themselves, rather 
than perceptions from respondents. In addition, these data allow incor- 
poration of the underlying structural characteristics of friendship net- 
works. This allows for a determination about whether the strength of the 
delinquency-peer association depends upon how tightly an adolescent is 
integrated into the peer network (density), the adolescent’s position within 
the network (centrality), and the adolescent’s prestige (popularity) among 
network members. The use of network methods will provide a deeper 
understanding of the role that adolescent friendship networks play in 
either facilitating or discouraging delinquent behavior. 


LITERATURE REVIEW 
Friendship Networks 


Ethnographic studies of adolescents in school settings provide most of the 
information on the importance of friendship networks during adolescence. 
These studies consistently report that being with friends is the most im- 
portant aspect of school life for most students (e.g., Corsaro and Eder 
1990; Cusick 1973; Everhart 1983; Willis 1981) and that relational prob- 
lems with peers are particularly distressing to adolescents (Ambert 1994). 
Part of the importance attributed to friendships derives from structural 
changes that occur in the school environment during the transitions from 
elementary to junior and senior high school. After this transition, ado- 
lescents are confronted with a larger and more heterogenous population 
of students, and status in this new setting often is based on “being known” 
by peers (Eder 1985). Subsequently, many students speak of the need to 


3 Add Health can be accessed online at http://www.cpc.unc.edu/addhealth.
expand their personal networks to avoid becoming lost and isolated in
the new school setting (Eckert 1989). 

The importance of finding a position within larger friendship networks 
suggests that adolescents are particularly susceptible to peer influence 
during these transition years, including behavioral constraints that may 
pull them toward or away from delinquent behavior. This concern over 
locating position within the school hierarchy and gaining a sense of be-
longing among their peers leads students to adopt a variety of strategies
to enhance peer solidarity. One strategy involves the use of gossip to 
constrain friends’ behaviors. As Eder and Enke (1991) explain, “this con- 
straint stems from the fact that while adolescents appear comfortable 
challenging the evaluations made by an individual, they seem reluctant 
to challenge a ‘group’ evaluation. Once an initial evaluation was sup- 
ported by another group member, it was never challenged” (p. 505). For 
boys, a mechanism of ensuring conformity among peers involves “policing 
masculinity” (Best 1983) with behaviors such as aggressiveness, domi- 
nance, and toughness encouraged among peers (Messerschmidt 2000). To 
ensure compliance with this masculine ideology, boys often use derogatory 
female and homosexual references to keep group members in line (Best 
1983; Messerschmidt 2000). 

These findings suggest that friendship networks and peers exert con- 
siderable influence over adolescents’ behavior, including delinquency. 
While ethnographic research is invaluable in highlighting the social mech- 
anisms that peers use to constrain friends’ behavior, it cannot tell us 
whether structural characteristics of the friendship networks provide more 
or less opportunity for these mechanisms to operate. This requires detailed 
information on the structural characteristics of friendship networks for a 
large number of adolescents situated across many different contexts. 


Delinquent Peer Networks 


Though much is known about the relationship between delinquency and 
friends’ behaviors, only a few studies present detailed information on 
friendship characteristics among delinquent adolescents. Warr (1996) in- 
novatively uses Gold’s (1970) National Survey of Youth (NSY) to examine 
specific features of delinquent subgroups, such as group organization and 
the instigator role within delinquent groups. A particularly important 
finding from a network perspective is highlighted in his study: individuals’ 
network structure influences behavior over and beyond that of their stable 
individual traits. Specifically, Warr finds that the structure of the group, 
not an individual’s attributes, affects which individual instigates delin- 
quency. Results from his study also indicate that groups are more spe- 
cialized in terms of delinquency involvement than individuals tend to be, 
so that most delinquent offenders belong to multiple groups, with each
group specializing in a smaller range of offenses. This finding also high- 
lights the multifaceted nature of peer groups; individuals in school settings 
can be members of many different friendship groups and face differing 
degrees of constraint depending upon whether the behavior, norms, and 
values of the groups coincide or diverge.⁴

The nature of friendship relations in delinquent versus nondelinquent 
networks is developed in two influential studies. First, an often cited study 
by Giordano, Cernkovich, and Pugh (1986) finds that various dimensions 
of friendship relations do not differ markedly between delinquent and 
nondelinquent adolescents. In contrast to control theory’s depiction of 
delinquents’ friendships as cold and exploitive (Hirschi 1969), Giordano 
et al. (1986) find that both delinquent and nondelinquent adolescents
report similar levels of attachment, intimacy, and contact with friends.⁵
Similarly, Kandel and Davies (1991) investigate whether friendship net- 
works among adolescents who do and do not use illicit drugs differ in 
terms of intimacy. Again, contrary to social control theory, the authors 
find few differences in the characteristics of the friendship network. In 
fact, among frequent drug users, they find more intimate friendships than 
among adolescents with no drug use or lower levels of drug use.⁶

While these two studies go beyond others by incorporating the nature 
of friendship relations, they are largely descriptive and do not consider 
whether characteristics of friendship relations condition the influence of 
friends’ delinquency on an adolescent’s own delinquency involvement. 
This is an important issue since Agnew (1991) finds that subjective char- 
acteristics of the social relations of friendships (e.g., attachment, contact, 
delinquency) condition the effect of friends’ behavior on adolescents’ se- 
rious delinquency. Specifically, the effect of peers is strongest when an 
adolescent has strong attachment to peers, much contact with friends, 
and when their peers display delinquent patterns. 

These are noteworthy findings because they indicate that even in de- 
linquent peer networks, individuals display substantial variation in their 


4 However, by incorporating the behavior of the group as a defining characteristic (i.e.,
delinquent group), Warr's study, while provocative, only examines descriptive char- 
acterization of group characteristics for delinquent adolescents. 

5 Giordano et al. (1986) do find that delinquent adolescents tend to report more conflict
involved in their friendship relations than their nondelinquent counterparts; however, 
delinquent friendship patterns are also characterized by greater loyalty toward peers 
than is found in nondelinquent friendships. 

6 These findings stand in contrast to some research coming out of the developmental
field that suggests that children most at risk of delinquency are those who are rejected
by their peers at an early age and later in middle adolescence form less stable and less 
affectionate friendships with other peer-rejected adolescents (Patterson, DeBaryshe, 
and Ramsey 1989). 
attachment to and the amount of time they spend with their peers—both
of which condition peer influence. While Agnew’s research suggests that 
members of a friendship network are not affected by the network’s be- 
havior in exactly the same way, it is unknown whether structural prop- 
erties of the network, such as density, centrality, and popularity, similarly 
condition peer influence. The latter finding may suggest that both struc- 
tural and emotional ties play a role in determining the strength of the 
association between delinquent peers and delinquency involvement. 

Therefore, despite advancing understanding of the role that peers play 
in adolescents’ delinquency, prior research does not account for the role 
of network structure on adolescent delinquency. The criminological lit- 
erature, in particular, and sociological literature, in general, reflect a sur- 
prising lack of clarity in conceptualizing peer influences and little con- 
sideration of peer network structure. By focusing only on the number of 
delinquent friends reported by an adolescent, prior research assumes that 
everyone in the friendship network is affected similarly by the behavior 
of the friendship network, regardless of their position within the network 
or their status among network members. The assumption that persons 
and groups are independent overlooks how interconnections among per- 
sons influence behavior. 

In contrast, a social network perspective posits that examining an im- 
portant element of peer relations, the observed pattern of interpersonal 
friendship relations, provides insight into whether the susceptibility ad- 
olescents have to their friends’ behaviors derives in part from their po- 
sition within the peer network. The interrelationships among adolescents 
serve as the context in which social norms and cultural values regarding 
delinquency are shared. Because of the importance of peer relationships 
in delinquency, the fact that few accounts of the delinquency-peer asso-
ciation are based on a social network perspective provides an opportunity
to see whether a network approach provides a deeper understanding of 
the delinquency-peer association. 


Selection versus Socialization 


In spite of the volume of research on the delinquency-peer association, 
the causal effect of peers on delinquency remains subject to debate. This 
dispute centers on two aspects of the relationship, the definition of peer 
delinquency used and the appropriate temporal ordering of delinquent 
friends and delinquency. Although this study addresses the definitional 
aspect of the debate (by using peer self-reported delinquency measures 
rather than respondents’ reports), it is unable to address the second issue 
adequately.⁷ In regard to temporal ordering, there is some evidence that
adolescents sort themselves into friendship networks based on aggressive 
personality (Cairns et al. 1988; Patterson and Dishion 1985). Although
this selection into particular friendship networks may partially explain 
the association between delinquent peers and delinquency, it should also 
increase delinquency through differential association and processes of so- 
cial contagion (Cairns and Cairns 1994; Elliott and Menard 1996; Kandel 
1978; Krohn et al. 1996; Matsueda and Anderson 1998; Thornberry 1987). 

Furthermore, while some criminologists speak of friendship formation 
as a self-selection process where “birds of a feather flock together” (Glueck 
and Glueck 1950), early child socialization researchers (e.g., Kupersmidt, 
Coie, and Dodge 1990) suggest that a process of peer rejection drives the 
formation of more problematic (i.e., aggressive and delinquent) friendship
networks. Additionally, many children and adolescents have to wait until 
a peer group selects them, although they can accept or veto participation. 
This is because the number and type of networks open to them are limited 
by many factors including age, gender, cultural, and socioeconomic fac- 
tors—which may be even more important factors than personality or 
behavior orientation (Brown, Lohr, and Trujillo 1990). This suggests that 
children and adolescents do not have as much control/freedom in selecting 
their friends as the selection perspective argues. Limited opportunities to 
join particular networks may make the peer network that adolescents 
eventually find a particularly important source of influence on their 
behavior. 


THEORETICAL FRAMEWORK 


Two dominant perspectives on the etiology of delinquent behavior are 
Hirschi’s (1969) social control theory and Sutherland’s (1947) differential 
association theory. Because these theories rest on contradictory assump- 
tions about motivation for delinquency, the model incorporated into this 


7 Although the Add Health data currently consists of two waves of data, the network 
data is only collected during the first wave of data collection via the in-school survey 
(see “Data” section for additional details on the data collection procedures). Therefore, 
despite having measures of delinquency involvement collected at two points in time, 
the network measures used to describe characteristics of friendship networks (i.e.,
egocentric networks) are only available at the first point in time. Sufficiently accounting
for self-selection into friendship networks necessitates information from two points in
time on both delinquency involvement and network characteristics. Controlling for
earlier delinquency involvement is not sufficient since there is no way to untangle
whether the previous delinquency resulted from network processes (with friendship 
formed at an earlier point in time) or determined the type of friendship network the 
adolescent self-selected into. 
study relies on theoretical elaboration. Theoretical elaboration is useful
because it does not require a resolution of differences in assumptions, only 
that the propositions of the model be consistent (Thornberry 1987). Draw- 
ing on Krohn’s work explicating the possibilities of network studies for 
understanding delinquency (1986; Krohn et al. 1988; Krohn and Thorn- 
berry 1993), I discuss the important concepts from social control and 
differential association theory with emphasis on using a network per- 
spective to link and elaborate on the concepts. This leads to the expec- 
tation that although friendship networks, on average, exert considerable 
social control over adolescents, this constraint will depend on (1) the 
friendship network’s normative behavior and (2) structural properties of
the friendship network. 


Social Control Theory 


Travis Hirschi’s (1969) social control theory of delinquency is based largely 
on the notion of social integration focal to the work of Durkheim (1951). 
Instead of focusing on why certain individuals commit crime, social con- 
trol theory emphasizes the necessity of explaining why individuals refrain 
from criminal activity (Hirschi 1969). Offering motivation for criminality 
is unnecessary, according to Hirschi, because “we are all animals and thus 
all naturally capable of committing criminal acts” (1969, p. 31). Instead, 
he focuses on constraining influences and argues that adolescents tightly 
bonded to family, the school, and peers are less likely to engage in delin- 
quent acts because these bonds restrain them from acting on their natural 
antisocial impulses (1969).⁸

Although Hirschi discusses four bonds responsible for constraining de- 
linquent behavior—attachment, involvement, commitment, and be-
lief—most research concentrates on the importance of the bond of at-
tachment (Vold, Bernard, and Snipes 1998). According to social control
theory, adolescents who have strong ties of affection to others (family, 
friends, school) are more behaviorally constrained than those lacking these 
bonds and thus less likely to become involved in delinquency. In terms 
of friendship networks, social control theory posits that the more bonds 
an adolescent has via friendship ties, which carry a connotation of at- 
tachment, the less delinquent the adolescent will be. 

While research provides modest empirical support for Hirschi’s theory 


8 Additionally, Hirschi argues that adolescents who have a stake in conformity (i.e.,
commitment) are far less likely to select delinquents as friends. On a related note,
Hirschi believed that adolescents lacking stakes in conformity were likely to be delin-
quent and to self-select other delinquents as friends. This issue is part of the selection-
socialization debate mentioned earlier.
(see a review of research on this issue in Jensen and Rojek [1992]), it is
evident that a complete explanation of delinquent behavior is not offered 
by the theory. One of the more problematic aspects of social control theory 
involves its neglect of the context in which the social bonds occur. Spe- 
cifically, while research establishes that in most cases social bonds via 
attachment are associated with a reduction in delinquency, they are not 
likely to reduce delinquency when adolescents are bonded to delinquent 
friends. When bonded to delinquent peers, the constraint of the bond is 
toward delinquency. Despite Hirschi’s denial of the importance of delin- 
quent friends, it is these delinquent associates who are implicated in the 
transmission of delinquency and to whom differential association theory 
attaches primary importance. 

Although social control theory pays limited attention to the context in 
which social bonds occur, its focus on the constraining influence of social 
integration is consistent with a social network perspective (Krohn 1986).⁹
Being integrated within a friendship network where adolescents are likely 
to report high attachment and time spent with peers (Giordano et al. 1986) 
either facilitates or discourages delinquency involvement depending upon 
the norms, values, and behaviors evident in the network (see also Krohn 
1986; Elliott et al. 1985). Consistent with Eder and Enke’s (1991) finding
that adolescents often discount a peer’s evaluation, but never a
group evaluation, is the notion that embeddedness within a social struc- 
ture (such as a friendship network) acquires additional influence because 
it creates expectations for behavior, while reinforcing the social norms 
and beliefs of the network (Granovetter 1973, 1974, 1985). This idea of 
embeddedness ties nicely into Sutherland’s theory of differential associ- 
ation since being enmeshed in a peer network provides access to expec- 
tations, norms, and sanctions that either support or discourage delinquent 
behavior. 


Differential Association Theory 


Edwin Sutherland’s (1947) differential association theory is based on the 
premise that delinquency is learned through intimate social relations with 
friends where attitudes or “definitions” favorable to law violation are 
acquired. Thus, not only are adolescents’ attachments to peers important 
for delinquency involvement, but more important, the context or norms 
of the friendship group determine whether attachment to friends results 


9 Because Hirschi's concept of a social bond is often construed around attachment, it
is important to recognize that a social tie to others does not necessarily represent
integration in Hirschi’s framework. However, a friendship network where ties are
implied on the basis of attachment to peers can be construed as integration.
in conventional or delinquent behavior. The social transmission of delin-
quency occurs within the friendship network through the dissemination 
of attitudes about the appropriateness of delinquent behavior (Sutherland 
1947). While Sutherland’s theory emphasizes the attitudes of peers in the 
transmission of delinquency, Akers’s (1985) extension to differential rein-
forcement theory suggests that the adoption of delinquent behavior occurs
through the imitation of peers’ behavior or through the observation of
its consequences (either positive or negative). Consistent with Akers’s re-
formulation of differential association theory, research finds that the be-
havior of peers is more important than the attitudes of peers in influencing
an individual’s own delinquency (Warr and Stafford 1991). 

Differential association theory, and Akers’s extension to it, are partic-
ularly suited to an examination of friendship networks because “definitions
favorable to violation of law” are learned in the intimate social networks 
of individuals (Sutherland and Cressey 1974). Moreover, several research- 
ers drawing on differential association theory have argued that the effect 
of delinquent peers is conditioned by specific features of social relations 
(Agnew 1991; Orcutt 1983, 1987; Short 1957, 1960; Voss 1969). Because 
Sutherland’s theory stipulates that the frequency, duration, priority, and 
intensity of associations are the most relevant for differential association, 
researchers have explored whether these features of social relations con- 
dition delinquent peer influences. Unfortunately, the imprecise definitions 
offered by Sutherland for the features of differential association leave 
room for subjective interpretation of the defining characteristic of social 
relations, and subsequent researchers have offered various definitions. 

Instead of focusing on subjective properties of social relations, such as 
affection, a network perspective suggests that useful conceptualizations 
of social relations should also incorporate the underlying structural prop- 
erties of interpersonal relationships in which definitions, favorable or un- 
favorable, to delinquency are transmitted. This may particularly be the 
case for Sutherland’s property of intensity. For example, a more cohesive 
peer network in which everyone is friends with everyone (i.e., high density) 
will provide greater intensity in differential associations for adolescents 
than will location in a friendship network in which only certain members 
identify each other as friends. Additionally, an adolescent’s structural 
location within the network may affect the intensity of differential as- 
sociations. For example, adolescents located in more central positions 
within the network should experience greater differential association than 
adolescents located in more peripheral positions in their peer network. 

In summary, a network perspective offers a structural approach to 
defining characteristics of social relationships. As discussed next, a net- 
work perspective suggests that (1) some group members are more susceptible
to control by their friendship network due to their position within
the friendship network, and (2) some friendship networks are more effective
in controlling the behavior of their members due to structural
characteristics of the network. By reconnecting adolescents to the struc- 
tures of relationships in which they are embedded, network methods and 
theory provide important tools to deepen understanding of the delin- 
quency-peer association. 


Social Network Perspective 


A social network perspective provides a useful framework for expanding 
on social control and differential association theory. The network
perspective assumes not only that networks constrain individual behavior
to become consistent with the norms, expectations, and behaviors of the
network, but also that “the structure of a network has consequences for its
individual members and for the network as a whole, over and above
effects of characteristics and behaviors of the individuals involved”
(Klovdahl 1985, p. 1204). This suggests that after accounting for relevant
individual attributes, structural location and positioning within social net- 
works influence behavior by moderating the effect of the network’s 
behavior. 

Social network analysis is uniquely suited for measuring and under- 
standing the behavior of peers because it provides a formal means for 
“mapping” friendships and measuring properties of those friendships
(Ennett and Bauman 1996). In most prior criminology/delinquency research
designs, adolescents simply describe their perceptions of their friends’ 
behavior. This is then construed as a measure of peer influence. Network 
analyses offer an alternative and more methodologically rigorous ap- 
proach, where the beginning point is asking respondents both to describe 
their own behavior and to identify their friends. The second step involves 
locating and interviewing the friends, with the friends describing their 
own behavior and then identifying their friends, and so on. In a best-case
scenario (which the Add Health study comes very close to), all adolescents and
friends in the population of adolescents provide this information. This 
allows the links among friends to be established for the purpose of constructing
analytical friendship networks with identifiable structural properties.
The characteristics of the adolescent’s friendship network, including
peer delinquency, can then be treated as individual attributes of that
respondent.


1 A social network is defined as a set of nodes (e.g., persons) linked by a set of social
relationships (e.g., friendships) of a specified type (Laumann, Marsden, and Prensky
1989). In network analysis, the pattern of linkages among a group of nodes is of primary
interest (Friedman et al. 1999). An egocentric network, which is the focus of this study,
centers on the individual and represents direct friendship links to other adolescents
within the social environment (e.g., school).

Because network methods are premised on the idea that patterns of 
friendship ties structure the flow of information, social norms, and social 
support, incorporating the friendship context is important for the study 
of delinquency since these ties potentially provide linkages for the trans- 
mission of delinquent behavior (Ennett, Bailey, and Federman 1999). 
These are the links through which differential associations are believed 
to occur. Despite advances in network methodologies that have pointed 
to the importance of social relations for understanding behavior (e.g., Bott 
1957; Coleman 1961; Granovetter 1973; Kapferer 1969), most criminol- 
ogists have not yet examined the different ways social networks influence 
adolescents’ participation or nonparticipation in delinquency. Theories of 
adolescent delinquency have focused on the existence of peer networks 
(differential association theory) or on adolescents’ feelings toward friends, 
such as attachment to friends (social control theory), without considering 
characteristics of the networks themselves (Krohn 1986). 

Krohn and his colleagues (1986, 1988, 1993) are among the few
criminologists who have recognized the benefits of applying a network
perspective to the study of delinquency. To evaluate the degree to which 
delinquent behavior is within the purview of an individual’s network, 
Krohn directed attention to two network characteristics he believed were 
especially important: multiplexity (defined as the number of activities in
which individuals interact jointly) and density (defined as the extent to 
which all members in the network are connected by direct ties). Despite 
relying on survey data (where respondents were asked to describe network 
characteristics), Krohn’s study is one of the first to suggest the importance 
of structural properties of networks for evaluating the probability of delinquency.
That is, structural characteristics of friendship networks can
help elucidate when network members have more or less control in
enforcing common norms, expectations, and obligations for behavior.
Unfortunately, little research in criminology has followed up on Krohn’s lead.

This study expands on prior research in two important ways. First, 
peer networks are defined more rigorously as adolescents’ egocentric 
friendship networks and are measured with network data on friendship 
nominations linking adolescents within schools. An adolescent’s friendship
network consists of all adolescents whom the respondent directly nominates
as friends, as well as those adolescents who directly nominate the
respondent as a friend. Second, the behavior and structural characteristics
of the personal friendship network are measured directly through information
provided by all members of the friendship network, rather than
by the perceptions of the respondent. Information on how delinquent the
network is, as well as on the structural properties of the friendship network,
is also based on friendship nomination data. If network structure matters,
the structural properties of adolescents’ friendship networks will
condition the association between networks’ delinquency and respondents’
reported delinquency.
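
The send/receive definition of the egocentric network can be illustrated with a minimal Python sketch. It assumes that nomination data arrive as (sender, receiver) pairs of student identifiers; the function name and the toy identifiers are hypothetical and are not part of the Add Health files.

```python
from collections import defaultdict


def build_ego_networks(nominations):
    """Map each student to the set of alters in his or her send/receive network.

    `nominations` is an iterable of directed (sender, receiver) pairs.
    A student's alters are everyone the student nominates plus everyone
    who nominates the student.
    """
    alters = defaultdict(set)
    for sender, receiver in nominations:
        if sender == receiver:
            continue  # ignore self-nominations
        alters[sender].add(receiver)   # tie sent by the respondent
        alters[receiver].add(sender)   # tie received by the respondent
    return dict(alters)


# Hypothetical toy roster: four students, four directed nominations.
nominations = [("ann", "bob"), ("bob", "cat"), ("dan", "ann"), ("cat", "bob")]
networks = build_ego_networks(nominations)
# networks["ann"] contains bob (a sent tie) and dan (a received tie).
```

Reciprocated nominations (bob and cat above) add a friend only once, because alters are stored in a set.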


Network Characteristics and Expectations 


By drawing linkages between social control and differential association 
theory, a network perspective suggests how structural properties of friend- 
ship networks may moderate the delinquency-peer association. These ex- 
pectations are detailed below. 

Density.—When network density is high and all members of the network
are likely to know and interact with one another (Bott 1957), it is
expected that the behavior evidenced in a friendship network will be 
more strongly associated with an adolescent’s behavior than will be the 
case in less cohesive friendship networks. More cohesive networks contain 
higher levels of interaction and communication within them so that the 
opportunities for network members to express their views of appropriate 
behavior are maximized (Giordano et al. 1986). Additionally, when all 
network members know one another, the likelihood of knowing others 
outside the network is reduced compared to the opportunity to know 
outsiders in less dense friendship networks (Granovetter 1973). This sug- 
gests that outside pressures are less influential for adolescents located in 
dense friendship networks. 

James Coleman (1990) discusses the advantages of social closure of this 
sort in an individual’s network for the facilitation of a clear social identity. 
Conceptualizing networks in terms of social capital suggests that “a co- 
hesive network conveys a clear normative order within which the indi- 
vidual can optimize performance, whereas a diverse, disconnected net- 
work exposes the individual to conflicting preferences and allegiances 
within which it is much harder to optimize” (Podolny and Baron 1997, 
p. 676). Thus, very dense social networks facilitate common identities and 
constrain the behavior of their members to be consistent with the net- 
work’s behavior, whether that is toward delinquency or not. 

Density is defined in this study as the number of ties present in the 
egocentric friendship network divided by the number of possible ties. A 
network’s density would be completely maximized if every member had 
ties to every other member and at a minimum if others in the network 
were only connected to the respondent. 
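
As a rough illustration of this definition, the sketch below computes density for one respondent. It assumes that ties are treated as undirected (a nomination in either direction counts once) and that the respondent is counted as a network member; both choices, like the function name, are illustrative assumptions rather than the exact Add Health computation.

```python
from itertools import combinations


def ego_density(ego, nominations):
    """Ties present in ego's send/receive network divided by possible ties."""
    # Collapse directed nominations into undirected ties.
    ties = {frozenset(pair) for pair in nominations if pair[0] != pair[1]}
    alters = set()
    for tie in ties:
        if ego in tie:
            alters |= tie - {ego}
    members = alters | {ego}
    if len(members) < 2:
        return None  # density is undefined without at least one friend
    possible = len(members) * (len(members) - 1) // 2
    present = sum(
        1 for a, b in combinations(sorted(members), 2)
        if frozenset((a, b)) in ties
    )
    return present / possible


# Friends connected only to the respondent: the minimum for a given size.
star = [("e", "a"), ("b", "e"), ("e", "c")]
# Every member tied to every other member: the maximum.
clique = [("e", "a"), ("e", "b"), ("a", "b")]
star_density = ego_density("e", star)      # 3 of 6 possible ties
clique_density = ego_density("e", clique)  # 3 of 3 possible ties
```

Under these assumptions the minimum depends on network size: a network of n members in which friends are tied only to the respondent has n - 1 of n(n - 1)/2 possible ties, or 2/n.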

Centrality.—Similarly, it is expected that an individual’s position within
the friendship network affects how influential the network’s behavior is 
on the adolescent’s behavior. Adolescents located in more central positions 
are expected to report behavior more closely associated with their friends’ 
behavior than individuals located in more peripheral positions (Giordano
1983). A central adolescent is defined as one who has ties to all or most
of the network members in the friendship network so that information 
passed through the network will have to be passed through this central 
adolescent. This definition of centrality gauges whether adolescents are 
located in prominent positions within their friendship networks according 
to whether they are involved with many others in the network. It does 
not matter whether involvement is due to being the recipient of many 
friendship ties or the source of the ties—what is important is that a central 
adolescent is involved in many relationships with friends. This is an ex- 
posure argument—the central adolescents are exposed to greater com- 
munication and interaction within their friendship network than adoles- 
cents located in peripheral positions within the network. 

Centrality is a measure of the number of links required to link all other 
adolescents in a respondent’s friendship network; the lower the number 
of links necessary, the more central the respondent. The measure used in 
this study also accounts for the centrality of the adolescents to whom 
respondents are linked by weighting respondents’ centrality scores by the 
centrality of those to whom respondents send ties. Therefore, this measure 
of influence not only depends on the adolescent’s centrality within the 
network, but also on the positioning of those to whom he or she is 
connected. 
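
The weighting idea can be sketched with a Bonacich-style recursion, in which each adolescent's score is the sum, over the friends he or she nominates, of one plus beta times the friend's score. The value of beta, the absence of any normalization, and the function name are assumptions made for illustration; the parameter choices behind the study's measure are not stated in this section.

```python
def bonacich_centrality(adjacency, beta=0.1, iterations=200):
    """Iterate c_i = sum_j A[i][j] * (1 + beta * c_j) to a fixed point.

    adjacency[i][j] is 1 if student i sends a tie to student j, else 0.
    The iteration converges only when beta is smaller than the reciprocal
    of the largest eigenvalue of the adjacency matrix.
    """
    n = len(adjacency)
    scores = [0.0] * n
    for _ in range(iterations):
        scores = [
            sum(adjacency[i][j] * (1.0 + beta * scores[j]) for j in range(n))
            for i in range(n)
        ]
    return scores


# Hypothetical four-person star: student 0 is tied to everyone else.
star = [
    [0, 1, 1, 1],
    [1, 0, 0, 0],
    [1, 0, 0, 0],
    [1, 0, 0, 0],
]
scores = bonacich_centrality(star)
# The hub scores highest; each peripheral student's score is raised
# above 1 because the single friend he or she nominates is well connected.
```

With a positive beta, being tied to central friends raises a respondent's own score, which is the property the text describes.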

Popularity.—Individuals who receive many friendship nominations
(i.e., popularity) are expected to be more constrained by the behavior of
the network than individuals who receive fewer friendship nominations. 
For example, individuals who are more popular may have behavior more 
similar to their friendship network than individuals with less status in 
the peer hierarchy. This expectation arises because the most popular students
in the school have the most to lose (the furthest to fall in the status
hierarchy) by not adopting the network’s behavior. Supporting this idea, 
ethnographic research by Eder (1985) finds that the most popular ado- 
lescent girls in a junior high school perceived more limited friendship 
choices and experienced great stress associated with maintaining their 
dominant position within the student hierarchy. The stress resulting from 
maintaining positions within school hierarchies was partly attributable to 
the girls’ considerable effort directed at ensuring that their behavior remained
consistent with that expected from their elite position. Limited
friendship choices in a stratified setting suggest that popular students may
have little option but to go along with group processes if they wish to
maintain their high status. Popularity is defined in this study as the number
of friendship nominations an adolescent receives from other adolescents
located in the school.


1 Conversely, it could be argued that more central actors (and more popular actors)
have greater influence on the group’s behavior. Although the cross-sectional data
used in this study cannot determine the direction of the relationship, either position implies
that the association between friends’ behavior and a respondent’s behavior will be
stronger when the respondent is located in a more central position within the friendship
network.
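
Popularity as defined here is an in-degree count, which a few lines of Python can illustrate. The sketch assumes nominations are stored as (sender, receiver) pairs and that repeated or self-directed nominations should not be counted; the function name and the toy data are hypothetical.

```python
from collections import Counter


def popularity(nominations):
    """Count the distinct schoolmates who nominate each student."""
    distinct = {(s, r) for s, r in nominations if s != r}
    return Counter(receiver for _, receiver in distinct)


# Hypothetical nominations; the last pair repeats the first.
nominations = [("ann", "bob"), ("cat", "bob"), ("bob", "ann"), ("ann", "bob")]
pop = popularity(nominations)
# bob is nominated by two distinct students, ann by one, cat by none.
```

A `Counter` returns 0 for students who receive no nominations, so isolates need no special handling.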

The guiding premise of these expectations is that adolescent delin- 
quency can better be understood by incorporating characteristics of social 
networks. A network perspective will enable integration of findings from 
social control and differential association theory by premising that ado- 
lescents engage in delinquency not only because their friends are partic- 
ipating, but because their location within the friendship network and other 
structural characteristics of the network determine whether the network 
has more or less social control over members’ behavior. That is, integration
within a friendship network will be associated with delinquency
depending on the behavior of the network (pro- or antidelinquent), and 
the structural properties of the friendship network will maximize or min- 
imize this association. 


DATA 


While fundamental, the effect of social networks within a school context 
on adolescents’ delinquency has rarely been studied. In part, this is be- 
cause the requisite data have not been available. Understanding social 
networks’ influence on adolescent delinquency requires detailed popu- 
lation-level data on the structure of friendship patterns within a school, 
for many different schools. Until recently, the only data that approached 
these stringent requirements was Coleman’s landmark study (1961) of 
social relationships among high school students in the 1960s. Although 
these data provided insight into the importance of peer relationships and 
adolescent culture, they were only available for a small number of schools, 
so that generalizability to other schools and adolescents as a whole was 
prevented. 

This study utilizes data from the first wave of the Add Health survey, 
a newly available data set consisting of a nationally representative sample 
of adolescents in grades 7-12 nested within randomly selected schools in 
the United States during 1994-95. The innovative design of this sample, 
particularly its emphasis on the effects of multiple contexts of adolescents’
lives, allows for an examination of the correlates of adolescent health and 
health behavior (including delinquency) that goes considerably beyond 
prior research. 


In-School Interviews


In contrast to other nationally representative surveys of adolescent delin- 
quency that use sample designs to randomly select adolescents from the 
schools, the Add Health study contains very detailed social network data 
for students in 129 randomly selected schools stratified by region, urbanicity,
school type, ethnic mix, and size. Within each school, brief interviews
were conducted with every student attending the school on that
day (N = 90,000). During this in-school phase, students were asked to 
identify their best male and best female friends from a school roster (up 
to 5 friends of each sex), as well as provide some brief demographic
information and describe their participation in a few minor delinquent 
activities. Because friendship nominations were recorded by student iden- 
tification number located from school rosters, it is possible to link together 
most of the students in the schools, thereby recreating the social networks. 
From this complete network information, network characteristics and the 
mean delinquency rate of the respondent’s friendship network are 
calculated. 

Because this is a study of friendship networks within schools, it is 
important to acknowledge that the network data do not include detailed 
information on ties to friends who do not attend the school. Friends who 
do not attend school may be particularly likely to be delinquent (e.g., 
drop-outs), and delinquent adolescents attending school may be more 
likely to nominate these out-of-school friends. However, adolescents were 
able to nominate friends outside of the school, allowing the number of
out-of-school nominations per adolescent to be calculated
(this is the only information available on network structure outside of the 
school). This information reveals that the average number of outside nom- 
inations for adolescents is generally small, averaging 1.4 ties per adoles- 
cent. More important, out-of-school nominations do not significantly differ 
between delinquent and nondelinquent adolescents (1.4 vs. 1.3). Therefore, 
the assumption that delinquent and nondelinquent adolescents have 
friendship networks located within schools (rather than outside of school) 
appears reasonable. 


In-Home Interviews 


This project also draws on a second component of the Add Health study,
which contains extensive in-home interviews with a randomly selected
sample of adolescents. This nationally representative sample of adolescents
in grades 7-12 was drawn from school rosters provided by each
school, with students stratified by grade and sex. Approximately 200 adolescents
were selected from each of the 80 pairs of schools (the high
school and its feeder junior high school), yielding a sample of approximately
20,000 adolescents.


* For information about the design of the Add Health study, see the “Research Design”
section of the web site at http://www.cpc.unc.edu/projects/addhealth/design.html.

4 Although the maximum number of nominations allowed was 10, very few students
were affected by this restriction. In fact, the mean number of nominations an adolescent
identified (from their send network) was 4.15 (SD = 3.02).

These in-depth interviews involved the collection of more sensitive data 
such as an extensive series of questions that concern involvement in both 
property and violent delinquency. An additional advantage of these data
involves the use of laptop computers to maintain confidentiality about 
sensitive subjects such as delinquency. This method of data collection 
allowed respondents to maintain their anonymity by listening to pre- 
recorded questions about participation in different delinquent activities 
and then entering their responses directly into the computer. This self- 
report information on participation in a series of delinquent activities is 
used to construct the dependent variable—delinquency involve- 
ment—which is described in the following section. Because the dependent 
variable can only be constructed from the in-home survey (vs. the in- 
school survey), the in-home survey composes the main sample for this 
project. Network information from the in-school survey is appended to 
this sample. Drawing on these data components, the final sample for this 
study consists of approximately 13,000 adolescents nested within 120 
schools who had completed both in-school and in-home interviews. 


Dependent Variable 


To assess an adolescent’s involvement in delinquent behavior, this study 
employs a commonly used measure of delinquency involvement. This 
additive index of delinquency involvement is based on the self-reported 
responses from adolescents describing participation in a series of 14 different
delinquent activities during the past year. The 14 delinquency
items incorporated into the index are listed in table 1 and include painted 
graffiti, damaged property, shoplifted, stole something worth less than 
$50, stole something worth $50 or more, burglarized, borrowed a car 
without the owner’s permission, sold drugs, involved in a serious physical
fight, seriously injured another, used or threatened to use a weapon, participated
in a group fight, pulled a knife/gun on someone, or shot/stabbed
someone.


4 Of the 160 schools initially selected, 134 schools agreed to participate, yielding a
response rate of 79%. Of the 134 schools, 129 (96%) arranged for an in-school survey
to be completed between September 1994 and September 1995.

4 Short and Nye (1958) introduce the self-report method of measuring delinquency.
Subsequent work has established that adolescents do report their delinquent behavior,
these reports tend to be internally consistent, and the reports relate to differences in
official delinquency status and to other differences predicted by research and theory
(Hindelang, Hirschi, and Weis 1980).


TABLE 1
FREQUENCY DISTRIBUTIONS OF DELINQUENCY ITEMS

Dependent Variable                                  N       %

Delinquency index:*
  Painted graffiti:
    Never ..................................   12,332    91.2
    One or more times ......................    1,186     8.8
  Damaged other’s property:
    Never ..................................   11,135    82.4
    One or more times ......................    2,383    17.6
  Shoplifted from a store:
    Never ..................................   10,247    75.8
    One or more times ......................    3,271    24.2
  Stolen something worth less than $50:
    Never ..................................   10,813    80.0
    One or more times ......................    2,705    20.0
  Stolen something worth more than $50:
    Never ..................................   12,885    95.3
    One or more times ......................      633     4.7
  Burglarized:
    Never ..................................   12,870    95.2
    One or more times ......................      648     4.8
  Borrowed a car (w/o owner’s permission):
    Never ..................................   12,204    90.3
    One or more times ......................    1,314     9.7
  Sold drugs:
    Never ..................................   12,592    93.1
    One or more times ......................      926     6.9
  Got into serious physical fight:
    Never ..................................    9,457    70.0
    One or more times ......................    4,061    30.0
  Seriously injured someone:
    Never ..................................   11,186    82.7
    One or more times ......................    2,332    17.3
  Taken part in group fight:
    Never ..................................   10,980    81.2
    One or more times ......................    2,538    18.8
  Used or threatened to use weapon:
    Never ..................................   13,001    96.2
    One or more times ......................      517     3.8
  Pulled a knife/gun on someone:
    Never ..................................   12,961    95.9
    One or more times ......................      557     4.1
  Shot/stabbed someone:
    Never ..................................   13,303    98.4

  Smoked cigarettes:†
    Never ..................................    8,275    64.4
    One or more times ......................    4,574    35.6
  Drank alcohol:†
    Never ..................................    5,787    45.1
    One or more times ......................    7,036    54.9
  Got drunk:†
    Never ..................................    8,869    69.5
    One or more times ......................    3,893    30.5
  Raced on bike, boat, or car:†
    Never ..................................    5,496    44.8
    One or more times ......................    6,770    55.2
  Been in danger due to dare:†
    Never ..................................    7,777    60.8
    One or more times ......................    5,011    39.2
  Skipped school without an excuse:†
    Never ..................................    8,981    69.9
    One or more times ......................    3,868    30.1

NOTE.—Data for all items include only the past 12 months.

* Summed index based on items 1-14, below (Cronbach’s α = .83; mean = 1.72;
SD = 2.31).

† Items from the in-school interviews used to calculate mean delinquency of friendship
send/receive network (Cronbach’s α = .82; mean = 5.33; SD = 3.37).

The particular questions ask students to report how often in the past 
12 months they have participated in these activities. Each response is 
coded “0” if the respondent reported not participating in the activity during
the past year or coded “1” if they reported participation. Following
common procedures used to measure self-report delinquency, a summated
delinquency index is created based on responses to the 14 delinquency
items. The Cronbach alpha of 0.83 indicates that these items have considerable
internal consistency. Although the average delinquency level
for respondents is 1.7, with considerable variation around this mean, many
respondents (44%) report never participating in any delinquent activities.


14 All analyses were additionally carried out with responses to each delinquency item
coded on the original ordinal scale (0 = never participate to 3 = 5 or more times)
and with separate property and violence delinquency indices. Findings from these
additional analyses closely dovetailed those presented here for the overall delinquency
index. Peer behavior was found to be just as robustly associated with property delinquency
as it was with violent delinquency. Network characteristics also operated quite
similarly to results presented here. The only differences to emerge were in regard to
a few of the effects of control variables. Results are available upon request from the
author.
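
The coding and scaling steps described above can be sketched as follows. The example assumes each respondent's answers are stored as a list of ordinal frequency codes (0 = never), uses the standard formula for Cronbach's alpha, and runs on three hypothetical items rather than the 14 actual ones; all names and data are illustrative.

```python
from statistics import variance


def dichotomize(responses):
    """Recode ordinal frequency codes to 0 (never) or 1 (one or more times)."""
    return [1 if r > 0 else 0 for r in responses]


def delinquency_index(responses):
    """Additive index: the number of different activities reported."""
    return sum(dichotomize(responses))


def cronbach_alpha(rows):
    """Cronbach's alpha for a respondents-by-items table of scores."""
    k = len(rows[0])
    item_vars = [variance([row[i] for row in rows]) for i in range(k)]
    total_var = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical answers from four respondents to three items.
raw = [[0, 0, 0], [2, 0, 0], [1, 3, 0], [3, 1, 2]]
coded = [dichotomize(r) for r in raw]
indices = [delinquency_index(r) for r in raw]  # [0, 1, 2, 3]
alpha = cronbach_alpha(coded)                  # approximately 0.75
```

Because every item is dichotomized before summing, the index counts how many kinds of activity a respondent reports, not how often each occurred.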


Peer Delinquency 


Most of the prior research on the delinquency-peer association assumes 
that respondents’ perceptions of their friends’ attitudes and behaviors 
accurately reflect the reality of these attitudes and behaviors without 
allowing for the powerful influence of assumed similarity (Jussim and 
Osgood 1989). Problems with this common approach to measuring the 
influence of peer behaviors have been highlighted in the developmental 
literature and, to a lesser extent, in the criminological literature. Generally, 
this literature notes that respondents’ perceptions of friends’ behavior are 
not very accurate assessments of the friends’ actual behavior (Bauman 
and Fisher 1986; Billy, Rodgers, and Udry 1984). These findings give 
particular credence to Gottfredson and Hirschi’s (1990, p. 157) argument 
that measures of peer delinquency are simply “another measure of self- 
reported delinquency.” That is, when asked to report their peers’ delin- 
quent behavior, adolescents show a proclivity to report their own delin- 
quent behavior. If this is the case, it casts considerable doubt on the 
frequently observed relationship between peers’ delinquency and adoles- 
cents’ own reports of their delinquency. 

To avoid problems of projection (i.e., a respondent projecting his or her 
behavior to friends), this study relies on the actual responses of all iden- 
tified friends to a series of questions about delinquency. During the in- 
school interview, every student was asked about his or her involvement 
in a series of six minor delinquent acts. Using the friendship nomination 
data, which link all of the students in the school, and this information
on friends’ delinquency involvement, a measure of peer network delinquency
is created. It is calculated as the average response of all identified
friends to the minor delinquency items, which include smoked cigarettes, 
drank alcohol, got drunk, skipped school without an excuse, did dangerous 
things on a dare, and raced vehicles such as cars or motorcycles (see table 
1 for a description of the six items). Table 1 indicates that the mean minor
delinquency of a respondent’s peer network is 5.3, signifying that members
of the typical respondent’s friendship network committed five minor delinquency
activities once or twice in the past 12 months. Although these
are admittedly minor deviant activities, which may provide a conservative
estimate of the association between peer behavior and an adolescent’s
more serious delinquency involvement, Osgood and colleagues (1988) present
evidence for the generality of deviance, which provides some support
for the use of this more conservative measure.


The items running away from home, lying to parents, and disorderly conduct had
very low intercorrelations with the index and therefore were not included in the index.

Thanks to a reviewer for pointing out that what I have measured in this study are
really characteristics of the adolescent’s egocentric network, which is not quite the
same as a “peer group” in the sense that there is not identification of particular cliques
or subgroups. Instead, the measure used in this study benefits from the fact that
adolescents tend to belong to multiple cliques in school, some of which overlap, while
others may not.
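
A minimal sketch of the peer network delinquency measure follows. It assumes each student's own answers to the six minor delinquency items have already been combined into a single score; that scoring, the function name, and the toy data are assumptions for illustration only.

```python
def mean_friend_delinquency(ego, nominations, minor_scores):
    """Average the self-reported minor delinquency of ego's friends.

    Friends are students whom ego nominates or who nominate ego.
    `minor_scores` maps a student to his or her own self-reported score;
    friends without a completed in-school survey are skipped.
    """
    friends = set()
    for sender, receiver in nominations:
        if sender == ego and receiver != ego:
            friends.add(receiver)
        elif receiver == ego and sender != ego:
            friends.add(sender)
    reported = [minor_scores[f] for f in friends if f in minor_scores]
    if not reported:
        return None  # no friend provided a self-report
    return sum(reported) / len(reported)


# Hypothetical data: dan is nominated but did not complete the survey.
nominations = [("ann", "bob"), ("cat", "ann"), ("ann", "dan")]
minor_scores = {"ann": 12, "bob": 6, "cat": 3}
peer_mean = mean_friend_delinquency("ann", nominations, minor_scores)
```

The respondent's own score (12 for ann above) never enters the average, so the measure relies on friends' self-reports instead of the respondent's perceptions.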

In addition to constructing the mean delinquency rate of the friendship 
network, the connections among adolescents are used to describe the dif- 
ferent characteristics of their friendship network: density, centrality, and 
popularity. Table 2 provides definitions and calculation of the ego-centered 
network attributes, and table 3 displays the descriptive attributes of the 
sample. This latter table indicates that the average adolescent has a network
density of 0.42 (range 0.20-0.83), revealing that fewer than half of the possible
ties among members of an adolescent’s egocentric network are present. This finding
is consistent with results from network studies, which find that most 
people are not in tightly knit, tightly bounded networks, but instead have 
sparsely knit, loosely bounded personal networks (Wellman 2000). In 
terms of positioning within the friendship network, the typical adolescent 


1 Unfortunately, information on friends’ delinquency is only available from in-school 
interviews, which do not include the friends’ involvement in the extensive list of 
delinquency items used to create the dependent variable (which were collected during 
the in-home portion of the interview). Therefore, a measure of friends’ minor delin- 
quency involvement is the only information available for all adolescents composing 
the respondent's friendship network in the schools included in the complete sample. 
In the Add Health data, in-home interviews were conducted with every adolescent 
attending school in 14 small schools, chosen because of their size, and in two large 
schools, which were purposefully selected. (This is referred to as a saturation sample). 
This very limited sample, therefore, contains information on friends’ involvement in 
the more extensive list of 14 delinquency items, as well as the minor delinquency items. 
To gain an initial benchmark of how big a discrepancy there is between the association 
of friends’ mimor delinquency involvement and a respondent’s more serious delin- 
quency involvement—compared to the association between friends’ more serious de- 
linquency and a respondent’s serious delinquency index—some preliminary regression 
analyses were conducted on the saturation sample. The correlation between friends’ 
minor and friends’ serious delinquency involvement in the saturation sample (r = 


(b = .06 using friends’ minor delinquency compared to b = 0.11 using friends’ serious 
delinquency). Nonetheless, despite a weaker association, friends’ minor delinquency
involvement remained significantly associated with a respondent’s more serious
delinquency involvement, which supports the use of this measure in subsequent
analyses, so that results can be generalized from a more representative sample.
Variable                               Mean (%)     SD      Min      Max        N

Dependent variables:
  Delinquency index                      1.72      2.31     .00    14.00    13,518
Egocentric network characteristics:
  Mean friendship delinquency rate       5.34      3.37     .00    30.00    13,518
  Popularity                             4.55      3.61     .00    32.00    13,518
  Centrality (Bonacich)                   .82       .64     .00     4.29    13,518
  Density (relative)                      .42       .10     .20      .83    13,518
Background individual characteristics:
  Male (ref)                               48       ...     .00     1.00    13,518
  Female                                   52       ...     .00     1.00    13,518
  White (ref)                              51       ...     .00     1.00    13,518
  Black                                    23       ...     .00     1.00    13,518
  Other race                               26       ...     .00     1.00    13,518
  Age                                   15.61      1.70   11.00    19.00    13,516
  Age²                                 246.47     52.75  121.00   361.00    13,516
  Two-parent family                        70       ...     .00     1.00    13,518
  Public assistance receipt                 7       ...     .00     1.00    13,518
  Parent attachment index                4.56       .77     .00     5.00    13,513
  School attachment index                3.78       .86    1.00     5.00    13,365
  Friend attachment                      4.25       .80    1.00     5.00    13,507
  Friend involvement                     1.98       .99     .00     3.00    13,518

NOTE.—Data for students in schools with more than 50 respondents completing both
in-school and in-home interviews.


has a centrality level of 0.82 (range 0–4.3) and 4.6 other adolescents in
the school nominating him or her as a friend (popularity range: 0-32). 
Control variables associated with delinquency in prior criminological
research are included in this study, and measurement of these variables
is also described in table 2. These control variables include gender, race,
age, an indicator of family structure, receipt of public assistance, a parental 
attachment index, a school attachment index, a friend attachment index, 
and a measure of friend involvement. These control variables were se- 
lected because they either control for important demographic differences 
in delinquency propensity (e.g., age, gender, race, social class, family struc- 
ture) or because they are implicated as key variables in social control and 
differential association theory (e.g., parental attachment, school attach- 
ment, peer attachment, friend involvement). Table 3 provides the de- 
scriptive characteristics of these variables and reveals that the average 
adolescent is white, lives in a two-parent family, has relatively high
attachment to parents and friends, and spends a considerable amount of
time “hanging out” with friends.


ANALYSIS 


Assessing whether peer network characteristics are associated with delin- 
quency and condition the delinquency-peer relationship necessitates mul- 
tivariate analyses with emphasis on interaction terms. However, deter- 
mining the proper modeling procedure for the analyses requires examining 
the distribution of the dependent variable: delinquency involvement. 
While the mean value of the overall delinquency index is 1.7 (see table 
1), the distribution of the variable is far from normal. Inspection of the 
frequency distribution for the delinquency index finds that the most com- 
mon value is zero, indicating that many adolescents report no delinquency 
involvement. Additionally, a minority of respondents report involvement 
in a large number of delinquent activities (5% report participating in six
or more acts, and 1% report participating in 10 or more). Due to a large
number of zeros and a large positive skew in the distribution, the nor- 
mality assumption of OLS cannot be approximated with a mathematical 
transformation. Therefore, negative binomial regressions, designed to handle
dependent variables with distributions incorporating many zero values
and large positive skews, are used in the results that follow.¹⁰
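
The distributional argument above can be checked roughly against the summary statistics reported for the delinquency index (mean 1.72, SD 2.31). The sketch below compares the variance with the mean, which would be equal under a Poisson model, and computes the share of zeros a Poisson model with that mean would imply. The function name is hypothetical, and the figures are back-of-the-envelope illustrations rather than results from the study.

```python
import math


def dispersion_ratio(mean: float, sd: float) -> float:
    """Variance-to-mean ratio; a Poisson variable has a ratio of 1."""
    return sd ** 2 / mean


# Summary statistics for the delinquency index, as reported in the
# descriptive table (rounded to two decimals there).
mean_delinq = 1.72
sd_delinq = 2.31

ratio = dispersion_ratio(mean_delinq, sd_delinq)

# Share of zeros a Poisson model with this mean would predict: P(Y = 0).
poisson_zero_share = math.exp(-mean_delinq)

print(f"variance / mean = {ratio:.2f}")
print(f"Poisson-implied share of zeros = {poisson_zero_share:.3f}")
```

A ratio well above 1 is the overdispersion that motivates the negative binomial specification over a Poisson one.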

Because standard negative binomial models assume that regression co- 
efficients are fixed between groups and that error terms are not correlated, 
these models are inadequate for complex sampling designs where indi- 
viduals are nested within a larger macrounit (here schools; see Goldstein 
1987; Lee and Bryk 1989; Raudenbush and Bryk 1986). Because the data are
clustered and the errors are correlated, statistical techniques that correct
for design effects and unequal probability of selection are necessary to
achieve unbiased parameter estimates.¹¹ In addition, since


10 The negative binomial model differs from Poisson regression by the addition of a
residual variance parameter that captures overdispersion in the dependent variable
(which occurs when the variance is greater than the mean [Gardner, Mulvey,
and Shaw 1995]). Designed for dependent variables that are counts of events, negative
binomial models utilize a distribution that characterizes the probability of observing
any discrete number of events, given an underlying mean count of events (Osgood
1999). In the negative binomial regression model, $\mu_i = \exp(\sum_k \beta_k X_{ik})$,
which indicates that a one-unit increase in $X_k$ multiplies the expected delinquency
index by a factor of $\exp(\beta_k)$, and conversely, a one-unit decrease divides the
expected index by the same amount (Gardner et al. 1995).

11 See Kim Chantala and Joyce Tabor’s 1999 article, “Strategies to Perform a Design-
Based Analysis Using the Add Health Data,” on the Add Health web site at
http://www.cpc.unc.edu/projects/addhealth/strategies_Data.html.
dispersion or unexplained variation in individual outcomes is likely to
vary across schools because of unidentified school-specific reasons (e.g.,
school delinquency rate, student-teacher ratio, school size, region of the
country), a random-effects overdispersion model is used in all analyses.
This type of random-effects model allows dispersion to vary randomly
from school to school. The software package Stata, which allows
incorporation of the survey design characteristics into the computational
formulas, is used for all analyses.¹²


FINDINGS 
Delinquency-Peer Association 


Before addressing the main question about the moderation of delinquency- 
peer associations by network characteristics, it is useful to examine 
whether network characteristics are associated with delinquency and im- 
prove model fit after accounting for control variables. The multivariate 
analyses in table 4 present models that examine the association between 
delinquency involvement and background individual-level characteristics. 
Model 1 simply provides a baseline against which to assess model fit. 
Consistent with prior research, variables that tap dimensions of social 
control are associated with a reduction in delinquency involvement (the 
exception is attachment to friends), whereas involvement with friends (i.e.,
the amount of time spent with friends) is associated with increased
delinquency involvement. Model 2, which is of more interest, assesses
whether a measure of peer delinquency is associated with an adolescent’s 
delinquency. Recall that a common critique of prior studies’ findings of 
a strong association between delinquent peers and a respondent’s self- 
report delinquency is that the friends’ behavior is measured through the 
respondent’s potentially biased perceptions. 

Model 2 indicates that friends’ delinquency is associated with a re- 
spondent’s own delinquency; each unit increase in friends’ delinquency 
yields a 4% increase in the respondent’s mean delinquency level 
(exp (.04) = 1.04). This finding indicates that friends’ delinquency is not 
simply a measurement artifact reflecting a respondent’s own delinquency 
involvement, but rather remains an important correlate of a respondent’s 
delinquency even after controlling for a host of different individual-level 
factors. With maximum-likelihood estimates, it is possible to determine 
whether a more complex model provides a better fit to the data than do 
simpler models using likelihood ratio significance tests. Model 2 yields a 


12 The Stata procedure xtnbreg was used in all multivariate analyses.
likelihood ratio value of 215.5 in comparison to the baseline model (model
1), which is statistically significant (df = 1, P < .001). 
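
The two calculations in this passage can be reproduced by hand. The sketch below converts a negative binomial coefficient into a percent change in the expected count and evaluates the upper-tail chi-square probability for a likelihood ratio statistic with one degree of freedom. The function names are hypothetical, and the closed-form expression used for the p-value holds only for df = 1.

```python
import math


def percent_change(b: float) -> float:
    """Percent change in the expected count for a one-unit increase in X."""
    return (math.exp(b) - 1.0) * 100.0


def lr_pvalue_df1(lr_stat: float) -> float:
    """Upper-tail chi-square probability for 1 degree of freedom.

    Uses the identity P(chi2_1 > x) = erfc(sqrt(x / 2)).
    """
    return math.erfc(math.sqrt(lr_stat / 2.0))


b_friends = 0.04   # coefficient on friends' delinquency reported for model 2
lr_stat = 215.5    # likelihood ratio statistic, model 2 versus model 1

print(f"{percent_change(b_friends):.1f}% per unit of friends' delinquency")
print(f"significant at P < .001: {lr_pvalue_df1(lr_stat) < 0.001}")
```

For comparisons involving more than one added parameter, a general chi-square routine would be needed in place of the df = 1 shortcut.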

To assess the strength of the association between delinquent peers and 
delinquency compared to other control variables, it is useful to standardize 
the coefficients by multiplying the bs by the Xs’ standard deviations. 
The coefficients are thus expressed in terms of change in log(Y) per standard
deviation of X_k. Column 3 for each model presents these standardized
coefficients (exponentiating the results indicates the proportional change
in delinquency per standard deviation of X_k). In terms of delinquent peers,
each standard deviation is associated with a 13% increase in a respon- 
dent’s delinquency (exp(.13) = 1.13). The effect of delinquent peers is thus 
larger than the effect of family structure (two-parent family), public as- 
sistance receipt, and attachment to friends; is comparable to the effect for 
race, parental attachment, and school attachment; and is smaller than the 
gender, age, and peer involvement effects. The finding that peer
delinquency is among neither the strongest nor the weakest variables in the
model suggests that prior studies in which delinquent peers had the strongest
association may have overestimated the strength of the delinquency-
peer association by using potentially biased respondent perceptions.
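
The standardization described above can be sketched as follows. The inputs are the rounded coefficient for friends' delinquency (.04) and the standard deviation of the mean friendship delinquency rate from the descriptive table (3.37). Because both inputs are rounded, the output differs slightly from the published standardized coefficient of .13. The function name is hypothetical.

```python
import math
from typing import Tuple


def standardized_effect(b: float, sd_x: float) -> Tuple[float, float]:
    """Return (b * SD of X, exp(b * SD of X)).

    The first value is the change in log(Y) per standard deviation of X;
    the second is the corresponding multiplicative change in Y.
    """
    b_std = b * sd_x
    return b_std, math.exp(b_std)


b_friends = 0.04    # rounded coefficient for friends' delinquency
sd_friends = 3.37   # SD of mean friendship delinquency rate

b_std, factor = standardized_effect(b_friends, sd_friends)
print(f"standardized coefficient = {b_std:.3f}")
print(f"multiplicative change per SD = {factor:.3f}")
```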

To determine the main effect of network characteristics on delinquency 
involvement (before assessing whether they condition the delinquency- 
peer association), model 3 incorporates the network measures of density, 
centrality, and popularity. Findings indicate that density and centrality 
within friendship networks are not associated with delinquency involve- 
ment when relevant control variables and peer delinquency are accounted 
for. Network popularity, in contrast, is associated with a small increase 
in delinquency involvement. Each additional friendship nomination re- 
ceived translates into a 2% increase in delinquency involvement. Peer 
delinquency remains associated with delinquency when the network char- 
acteristics are incorporated into the model. 

Although results from this model suggest that structural characteristics 
of friendship networks are unassociated with delinquency when peer de- 
linquency is accounted for, the model does not tell us whether network 
characteristics play a role in the degree to which peer behavior is trans- 
lated into the adolescent’s own behavior. Assessing this requires inter- 
action models. Finding significant interactions between peer delinquency 
and network properties will suggest that network characteristics are not 
as important in regard to their main effect on delinquency, but are very 
important in understanding how the context of friendship networks
determines the degree to which peer behavior is associated with an
adolescent’s own behavior. That is, network characteristics may determine the
intensity of differential associations. 
Interaction Models


Interactions between peer delinquency and each network characteristic 
of interest are examined in models 4–6 of table 5. Each model focuses on
a particular network characteristic or property of an adolescent’s friend- 
ship network. Model 4 examines whether the association between friends’ 
delinquency and a respondent’s own delinquency is conditioned by the 
cohesion within the friendship network (i.e., network density). Results
reveal this is the case; the interaction between peer delinquency and den- 
sity is very strong and significant (exp(.33) = 1.39), indicating that density 
in conjunction with delinquent peers is associated with a large increase 
in self-reported delinquency. In contrast, the coefficient for density, in 
model 4, suggests that when adolescents do not have delinquent peers, 
density is associated with a reduction in delinquency involvement 
(exp(-1.77) = 0.17). This provides strong evidence that network density
is an important structural component of peer networks in its ability to 
moderate the delinquency-peer association. 

Graphing this relationship in figure 1 reveals that high density in com- 
bination with a delinquent peer network translates into greater delin- 
quency involvement, whereas low density is less related to delinquency. 
Specifically, when an adolescent is located in a friendship network re- 
porting a minor delinquency index of 5, medium and low density in the 
peer network is associated with a respondent’s serious delinquency index 
of 0.33; however, being enmeshed in a dense peer network translates into 
a serious delinquency index of 1.3. Moreover, when friends’ delinquency 
is at higher levels (equals an index of 10), a less cohesive friendship net- 
work is associated with a respondent’s mean delinquency index of 0.45 
versus 1.75 for respondents located in very dense friendship networks. 
What is important in this graph is that as density increases, the association 
between peer delinquency and an adolescent’s own delinquency involve- 
ment increases rapidly. This highlights that the relationship between 
friends’ delinquency and a respondent’s own delinquency is conditioned 
by the cohesion present in the friendship network. 

Network centrality, an indicator of positioning within a friendship net- 
work, also moderates the delinquency-peer association. Specifically, model 
5 indicates that location within a central position in a delinquent peer 
network is associated with an increased delinquency index (exp(.05) = 
1.05). Conversely, location in a central position within a nondelinquent 
friendship network is associated with a reduction in delinquency
involvement (exp(-.27) = 0.76). Graphing this relationship in figure 2 illustrates
the interaction. With friendship delinquency equal to 5, there is little effect 
of network centrality (low, average, and high centrality = 0.28). With 
increased peer delinquency (equal to 10), low centrality corresponds to a 
FIG. 1.—Interaction: density by friends’ delinquency


delinquency index of 0.32, average centrality to a delinquency index of 
0.37, and high centrality to a serious delinquency index of 0.56. These 
results indicate that the association between friends’ delinquency and a 
respondent’s own level of delinquency is stronger when the respondent 
is located within a central position in the friendship network.

The last network characteristic, popularity, is examined in model 6. 
Here it is found that although the delinquency-peer relationship is
conditional, popularity plays a smaller role in moderating the
delinquency-peer association. When an adolescent is without delinquent friends,
popularity has a small negative relationship with delinquency status
(exp(-.03) = 0.97). In contrast, when located in a delinquent peer network,
popularity is associated with a small increased delinquency index 
(exp(.01) = 1.01). Specifically, figure 3 indicates that when an adolescent 
is located in a peer network that averages a minor delinquency index of 
5, high popularity is associated with a delinquency index of 0.37 compared 
to 0.34 and 0.32 for average and low popularity, respectively. With greater 
peer delinquency (index = 10), high popularity is associated with a de- 
linquency index of 0.69 versus 0.47 and 0.39 for average and low 
popularity. 

Overall, these results indicate that all of the network characteris- 
tics—density, centrality, and popularity—condition the delinquency-peer 
association. Incorporation of the underlying pattern of relationships 
among adolescents explains when friendship networks are more or less 
effective in constraining adolescents’ behavior to resemble that of their 
peers. However, the size of the coefficients and graphs of the interactions 
FIG. 2.—Interaction: centrality by friends’ delinquency


suggest that network density (i.e., cohesion) is a particularly important 
structural property, followed by network centrality (i.e., positioning) and
popularity. 

To better assess which network property is more important in explaining 
when the behavior of peers is associated with an adolescent’s own be- 
havior, it is useful to examine a model with all three network interactions 
incorporated. This allows for the effect of each network characteristic to 
be assessed independently while controlling for the others. Model 6 in 
table 6 presents findings from this analysis and indicates that while all 
of the network interactions remain significant, that is, moderate the
delinquency-peer association, density is the most important component of
friendship networks. Peer delinquency in conjunction with greater co- 
hesion is associated with an increased delinquency index (exp(.16) = 
1.17), whereas centrality within the friendship network and popularity 
play much more modest roles (exp(b) = 1.03 and exp(b) = 1.01, for cen- 
trality and popularity, respectively) in moderating the delinquency-peer 
association. Graphing the relationship between peer delinquency and a 
respondent’s delinquency in figure 4 reveals that when an adolescent is 
located in a friendship network with an average minor delinquency index 
of 10, high density within the network translates into a serious delinquency 
index of 1.64 versus 0.49 for centrality and 0.61 for popularity. 
TABLE 6
COMPLETE INTERACTION MODEL: RANDOM EFFECTS NEGATIVE BINOMIAL MODEL
OF DELINQUENCY INVOLVEMENT

                                                 MODEL 6
                                          Coeff. (SE)        β

Intercept                                 -.98 (1.02)
Control variables:
  Black                                    .26 (.03)        .11
  Other race (nonwhite)                    .25 (.03)        .11
  Female                                  -.50 (.02)       -.24
  Age                                      .38 (.13)        .62
  Age²                                    -.02 (.00)       -.60
  Two-parent family                       -.09 (.02)       -.05
  Public assistance                        .13 (.04)        .04
  Friend involvement                       .17 (.01)        .17
  Friend attachment                       -.03 (.01)       -.02
  Parent attachment index                 -.13 (.01)       -.11
  School attachment index                 -.21 (.01)       -.18
  Friends’ delinquency                     .00* (.01)       .00
Network characteristics:
  Density                                -1.20 (.36)       -.11
  Centrality                              -.15 (.04)       -.10
  Popularity                               .00* (.01)       .00
Interactions:
  Density x friends’ delinquency           .16 (.05)        .12
  Centrality x friends’ delinquency        .03 (.00)        .09
  Popularity x friends’ delinquency        .01 (.00)        .08

NOTE.—N = 12,971 adolescents clustered within 120 schools. All variables are
significant at P < .05 unless noted. SEs are given in parentheses. Log likelihood =
22,062.06; 2 log likelihood (compared to model 2) = 1482 (P < .001).
* Not significant.
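
To illustrate how the interaction terms in table 6 combine, the sketch below computes the factor by which expected delinquency changes for a one-unit increase in friends' delinquency at chosen values of density, centrality, and popularity. It uses the rounded coefficients as printed, so the results are approximate. The function name and the choice of evaluation points (sample means and the sample maximum for density, taken from the descriptive table) are assumptions made for illustration and are not calculations reported by the author.

```python
import math

# Coefficients as printed in table 6 (rounded to two decimals).
B_FRIENDS = 0.00        # main effect of friends' delinquency
B_DENSITY_X = 0.16      # density x friends' delinquency
B_CENTRALITY_X = 0.03   # centrality x friends' delinquency
B_POPULARITY_X = 0.01   # popularity x friends' delinquency


def friends_delinquency_multiplier(density: float,
                                   centrality: float,
                                   popularity: float) -> float:
    """Multiplicative change in expected delinquency per one-unit increase
    in friends' delinquency, holding the network measures at given values."""
    log_effect = (B_FRIENDS
                  + B_DENSITY_X * density
                  + B_CENTRALITY_X * centrality
                  + B_POPULARITY_X * popularity)
    return math.exp(log_effect)


# Evaluate at the sample means, then raise density to its sample maximum.
at_means = friends_delinquency_multiplier(0.42, 0.82, 4.55)
at_max_density = friends_delinquency_multiplier(0.83, 0.82, 4.55)

print(f"at sample means:    {at_means:.3f}")
print(f"at maximum density: {at_max_density:.3f}")
```

The multiplier grows as any of the three network measures increases, which is the sense in which each one conditions the delinquency-peer association.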


CONCLUSION 


In this project, the delinquency-peer association was explored, guided by
the premise that a more complete understanding of peer influence requires 
an examination of the characteristics of friendship networks in which 
adolescents are embedded. Although few adolescents report participation 
in many serious delinquent activities, the majority of adolescents report 
participation in at least one serious delinquent activity, with a minority 
reporting high levels of serious delinquent involvement. Moreover, most 
adolescents are located in friendship networks that report some minor 
delinquency involvement. By incorporating a social network approach, 
this research contributes to research and theory on the conditions under 
which differential associations are maximized. Overall, the findings pre- 
sent a picture more complex than that provided by social control theory 
and differential association theory alone and suggest that a network
perspective can provide a coherent and powerful framework for addressing
adolescent delinquency.

Two major findings emerge from this research. First, friends’ delin- 
quency, as measured by responses from friends who compose the adoles- 
cent’s friendship network, is associated with an adolescent’s own delin- 
quency involvement. This is an important finding because the 
delinquency-peer association found in prior research has been criticized 
for inadequate and potentially misspecified measures of peers’ behaviors. 
Since most prior research incorporates a measure of peer delinquency 
based on responses from the respondents rather than the peers themselves, 
Jussim and Osgood (1989) caution that peer delinquency may simply 
measure the respondent’s own delinquent behavior due to processes of 
assumed similarity. In contrast, measures used in the present research 
incorporate the responses of friends themselves and indicate that the de- 
linquency-peer association exists and remains robust regardless of controls 
for numerous other factors. The significant association consistently evi- 
denced in this study gives credence to the idea that delinquency is best 
understood in the context of the friendship network where common norms 
and behaviors emerge from locations in structured patterns of relation- 
ships (Wellman 1988). 

Second, network properties summarizing the structure of friendship 
networks moderate the delinquency-peer association. Specifically, delin- 
quent friends have a lesser association with delinquency (and occasionally 
no association) when adolescents are located in a peripheral position 
within their peer network (low centrality), when their peer network is not 
very cohesive (low density), and when they have less prestige (low pop- 
ularity). Conversely, peers’ delinquency has a stronger association with 
an adolescent’s delinquency when the adolescents are located in a central 
position within their friendship network, when their friendship network 
is very dense, and when they are nominated as friends by many others. 

While all network characteristics were found to condition the delin- 
quency-peer association to some degree, one network characteristic ex- 
erted stronger influences than others. In particular, network density 
emerged as an important moderator of peer delinquency. Because network 
density represents the number of ties present in the friendship network 
divided by the number of possible ties in the network, it serves as an 
ideal measure of peer cohesion. Greater network cohesion better facilitates 
a common identity as either delinquent or nondelinquent and subse- 
quently places more constraint on the behavior of peer members to be 
consistent with the network’s behavioral disposition. The premise that 
densely knit networks have stronger norms and better communication 
and control has an extensive history in sociological studies (e.g., Durkheim 
1951; Sutherland 1947; Bott 1957; Fischer 1982), which makes it surprising 
that criminological studies have yet to incorporate network cohesion as
a variable central to the delinquency-peer association. 
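
The density definition given above (ties present divided by ties possible) can be written out in a few lines. This is a minimal sketch that assumes directed friendship nominations, so that a network of n members has n(n - 1) possible ties. The study's "relative" density measure may be computed differently, and the function name and the small example network are hypothetical.

```python
from typing import Iterable, Set, Tuple


def network_density(members: Set[str],
                    ties: Iterable[Tuple[str, str]]) -> float:
    """Directed density: observed ties divided by the n * (n - 1) possible ties."""
    n = len(members)
    if n < 2:
        return 0.0  # density is undefined for fewer than two members
    # Keep only distinct ties between two different members of the network.
    present = {(a, b) for a, b in ties
               if a != b and a in members and b in members}
    return len(present) / (n * (n - 1))


# Hypothetical four-person friendship network with six nominations.
members = {"A", "B", "C", "D"}
ties = [("A", "B"), ("B", "A"), ("A", "C"),
        ("C", "D"), ("D", "A"), ("B", "C")]

print(network_density(members, ties))
```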

Popularity within the friendship network is a network property that 
plays a smaller role in conditioning the delinquency-peer association. Re- 
call that popularity is indicative of the number of other adolescents in 
the school network that nominate the adolescent as a best friend. One 
explanation for the weaker interaction between popularity and peer
delinquency is that prestige may be indicative of the permeability of peer
group boundaries, that is, the extent to which members have contact
with individuals outside their immediate peer clique. An adolescent who
is located in a delinquent friendship network but also has high popularity 
may be well connected to other adolescents outside his or her immediate 
peer clique, resulting in weakened influence of the friendship network. 
In these situations, the intensity of the peer network may be reduced. In 
all likelihood, the more popular adolescents belong to several different 
peer cliques such that their egocentric network is large and heterogeneous
in terms of exposure to different behaviors, norms, and values in the 
school setting. 

These results highlight the constraining influence of peer networks on 
individual behavior when certain structural components of friendship 
networks are maximized. With a friendship network centered around 
delinquent activities, adolescents are even more likely to report self- 
involvement in delinquency when they are located in a very cohesive peer 
network. Conversely, when peer networks do not incorporate peer delin- 
quency, cohesion in the network is associated with lower risk of self- 
involvement in delinquency. This finding, if properly interpreted, is con- 
sistent in part with social control’s emphasis on the constraining influence 
of social bonds, although it appears even more compatible with differential 
association and social learning theories’ emphasis on the importance of 
the context of friendship networks. It is in this network context where 
social norms and values regarding delinquency are shared and validated. 
According to differential association and learning theories and consistent 
with this research, when delinquent peer networks are very cohesive, 
network members are at heightened exposure to definitions and behavioral 
patterns favorable to delinquency involvement. 

However, an alternative explanation is offered for the influence of net- 
work structure on adolescent delinquency. Critics of differential associ- 
ation and social learning theories argue that the observed delinquency- 
peer association is not due to peer influence, but rather to self-selection 
into delinquent peer networks based on prior behavioral dispositions. The 
cross-sectional data on which this study is based do not allow explicit 
determination of whether the observed associations solely reflect network 
influences on adolescent behavior or whether they additionally reflect the 
tendency of youth with similar behaviors to select each other as friends.
However, studies with longitudinal data have found that both influence 
and selection processes are responsible for similarities in adolescent be- 
havior (Bauman and Ennet 1996; Elliott and Menard 1996; Kandel 1978; 
Krohn et al. 1996; Matsueda and Anderson 1998; Thornberry 1987). Al- 
though the precise mechanism underlying the delinquency-peer associa- 
tions cannot be specified with the available data, the findings do suggest 
that personal networks containing delinquent members provide, at a
minimum, a supportive environment conducive to delinquency involvement.¹³

With this in mind, it is important to point out that if selection were 
mainly responsible for the association found between delinquent peers 
and self-involvement in delinquency, we would not expect to find the 
network characteristics conditioning the delinquency-peer association. For 
example, being located in a very dense peer network or in a very central 
position within the network should not influence the strength of the 
delinquency-peer association if adolescents are selecting peer networks to 
join that most closely match up with their own delinquent proclivities. 
Therefore, the strength and consistency of the pattern of network prop- 
erties found to condition the delinquency-peer association suggest expla- 
nations more consistent with a differential association or social learning 
approach to peer influence. However, to explicitly test whether self- 
selection or socialization is the primary mechanism responsible for the 
association, longitudinal data incorporating the social networks of
adolescents are necessary. In addition to longitudinal data on adolescent peer
networks, a theory of delinquency is needed that incorporates an under- 
standing of friendship formation. Such a theory would explain processes 
underlying selection into and out of peer networks. 

The findings presented in this project should be considered in light of 
the following limitations. As noted earlier, the data that this project is 
based on are cross-sectional, which limits the causal inferences we can 
make about the relationships between network characteristics and delin- 
quent behavior. Longitudinal data that allow measurement of changes in 
network characteristics and delinquency behavior over time would be 
extremely valuable. Such a data design would allow determination of 
whether delinquent behavior precedes selection into friendship networks 
or whether incorporation into peer networks occurs prior to delinquency 
involvement. Moreover, another reason for the use of longitudinal data 


13 Even if initial selection is viewed as the critical process, it remains unclear whether
the initial selection is due to the attraction to peers similar to oneself or to structural
forces that reduce freedom of movement in the social network, and thus predetermine
the network to which one is exposed (Cairns and Cairns 1994). School tracking is a
good example of a structural force that determines, in part, the selection of peers
similar to oneself.
for investigating peer networks, as well as the difficulty with such
analyses, involves the constantly changing nature of social networks. The
dynamic nature of network structures suggests that it is rare for friendship 
networks to remain unchanged even in a short period of time. Capturing 
and modeling these changing network structures remains a challenge for 
longitudinal analyses. 

The ideal longitudinal network analysis of friendship networks also 
would have included complete network data on a wide range of delin- 
quency items including more serious delinquency items. As indicated ear- 
lier, a drawback of the present study is that friends’ involvement in minor 
delinquency is used to predict the respondent’s more serious delinquency 
involvement. Unfortunately, this strategy tends to underestimate the 
strength of the actual relationship between friends’ serious delinquency 
and the respondent’s own involvement in serious delinquency. 

Additionally, while this study drew attention to common properties of 
peer networks, other network characteristics may be important and wor- 
thy of future investigation. For example, age, sex, and race heterogeneity 
of the friendship network and the percent of reciprocated friendship ties 
in the network are a few additional properties of peer networks that may 
condition the delinquency-peer association. 

Despite these limitations, the present study’s results show that the ap- 
proach of identifying and examining peer social networks provides a co- 
herent and promising framework for investigating a variety of ways that 
social relationships might be associated with adolescent delinquency. This
framework’s emphasis on the social connections among adolescents goes 
considerably beyond that of prior research, which viewed individuals as 
essentially separate from their social structure. 


SUMMARY 


Overall, this set of findings suggests that the patterning of adolescent 
relationships helps explain when peer delinquency is most and least im- 
portant. Not all adolescents are influenced to the same degree by their 
peer associations, and when the patterning of relationships between ad- 
olescents provides more opportunities for interactions among members, 
peer delinquency plays a larger role in the adolescent’s own delinquent 
behavior. Adolescents located within central positions in very cohesive 
friendship networks report behavior that is more closely associated with 
the delinquent behavior occurring in their network. This suggests that 
friendship networks expose these adolescents to more intense differential 
associations favoring delinquency involvement than they do adolescents sit-
uated in more peripheral positions within less cohesive networks. Thus,
positioning in the peer network provides different opportunities for peer
interaction resulting in varying exposure to delinquent behavioral models, 
communication of delinquent norms, access to information on delinquency 
opportunities, and opportunities for rewards or deterrents for delinquency. 
Examining properties of peer networks elucidates when the peer network 
is more or less effective in constraining the behavior of adolescents to be 
similar to that of their peers. This conclusion is consistent with the current 
emphasis on the significance of social contexts (e.g., community, neigh- 
borhood, schools) and suggests that an important context with implica- 
tions for adolescents’ behaviors is the peer networks in which youth are 
embedded. 

The Promotion Paradox: Organizational
Mortality and Employee Promotion Chances 
in Silicon Valley Law Firms, 1946-1996* 

This article argues that there is a “promotion paradox”—a negative
relation between firm life chances and employee promotion chances. 
I argue that this is due to a firm’s bargaining power, which increases 
with the firm’s competitive strength. I find strong support using 
data on 50 years of Silicon Valley law firms and attorneys. Young, 
small, specialist, and low-status firms are more likely to fail but are 
also contexts with the highest promotion likelihood. Moreover, ex- 
cept for those firms that are “near death,” an associate’s promotion 
likelihood increases with the law firm’s probability of failure. 


INTRODUCTION 


Many studies in organizational theory, as well as articles in the popular 
press, suggest that employees are most likely to obtain career advancement 
and long-term employment within firms that have favorable life chances 
(e.g., survival, productivity, profitability). An association between firm life 
chances and firm growth underlies this positive relationship (Whetten 
1987). Favorable life chances imply more openings within an organization, 
generating greater mobility. Indeed, the argument that firm life chances 
and employee promotion chances are positively related has great intuitive 
appeal. 

However, previous findings have underappreciated that a firm’s rela-
tion to its employees and the labor market is affected by its competitive
position. A consideration of the dynamics surrounding the bargaining 
strength of an employer suggests that there is a negative relation between 
firm life chances and employee promotion chances. In the exchange of 
rewards between the employer and employee, the stronger the firm’s bar- 
gaining position, the less the firm will be compelled to cede rewards and 
resources to the employee in the form of promotions. Therefore, employers 
with favorable life chances may actually disadvantage employee attain- 
ment. Ironically, this negative relation exists alongside the advantages due 
to growth. 

Likewise, this negative relation can be predicted from the potential 
employee’s perspective if one considers that employees choose jobs as a 
function of the expected value of income from employment. Using this 
rational choice approach, the expected value is the income earned as a 
junior employee plus the probability of being promoted times the net 
present value of the income stream associated with being a senior em-
ployee, manager, or owner.² If the expected value is observable and con-
stant across all firms, there is a tradeoff between promotion rates and the
net present value of the income stream. In other words, firms that are
more likely to fail offer less future income and thus must offer a higher
promotion rate to recruit employees.³
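
The tradeoff in the preceding paragraph can be written compactly. The notation is an assumption introduced here for illustration and does not appear in the article: $w_j$ is the income earned as a junior employee in firm $j$, $p_j$ is the probability of being promoted, $V_j$ is the net present value of the income stream of a senior employee, manager, or owner, and $\bar{E}$ is the common expected value across firms.

```latex
% Illustrative sketch only; symbols w_j, p_j, V_j, \bar{E} are assumed notation.
\begin{align}
  E_j &= w_j + p_j V_j, \\
  p_j &= \frac{\bar{E} - w_j}{V_j}
        \quad \text{when } E_j = \bar{E} \text{ for every firm } j.
\end{align}
```

Under the further simplifying assumption that $w_j$ is held fixed and $\bar{E} > w_j$, the promotion probability $p_j$ falls as $V_j$ rises. A firm that is more likely to fail offers a lower $V_j$ and so must offer a higher $p_j$ to match $\bar{E}$.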

This rational choice perspective is consistent with the bargaining power 
approach since firms with favorable life chances have bargaining strength 
to the extent that they have resources that members of the labor market 
value: higher future income. Similar to the notion of bargaining power 
within economics (see Lindblom 1948; Lindbeck and Snower 1986), per- 
spectives on the employment relationship are most effective when they 
are consistent from the perspectives of both the employer and the employee
(Siow 1994). It is reasonable to expect that the bargaining power and 
individual rational choice explanations are consistent rather than in com- 
petition. Whether due to the career choices of potential employees eval- 
uating the firm’s competitive position, or firm decisions that incorporate 
its competitive position, the resulting prediction is that there exists a 
negative relation between firm life chances and employee promotion 
chances. 

This article focuses on the likelihood of making partner in a law firm 
from the firm’s perspective. The bulk of the sociological research pointing 
to a relationship between firm life chances (e.g., firm growth) and employee 
promotion chances features conceptual and empirical analyses from the 


2 An employee may incorporate (discounted) postemployment opportunities as well.
3 I am grateful to an anonymous reviewer for noting the relevance of this alternative
perspective. 
standpoint of firms (Pfeffer 1983; Stewman and Konda 1983; Stewman
1986), often explicitly considering firms as conduits of attainment (Baron 
and Bielby 1980; Hannan 1988). A similar framework exists within the 
efficiency-wage theories that dominate much of neoclassical economics in 
which all labor market power rests with the firms (Lindbeck and Snower
1986). Therefore, this article’s perspective builds upon influential contri- 
butions to organizational sociology, stratification, and labor economics. 
At the same time, this article does not distinguish between which per- 
spective (firm or individual) most accurately explains the “promotion par- 
adox.” It is likely that the “promotion paradox” is the product of the 
rational decisions of firms and potential employees. 


The Relationship between Organizational and Career Processes 


This research agenda fills a void in our understanding of the relationship 
between organizational and career processes. Within the organizations 
community, the open-systems perspective has made commonplace our 
understanding that an organization’s environment has relevant ramifi- 
cations for the strategy, structure, and life chances of firms (Lawrence and 
Lorsch 1967; Meyer and Rowan 1977; Hannan and Freeman 1977; Scott 
1992). 

Despite this acknowledgement, researchers of labor markets and careers 
have yet to fully incorporate the relationship between employers and their 
environment into the understanding of stratification, in general, and career pro-
cesses, in particular (Baron and Bielby 1980). Most studies have focused
on individual or positional influences on promotion or the effect of or- 
ganizational or environmental characteristics on the existence of employ- 
ment-related structures. For example, studies of within-firm promotions 
have often focused on the role of vacancy chains and their relationship 
to the employer’s growth (White 1970; Stewman and Konda 1983; Wholey 
1985; Rosenbaum 1989). 

Less often do scholars of employee opportunity structures examine the 
impact of the organization’s success and fitness—with respect to its en- 
vironment—on the career trajectories of its members. Some studies have 
examined how labor supply, labor demand (Bridges and Villemez 1991), 
and institutional pressures (Baron, Davis-Blake, and Bielby 1986; Edel-
man 1990) affect the creation and evolution of internal labor markets.
Unfortunately, these studies do not explore whether the existence of formal 
opportunity structures is influenced by an employer’s performance.
Moreover, these studies do not directly examine career trajectories; rather,
they seek to explain the existence of particular bureaucratic governance
structures of opportunity and control. While we tend to infer promotion 
patterns based on organizational structures, we are unable to understand 
the variance within particular formal structures due to organizational life
chances. 

Thus, while the scholarship on internal labor markets and promotion 
chances has been a substantial contribution to organizational sociology, 
we have not fully developed an understanding of how career trajectories 
are influenced by the fate of the employer—where career trajectories are 
embedded. I argue that fruitful insights can be obtained by understanding 
how employee opportunity structures are influenced by an employer’s 
relationship to its ecological niche. To date, there have been few attempts 
to develop a research agenda capable of linking firm survival and career 
mobility. 

Research is moving in this direction, however. Brittain and Wholey 
(1991) examine the effect of industry volatility and uncertainty on quits 
and layoffs. However, they do not consider firms as instruments through 
which environmental pressures influence career trajectories. Haveman 
and Cohen (1994) show that population dynamics (e.g., foundings, merg- 
ers, and dissolutions) create and destroy jobs, thus generating career mo- 
bility. Drawing insight from the vacancy chain perspective, their study 
traces a clear link between population dynamics and job changes, marking 
a new path of research that examines the consequences of firm life chances 
on individual mobility. In another study that simultaneously considers 
organizational and career processes, Sørensen (1999) addresses how the 
structure of competition affects patterns of career mobility. He finds that 
firms that recruit from the same sources as their competitors have lower 
growth rates, due to the competition in the more dense regions of the 
labor market. While these studies have advanced the understanding of 
the relationship between firms and individual mobility, there exist many 
unanswered questions. Without further developing an understanding of 
the relationship between a population of firms and the careers of indi- 
viduals, the conclusions made by organizational theorists and career re- 
searchers will be sorely incomplete. 

One key to further understanding the relationship between the fate of 
firms and individual careers is exploring the relationship between a firm’s 
life chances and the promotion chances of individuals who work within
it. This article seeks to demonstrate that contradictory outcomes exist
and shape the opportunity structure of individual careers. That is, there 
is a negative relation between firm life chances and the promotion chances 
of its employees. Moreover, it uses bargaining power as a conceptual 
framework for understanding why this relation exists. In essence, firm 
life chances are associated with the firm’s bargaining strength in the 
employment relationship.⁴ Employers with strong life chances (i.e.,


4 While this study draws from an ecological approach and thus emphasizes firm life
strength in the “product” market) are also those employers with greater
bargaining advantage over their employees (i.e., strength in the labor 
market). I test the role of bargaining strength with a unique dataset on 
50 years of law firms in Silicon Valley and a subset of attorneys that 
worked within them. 


BACKGROUND AND THEORY


Organizational Growth and Employee Promotion Chances


The intuition that there is a positive relationship between employer and 
employee success is supported by studies that examine the impact of 
organizational growth on employee promotions (Stewman and Konda 
1983; Rosenbaum 1984; Wholey 1985). While this conclusion is also drawn by other
sociologists as well as economists (Chiang 1986; Holzer 1991), much of
the research linking growth and promotion chances has been developed 
by scholars in the vacancy chain tradition (White 1970). This research is 
accompanied by articles in the popular business press that claim long-
term employment and employee advancement occur in successful firms
(e.g., Hatvany and Pucik 1981). 

White’s (1970) seminal work introduced vacancy chains into discussions 
of mobility by noting how the retirement of Methodist clergy initiated a 
chain of vacancies filled sequentially by an occupant in a lower rank. 
Using insights from White (1970) and Keyfitz (1973), Stewman and Konda 
(1983; Stewman 1986) developed and refined a model of multiple grade 
ratios to predict promotion patterns. In addition to factors such as cohort 
size and exit rate, Stewman and Konda (1983) found that the rate of 
promotions increases with organizational growth. The logic is straight- 
forward: organizational growth involves the creation of new vacancies at 
the top and new subunits throughout the organization, which leads to 
them being filled by new hires and lower-ranked employees. Thus, 
whereas a retirement was the catalyst for increased attainment by lower- 
ranked employees in White (1970), organizational growth was also iden- 
tified as a mechanism for attainment. 

The relationship between firm growth and the probability of promotion 
is not confined to vacancy chain research, however. Rosenbaum (1984)
argues, with empirical evidence, that firm growth allows more people to 
be winners in a tournament selection. Pfeffer (1983) uses an analysis of 
a U.S. railway industry case study from 1885 to 1940 (Morris 1973) to 
draw a similar conclusion. He shows that a lack of growth is one of the 





chances, I expect my arguments to hold for other indicators of firm strength. For
institutions that rarely fail, such as universities, one would require an indicator of firm
strength that is more appropriate for the context.
factors responsible for a firm’s inability to hold or attract young talent.
The increasing net loss of talent resulted in further performance problems 
and organizational decline, leading to the failure of some of the organi-
zations in the railroad industry.

The extension of each of these arguments is that there is a positive 
relation between the life chances of firms and the promotion chances of 
their employees. However, I argue that alongside the positive contribution
of firm growth to employee promotion chances, a negative relation, also 
associated with firm life chances, exists. This relationship is captured with 
the following insight: favorable life chances are associated with bargaining 
power on the part of the firm. Bargaining power allows the firm to
limit the resources it shares with its employees without detrimental con-
sequences. This view is consistent with Sørensen and Kalleberg (1981)
and Sørensen (1983) in their argument that employers turn away from
internal promotions and toward the external labor market to the extent 
that the employer has greater power than the employee. Promotions are 
one of the rewards and resources that employers concede when their 
advantage is weakened.⁵


Bargaining Power and the Employment Relationship 


Generally, bargaining power is the capacity of a party to produce an 
agreement on its own terms (Chamberlain and Kuhn 1965). My use of 
bargaining power is similar to the use of informal bargaining power pre- 
sented by Cyert and March (1998, p. 187): “bargaining between two groups 
in which conflict of interest is implicit.” In other words, the concept does 
not require each party to explicitly bargain. Economists that have incor- 
porated sociological perspectives have used similar definitions of bar- 
gaining and bargaining power (Schelling 1956; Williamson 1983). Im- 
portant for this article is the notion that bargaining varies as the firm’s 
life chances improve or decline. I am interested in the firm-level variation 
of promotion rates as a function of a firm’s life chances. Employers that 
have weaker life chances have less bargaining leverage, which leads to 
higher promotion chances among the employees. The rate of promotion, 
therefore, is a barometer for the balance of bargaining power and resources 
between the employer and the employee. 

While this article focuses on the perspective of the firm, a similar re- 


5 Promotions that are purely symbolic fall outside of my conceptualization. Rather, I
assume that promotions involve some cost to the employer. I consider, as does Pfeffer 
(1981), that employers have two fundamental resources: (1) budgets and other monetary 
resources and (2) positions in the organizational hierarchy. I focus on the allocation of 
positions, with the assumption that monetary and nonpecuniary rewards are largely 
determined by position. 
lationship between firm strength and promotion can be derived from the
perspective of the potential employee. Two assumptions are necessary. 
First, firm strength is correlated with profits the firm shares with those 
promoted. In other words, stronger firms have greater rewards to share 
with their members than do weaker firms. Second, potential employees 
are able to select employers with the goal of maximizing their expected 
value of income. The potential employee is able to distinguish a set of 
employers on the probability of promotion and the rewards given to those 
promoted. The expected value of income is a function of rewards given 
to those promoted, as well as the probability of promotion. 

Given these two assumptions, weaker firms must offer higher proba- 
bilities of promotion in order to attract potential employees. The rela- 
tionship between firm strength and promotion chances is a result of po- 
tential employees rationally choosing employers based on a calculation of 
expected future income. The stronger the firm’s competitive position, the 
greater its rewards to those promoted and the greater its ability to min- 
imize the probability of promotion promised to potential employees. Com- 
petitive strength gives the employer the ability to offer lower promotion 
without punishment from the labor market. 

Similarly, one may reasonably posit that firms with strong life chances 
offer highly valued jobs that reward their employees with highly observ- 
able and positive information about their individual talent. Employees 
take jobs at firms with strong life chances because it will imply that they 
are high-quality employees. Future income is expected to be higher when
an employee takes a job in a firm with greater strength, since such
employment increases the income from future jobs. In this case, firms with
highly valued jobs are able to offer lower chances of promotion in
exchange for signaling the talent of their employees. Bargaining power, in
this case, is derived from the signaling ability of firms with strong life 
chances. 

This phenomenon is one that I term the promotion paradox. Firms with 
strong life chances (e.g., survival, productivity, profitability) have
greater bargaining power than do firms with weak life chances. Similar 
to Hannan’s (1998) use of firms occupying robust and fragile positions, 
strong life chances allow a firm to be less susceptible to environmental 
variation and shocks (such as labor turnover). While occupying strong 
competitive positions allows a firm to attract and retain employees seeking 
to obtain a share of the firm’s resources, the firm is able to deny the same 
resources to many employees without severe penalty. When coupled with 
the fact that weak life chances reflect a firm that has a high likelihood 
of failure, firms at risk of failure become the contexts for employee 
promotions. 

This proposition has both counterintuitive and paradoxical elements. 
First, it argues against the intuition derived from vacancy chain and
organizational growth studies, in addition to reports in the popular lit- 
erature. Whereas these contributions suggest that firms with strong life 
chances are also the best context for employee promotions, I modify the 
argument. Independent of firm growth rates, firms with weak life chances 
(i.e., having a high risk of failure) are the best contexts for employee
promotions. Second, the proposition is paradoxical. If promotion occurs 
more often in firms that have a high likelihood of failure, there is a risk 
of seeking tenure in a firm that is likely to fail. Firm failure, as Haveman 
and Cohen (1994) demonstrate, is tantamount to the destruction of one’s 
job. I argue that it is exactly this tension that often describes the em- 
ployment relationship. 

To be clear, I am not arguing that the prevalence of vacancies is ir- 
relevant to promotions. It is tautologically true that a job must exist if 
an individual is to fill it. Rather, I would argue that vacancy-based per- 
spectives tend to remain agnostic about conditions that lead to the creation 
of vacancies. Bargaining power is one important determinant of when 
vacancies are actually created. 


The Determinants of Firm Strength and the Promotion of Employees


Arguing that the likelihood of firm failure increases the likelihood of 
promotion carries the implication that the particular determinants of firm 
failure contribute to the promotion opportunities of employees. In order 
to verify this implication, I examine five determinants: competition, size, 
scope, status, and age.⁶ To be consistent with my main proposition, the
conditions under which each of these factors enhances the life chances of
the firm should be the same conditions under which they depress the 
promotion chances of employees. 

The critical test, however, is to determine whether employees in firms 
with weak life chances have a greater likelihood of being promoted. I 
expect that each of the aforementioned factors (density, size, scope, status, 
and age) will affect the life chances of law firms similarly to organizational 
forms examined in the past. In other words, low density, large size, broad 
scope, high status, and older age result in stronger life chances. Dem-
onstrating that the determinants of firm life chances are negatively related


6 Given the context of law firms, one may generate other determinants of law firm
strength, such as the identity of each firm’s clients. Unfortunately, data on law firm
clientele for firms from 1946 to 1996 is not consistently available. I assume that much
of the unobserved heterogeneity due to client identity is highly correlated with ob- 
servable variables such as firm size and scope. 
to employee promotion chances will provide supporting evidence, as well
as aid in understanding the components of firm life chances. 

A firm’s life chances are a function of each determinant. When placed 
in a promotion model as a function of the determinants of firm failure, 
the firm’s likelihood of failure should affect promotion opportunities. The 
weaker a firm’s life chances, the lower its bargaining power in the em- 
ployment relation and, consequently, the greater the likelihood of em- 
ployee promotion. Demonstrating that there is a direct negative influence 
of firm life chances provides evidence that promotion opportunities are 
inversely related to the strength of the employer’s life chances. Accord- 
ingly, I present my central thesis: 

Main Proposition: The weaker a firm’s life chances, the lower the firm’s
bargaining strength, the higher the likelihood of employee promotion. 

In addition, I will examine the relationship between each determinant 
and the likelihood of associate promotion to partner, controlling for their 
contribution to the firm’s life chances. While not the focus of my central 
thesis, it is possible that firm size, age, status, scope, and density influence 
the bargaining strength of employers beyond their contribution to the 
firm’s likelihood of failure. 


DATA AND METHOD


Context


I test my proposition using longitudinal data on Silicon Valley law firms 
and cohorts of attorneys that work within the firms. Law firms are an 
appropriate context for testing my proposition for three related but distinct 
reasons. First, promotion to partner is clearly a substantial increase in 
both the associate’s pecuniary and nonpecuniary rewards. By using law 
firms, one avoids employment contexts in which pay and promotions are 
decoupled. In essence, the relationship between firm strength and ad- 
vancement is less “noisy” than other work arrangements. Second, law 
firms allow a clear distinction between the firm and its employees. The 
firm is the set of partners, whereas the employees are the associates. In 
many contexts, this distinction is less clear, and in some cases, it is con- 
tinuous rather than qualitative. Third, law firms emphasize promotion 
from within. Wholey (1985) notes that candidates for promotion to partner 
are much more likely to be those hired initially as first-year associates 
than a lateral hire from another law firm. Therefore, law firms are a 
sensible case for an initial test of my proposition. I use the period from 
1946 to 1996 to observe firm failures and individual promotions. Given 
that the emergence of Silicon Valley is attributed to events that occurred 
after World War II (Saxenian 1981), starting the observation period in
1946 allows measurement of organizational and career processes as Silicon
Valley emerges and begins to mature. 


Santa Clara County and the Emergence of Silicon Valley: A Synopsis 


While the geographic region nicknamed Silicon Valley is now known as 
a center for high-tech research and entrepreneurial activity, the pre-World
War II characterization of the region (Santa Clara County) was that of
fertile agricultural soil (Saxenian 1981). World War II was the turning
point for the county’s shift to a technological economy. The war served 
as a catalyst for the creation of many organizations, such as Hewlett 
Packard. Eventually, through the founding of technological firms by Stan- 
ford graduates and spin-offs from existing firms, a critical mass of elec- 
tronics-based firms arose, facilitating the emergence of a semiconductor 
industry. A steady growth in the success and prominence of these firms 
eventually led to the term “Silicon Valley,” coined around 1971 by an 
electronics trade publication journalist. 

It is within this context that a niche for lawyers and their firms opened 
and expanded. Silicon Valley has an abundance of technologically oriented 
corporate clients for law firms. While not every Silicon Valley law firm 
seeks and serves high-tech corporate clients, a large part of the legal 
community is directly or indirectly dependent upon the success of the 
technologically oriented industries. In addition to corporate clients, Silicon 
Valley has a large supply of well-educated individual clients, with rela- 
tively high incomes (Saxenian 1981). These clients require their own legal 
representation and advice. 


Attorney Promotion to Partner 


In general, law firms have two prominent positions: associates and part- 
ners. Associates are the employees of the firm entering directly out of law 
school or after a one-year judicial clerkship. They generally work under 
the partners of the law firm, who leverage the work of associates.⁷ As-
sociates are considered for promotion to partnership after a period during
which they work under the supervision of the partners, receive training, 
and exercise increasing responsibility (Smigel 1969; Nelson 1988; Galanter 
and Palay 1991). Partners are the firm’s owners and share in the firm’s 
profits. Partnership is thought of as a deferred bonus that provides an 
associate the incentive to maintain hard work and commitment to the 
firm (Galanter and Palay 1991). This internal promotion scheme acts as 


7 Leveraging is the process of deriving a surplus generated from hiring associates at 
a given salary and billing them out to the firm’s clients at multiples of that salary.
an incentive system or motivating device used by employers (Stinchcombe
1974). The transition from associate to partner is accompanied by in- 
creased attainment, as well as a new functional role within the firm. 
Partnership encompasses several new tasks and responsibilities, empha-
sizing a different set of skills in firm management that transcends the
traditional tasks involved with practicing law as an associate (Nelson 
1988). 

Clients prefer and tend to have strong relations with a firm’s partners 
over its associates, giving the partners a source of bargaining power (Smi- 
gel 1969, pp. 156-60). However, Smigel (1969) distinguishes between large 
and small firms by noting that small firm associates are often encouraged 
to establish strong ties with clients. In the large firms studied, associate 
contact is deemphasized (p. 201). This observation is consistent with other 
accounts given by attorneys distinguishing between small and large law 
firms (Pyszka and Clark 1997). To the extent that firm size is consistent 
with firm strength, Smigel’s (1969) observation suggests that one way in 
which a firm’s lower bargaining power is manifested is through policies 
that dictate an associate’s access to sources of revenue. 

The decision to promote to partner typically involves both objective 
and subjective measures. Objectively, partners consider an associate’s 
number of billable hours and papers or briefs written. In addition, it is 
typical for partners to consider both the state of the legal market and the 
firm’s economic situation when making promotion decisions (Gilson and 
Mnookin 1989; Hildebrandt 1989; Greene 1999). Subjectively, partners 
consider an associate’s untested ability to attract, retain, and enhance the 
firm’s client relations. Since many of these factors are exogenous to as- 
sociates, it is difficult for associates to predict their promotion chances or 
verify that the firm is faithfully executing the up-and-out contract (Kahn 
and Huberman 1988; Gilson and Mnookin 1989; Kordana 1995; Schmidt 
1998). 


The Significance of Law Firm Failure 


Hillman (1990) and others (Bellows 1987; Hildebrandt 1989; and Inter- 
national Financial Law Review 1994, 1995) portray law firm failures as 
traumatic events within the legal profession. The termination of a law 
firm partnership is sufficiently disruptive that it is inconceivable that a 
set of attorneys would enter into a partnership with the objective of 
establishing anything short of a successful, lasting law firm. The concept 
of divorce in Western culture provides a convenient metaphor, since the 
failure of a law firm has interdependent social, legal, and financial 
components. 

In its simplest form, law firm failure reflects the point at which the 
social, legal, and financial understanding of the firm’s identity funda-
mentally changes or ceases to exist. Socially, when law partners withdraw, 
there is little contact with the remaining partners. Legally, the discontin- 
uance of a law firm partnership is referred to as a dissolution, which holds 
a specific meaning outlined in the Uniform Partnership Act as “the change 
in the relation of the partners caused by any partner ceasing to be as- 
sociated in the carrying on . . . of the business.” Accordingly, anytime a 
partner leaves a law partnership, the original partnership is dissolved. 
Upon the withdrawal, the partners complete the business of the partner- 
ship, liquidate the assets, settle the liabilities, and distribute any residual 
amounts among themselves (Hillman 1990). Liabilities incurred in com- 
pleting the business are the responsibility of the original partners. Com- 
pletion of this process terminates the original partnership. 


Data 


The data were collected from annual Martindale-Hubbell directories using 
the 1945 edition through the 1996 edition, for law firms and attorneys in 
Silicon Valley, California (Martindale-Hubbell 1945-96). Galanter and 
Palay (1991) note that while being the most comprehensive legal direc- 
tories available, the Martindale-Hubbell directories only list those attor- 
neys that have passed the bar examination. As a result, associates that 
pass the bar after the firm is surveyed must wait another year before 
being included in the directory. This creates a potential error in the mea- 
surement and estimation of an associate’s tenure, a control variable in 
the promotion model. In addition, like all directories of organizations, the
Martindale-Hubbell Law Directory has an unknown probability of excluding
very small and short-lived organizations (< 1 year). To the extent
that this occurs, one would expect this bias to make the task of demon- 
strating a liability of newness (Freeman, Carroll, and Hannan 1983) more 
difficult. 

Silicon Valley is a relatively self-contained market for legal services in 
Northern California (Suchman 1993; Escher and Morze 1998), making its 
choice as a context appropriate.’ In this study, Silicon Valley comprises 
the following ten cities: Redwood City, Menlo Park, Palo Alto, Los Altos, 
Mountain View, Sunnyvale, Santa Clara, Cupertino, Campbell, and San 
Jose. In 1996, Silicon Valley hosted 209 law partnerships employing 2,375 
active attorneys. The directories list attorney and law firm characteristics 


*Suchman’s (1993) extensive interviews with attorneys in Northern California and
Escher and Morze’s (1998) report on law firm penetration into Silicon
Valley both argue that, with respect to client base, Silicon Valley is a bounded region
of competition with great barriers to entry, even for firms as close as San Francisco.
and, when followed across time, provide information on the life chances
of law firms and the mobility of the attorneys that work within them. 
This collection generated data on 512 law partnerships across the 50 years, 
which comprises every firm listed with more than one active attorney 
(i.e., solo practitioners are excluded).’ 

In addition to coding firm-level data, the Martindale-Hubbell direc- 
tories were used to collect data on 12 cohorts of attorneys in Silicon Valley 
firms at four-year intervals, beginning in 1946 and ending in 1990. Only 
those attorneys hired directly out of law school or after a one-year clerk- 
ship are included. Within a cohort, the attorney’s demographic charac- 
teristics and background, employer, and position in the firm are coded 
annually until they exit the labor market, or become right-censored at 
1996. This generated a total of 443 attorneys. Supplementary data on law 
firm practice areas and employee demographics were collected from the
National Association for Law Placement Directory of Legal Employers
(1991-96) and the Legal Directory of the Santa Clara County Bar Association
(1967-96).


Operationalizations and Measures 
Dependent Variables 


Firm failure.—The first year that a firm appears in the Martindale-Hubbell
Law Directory is considered the year of founding. The year that
a firm is no longer listed or has only one attorney is considered the year
of failure. A dichotomous variable for failure is coded “1” for the respective
year.

Promotion.—While there are minor gradations between associate and 
partner (e.g., junior associate, senior associate, etc.) the meaningful pro- 
motion is that of associate to partner. The Martindale-Hubbell Law Di- 
rectory distinguishes the partners and associates in a firm in each year of 
the directory. The first year that an associate is listed as a partner is the 


* In 1996, 4% of the attorneys in Silicon Valley were solo practitioners (88 of 2,376). 
© While law firm mergers are popular along the east coast of the United States (especially
in New York), I observed relatively few in Silicon Valley. This is most likely
due to the fact that mergers typically occur in more mature markets (such as New
York’s legal service market), but not in markets experiencing rapid growth in their
adolescence, as is the case with the Silicon Valley legal market. In total, I observed only
nine mergers in the 50-year observation period. For each merger, one firm was coded
as a failure, while the other was considered as being in continued existence. I used
two decision rules to determine firm failure: (1) the firm whose name partners are listed
first is considered to be the surviving firm; (2) the firm that retains its address is
considered to be the surviving firm. In each of the nine mergers, both of the conditions
were met.
year of promotion. A dichotomous variable for promotion is coded “1” in
the year that the associate is promoted. 
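
The event coding for promotion can be illustrated with a minimal sketch. It assumes one record per attorney-year, tenure counted from zero in the year of hire, and observation ending at promotion, exit, or the 1996 censoring year, whichever comes first. The function name, argument names, and record layout are hypothetical and are not taken from the original coding scheme.

```python
def promotion_person_years(hire_year, promotion_year=None, exit_year=None,
                           censor_year=1996):
    """Return one (year, tenure, promoted) record per observed year.

    Hypothetical layout: `promoted` is 1 only in the year of promotion,
    and the spell ends at promotion, exit, or right-censoring.
    """
    candidates = [y for y in (promotion_year, exit_year, censor_year)
                  if y is not None]
    last_year = min(candidates)
    if last_year < hire_year:
        raise ValueError("spell ends before the year of hire")

    records = []
    for year in range(hire_year, last_year + 1):
        promoted = 1 if year == promotion_year else 0
        records.append((year, year - hire_year, promoted))
    return records
```

An associate hired in 1986 and first listed as a partner in 1992 contributes seven person-years, with the indicator equal to 1 only in the last one. An associate hired in 1990 and never promoted contributes seven person-years of zeros and is right-censored in 1996.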


Independent Variables 


Firm characteristics.—Each firm’s age is simply recorded as the number
of years since the firm first appeared in the Martindale-Hubbell Law
Directory biographical section. Firm size is operationalized in two ways:
(1) the total number of full-time partners and (2) the total number of full-time
associates. This distinction is important given the law firm context.
The number of partners captures ownership size, whereas the number of
associates captures the number of employees. Given this distinction, it is
clear that employee size more easily captures the concept of organizational
size, as considered in organizational sociology (Scott 1992). In contrast,
there is little to guide our understanding of ownership size in the promotion
context. Therefore, it will be important to examine each operationalization
of size independently. Given that the distribution of firm
sizes is log-normal (skewed to reflect a few relatively large firms), the log 
of each size variable is coded.” 

It is convention within the study of law firms to consider the ratio of 
associates to partners. Profits primarily come from the firm's ability to 
leverage the skills of the partners with the efforts of the associates. As a 
measure of firm leverage, the ratio of associates to partners serves as a 
proxy for firm performance and a mechanism that drives promotion rates 
(Galanter and Palay 1991). Firms with greater leverage are considered to 
be stronger firms, with lower promotion rates. 

A well-specified model that includes the ratio of associates to partners 
as an interaction effect requires the inclusion of the inverse of the number 
of partners instead of the number of partners (Bradshaw and Radbill 
1987).2 There is no theoretical justification for including the inverse of 
the number of partners in a model of employee promotion. Rather, using 
partnership and associate size as separate components, instead of the ratio 
of associates to partners, provides a theoretically consistent model with 
easily interpretable results. This is especially the case since, in this data, 


I ran each model with size logged and unlogged. The models with logged size had
a statistically better fit (1), but were otherwise unchanged. 

u In other words, to include the ratio (A/P), a properly specified model should also
include A and 1/P.
the ratio is highly correlated to the number of associates.” Examining the
effect of the number of associates, while controlling for the number of 
partners, is preferred to inserting the associate to partner ratio as an 
interaction effect. 
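
The specification argument can be written out as a sketch. Because the ratio is the product of A and 1/P, treating it as an interaction term calls for both components as main effects. The symbols r(t) and the β coefficients are illustrative notation, with A the number of associates and P the number of partners; they are not taken from the tables.

```latex
\begin{align}
\frac{A}{P} &= A \cdot \frac{1}{P}, \\
\log r(t) &= \beta_0 + \beta_1 A + \beta_2 \frac{1}{P}
             + \beta_3 \left( A \cdot \frac{1}{P} \right) + \dots
\end{align}
```

The models reported here instead enter the numbers of partners and associates as separate components and omit the product term.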

Firm scope is operationalized as a continuous variable from 0 to 1. It 
captures the number of law practice areas that a law firm reports in the 
Martindale-Hubbell Law Directory in a particular year, divided by the 
total number of practice areas that are reported across all firms in that 
year. For example, if a firm in a certain year practices law in eight areas, 
but a maximum of 16 areas of law are possible, the firm's scope is half 
of the total market. Since the number of practice areas grows from eight 
possible areas in 1946 to 79 possible areas in 1996, this measurement 
allows a relative measurement of firm scope. Given that the Silicon Valley 
legal community is in its early stages and has yet to reach maturity, it is 
difficult to directly test Carroll’s (1985) predictions. However, if Carroll 
(1985) is correct, midrange firms should be facing higher failure rates as 
they become the victims of generalists’ competitive pressures. Accordingly, 
I will test for this indicator by including a quadratic term for firm scope. 
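
As a minimal sketch of the scope measure and the quadratic term, assuming only the definition given above: the function names are hypothetical, and the inputs are the number of practice areas a firm reports and the number reported across all firms in the same year.

```python
def firm_scope(areas_reported: int, areas_possible: int) -> float:
    """Share of the year's possible practice areas that a firm reports."""
    if areas_possible <= 0:
        raise ValueError("areas_possible must be positive")
    if not 0 <= areas_reported <= areas_possible:
        raise ValueError("areas_reported must lie between 0 and areas_possible")
    return areas_reported / areas_possible


def scope_terms(scope: float) -> tuple[float, float]:
    """Linear and squared scope terms, as entered in a quadratic specification."""
    return scope, scope ** 2
```

With the example from the text, a firm practicing in eight of 16 possible areas has a scope of 0.5, so the two terms are 0.5 and 0.25. Dividing by the yearly total keeps the measure comparable as the number of possible areas grows over time.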

Population density.—Population density is calculated as the log of the 
total number of law firms in Silicon Valley for a given year. For the 
promotion to partner models, I also calculated the log of the number of 
law firms in Silicon Valley that hired at least one associate. Since not 
every firm hires associates in a given year, the hiring density may provide 
a better indicator of competition for labor. 

Firm status.—Firm status is operationalized as the proportion of the
firm’s attorneys that have degrees from one of six elite law schools (Yale,
Harvard, Columbia, New York, Stanford, and Chicago).
The greater the proportion, the greater the firm’s social status. These law
schools were selected by examining the legal affiliations of individuals in
national and regional institutions (political and economic) since World
War II. An assortment of methods was used to acquire this information:
recording the affiliations of individuals in Who’s Who entries, National 
Law Journal lists of prominent attorneys, local politicians and judges, 
and so on. 


In my data, the correlation between the associate measure and the ratio of associates
to partners is 0.74, while the correlation between the partner measure and the ratio
is -0.06. In addition, I ran models with both size components and the ratio of
associates to partners included as an interaction effect. The ratio measure was not
significant and did not improve model fit.
Control Variables


There are several alternative explanations or potential sources of unobserved
heterogeneity that can be addressed with the use of control variables.
Prominent among these variables are whether the firm is one of
many offices, whether the areas of law that a firm practices
serve individual clients or corporate clients, the rate of partner
growth and turnover, the particular era or time period in question, and
the individual characteristics of the employee.

Branch or single office.—A dummy variable is coded “1” if the focal 
firm is one of many offices. It is likely that multiple (branch) office firms 
affect, and are affected by, the competitive landscape differently than 
single office firms. 

Individual versus corporate clients.—Heinz and Laumann (1982) in
their study of the Chicago bar found that one of the most distinguishing 
dimensions along which lawyer competition and interaction takes place 
is whether the types of law that a firm practices address the needs of 
individual clients as well as corporate clients. Firms that have individuals 
as customers compose a social system that is distinguishable from those 
firms that focus solely on corporate customers. To capture this effect, a 
variable is coded “1” if the firm practices any areas of law that pertain 
to individual clients (e.g., family law, probate and estate, etc.). Otherwise, 
the variable is coded “0.” 

A representative listing of each firm’s clients would also be informative 
to this study. Unfortunately, a complete listing of clients for each firm in 
Silicon Valley from 1946 to 1996 does not exist. Moreover, the subset of 
firms that provides a representative list of their clients does not do so 
reliably. Fortunately, much of the variance of firm clientele tends to be 
captured by the firm’s size (larger firms have larger corporate clients) and 
the individual client dummy variable used here (Smigel 1969; Heinz and 
Laumann 1982). 

Growth and partner turnover.—Proportional growth of the number of
partners is coded to capture the rate with which the firm is creating and
filling new vacancies for partnership. To capture promotions due to partners
leaving the firm and creating vacancies to be filled, a variable is
coded to capture the proportion of partners who left in the past year. This
variable also controls for the effect of firm failure due to partners abandoning
the firm. Together, these variables address two aspects of the
vacancy chain argument: vacancies are created by growth, and vacancies
are created by the departure of senior members.

Period effects.—Four dummy variables are coded to capture activity
before 1970, between 1970 and 1986, between 1987 and 1991, and between
1992 and 1996. Before 1970, there was little change in the social and
economic landscape of Silicon Valley. However, beginning around 1970,
Silicon Valley experienced a period of rapid growth, lasting until the stock
market drop and subsequent recession (1987-91). Finally, the years 1992
to 1996 are coded to capture the beginning of the postrecession era. These
dummy variables serve to capture some of the unobserved heterogeneity
that may be due to the time period in which the firm or attorney exists.

Individual characteristics.—Organizational demography and career
mobility researchers have identified individual-level factors that influence
promotion chances. Here, I include the attorney’s age, plus six dummy
variables. Age is obtained from each attorney’s date of birth, which is given
in the Martindale-Hubbell Law Directory. Age has competing effects. New
employees who are younger than the average new hire are often assumed to
be more qualified (Rosenbaum 1984). On the other hand, age also reflects
work force experience, which would enhance one’s promotion chances.

A dummy variable is included to capture whether the attorney is a
woman. If the new hire attended an elite law school (Yale, Harvard,
Stanford, Columbia, New York, or Chicago), a dummy variable is
coded “1.” Finally, there are three indicators of associate quality: (1) graduated
magna or summa cum laude, (2) was a member of an honors
society, and (3) served as a law clerk for a judge.


Statistical Methods 


Before testing my central thesis, I model the effect of the firm-level cov- 
ariates on the likelihood of law firm failure. In this way, we can better 
interpret their respective effects on an employee’s likelihood of promotion. 
The model of firm failure will serve as a basis for understanding the 
model of associate promotion to partner. Accordingly, the first model es- 
timates the likelihood of firm failure, as influenced by population density, 
firm size, firm age, firm scope, firm status, and a set of control variables. 
I follow the model for firm failure with a second set of models addressing 
associate promotion rates due to competition, firm size, firm age, firm 
scope, and firm status, with a host of individual and firm-level control 
variables. The last set of models tests the direct effect of firm life chances 
on the likelihood of promotion to partner. 

Both firm failure and employee promotion are modeled using a hazard
rate model, in which the rate is analogous to the conditional probability of an event
at time t given that the event has not yet occurred. The rate can vary as
a function of the time that a firm has existed or an associate has been
employed. In either case, use of the instantaneous hazard rate allows for
estimation of the chances of the event occurring while controlling for age
dependence (Tuma and Hannan 1984).

I model the hazard rate of firm failure and promotion with piecewise 
constant exponential models. This approach allows the inclusion of firm 
age (in the failure rate model) and attorney tenure (in the promotion rate 
model), while avoiding misspecification that accompanies other functional 
forms. A piecewise constant exponential model splits the time axis into 
time periods. In the model for firm failure, the time axis is split according 
to firm age, whereas in the model for promotions, the axis is split according 
to employee tenure. It assumes that transition rates are constant in each 
of these intervals. However, base rates vary freely across time periods. 
The assumption is that the period-specific baseline rate can vary across 
time periods, but the covariates have the same (proportional) effects (Roh- 
wer 1993). 
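
A minimal sketch of how a piecewise constant exponential model turns period-specific baseline rates into survival probabilities. The cut points, baseline rates, and function names below are hypothetical illustrations, not estimates from table 1. Covariates are assumed to enter through a single linear predictor that shifts every piece by the same proportional factor.

```python
import math

# Hypothetical time pieces: [0, 3), [3, 6), and 6 years or more.
EXAMPLE_CUTS = [0.0, 3.0, 6.0]
# Hypothetical baseline rate for each piece (events per year at risk).
EXAMPLE_RATES = [0.10, 0.05, 0.02]


def cumulative_hazard(t, cuts, base_rates, linear_predictor=0.0):
    """Integrate a piecewise constant rate from 0 to t.

    `cuts` holds the start of each time piece (the first must be 0);
    `base_rates` holds one baseline rate per piece. The covariate effect
    exp(linear_predictor) multiplies the rate in every piece.
    """
    if not cuts or len(cuts) != len(base_rates) or cuts[0] != 0.0:
        raise ValueError("cuts must start at 0 and match base_rates in length")
    if t < 0:
        raise ValueError("t must be non-negative")

    multiplier = math.exp(linear_predictor)
    total = 0.0
    for i, start in enumerate(cuts):
        if t <= start:
            break
        end = cuts[i + 1] if i + 1 < len(cuts) else math.inf
        total += base_rates[i] * multiplier * (min(t, end) - start)
    return total


def survival(t, cuts, base_rates, linear_predictor=0.0):
    """Probability that the event has not occurred by time t."""
    return math.exp(-cumulative_hazard(t, cuts, base_rates, linear_predictor))
```

With these illustrative values, the cumulative hazard at five years is 0.10 × 3 + 0.05 × 2 = 0.40. A covariate effect that doubles the rate doubles that figure in every piece, which is the proportionality assumption described above.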

The resulting model gives an age dependent constant (a “y-intercept”) 
for each time piece of the model. There are different strategies for choosing 
the appropriate time periods. To estimate this model, it is important to
choose time segments that are long enough to contain a meaningful number
of events (i.e., firm failures). For example, a firm age segment from
20 to 22 years old could not be estimated if no firms fail within this age 
window. Some theoretical predictions may require that the time periods 
take on particular values. In my model, I assume no a priori knowledge 
of age dependence. The null is an exponential hazard rate model without 
time periods (where it is assumed that rates are time invariant). From 
this baseline, I only include those time periods that led to an improvement 
in the chi-square fit of the failure rate model (P < 0.05). As a result, the 
y-intercepts included in the model are statistically significant with respect 
to a chi-squared model improvement test. 
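
The selection rule for time pieces can be sketched as a likelihood-ratio test. This is a simplified illustration without covariates, not the procedure as actually run: it compares a single constant rate against two piece-specific rates, using only the event counts and time at risk in each candidate piece. All function names are hypothetical. The p-value uses the identity that the upper tail of a chi-square distribution with one degree of freedom equals erfc(sqrt(x / 2)).

```python
import math


def max_loglik(events, exposure):
    """Maximized exponential log-likelihood for one time piece.

    With d events over E units of time at risk, the rate estimate is d / E
    and the maximized log-likelihood is d * log(d / E) - d.
    """
    if exposure <= 0:
        raise ValueError("exposure must be positive")
    if events == 0:
        return 0.0  # estimated rate is 0, so the log-likelihood is 0
    return events * math.log(events / exposure) - events


def lr_test_one_split(piece_a, piece_b):
    """Test whether splitting one constant-rate period in two improves fit.

    Each argument is an (events, exposure) pair. Returns the chi-square
    statistic (1 degree of freedom) and its p-value.
    """
    (d1, e1), (d2, e2) = piece_a, piece_b
    ll_null = max_loglik(d1 + d2, e1 + e2)
    ll_split = max_loglik(d1, e1) + max_loglik(d2, e2)
    statistic = max(2.0 * (ll_split - ll_null), 0.0)  # guard rounding error
    p_value = math.erfc(math.sqrt(statistic / 2.0))
    return statistic, p_value
```

Under this sketch, a candidate split would be retained only when the p-value falls below 0.05. A piece with no events contributes nothing to the likelihood beyond a zero rate, which is why segments need enough failures to be estimated meaningfully.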


RESULTS 

Descriptive Statistics 

To facilitate the interpretation of the models, the appendix lists descriptive 
statistics (minimums, maximums, means, and standard deviations) for 
firm and person-year variables in both the firm failure and promotion 
models. Figure 1 is a graph of the population density of Silicon Valley 
law firms across the 50-year observation period. While only six firms exist 
in 1946 (I was able to code the proper founding dates for these firms), 
209 firms exist in 1996. Most of the growth takes place from the late
1960s to 1986, rising with a relatively constant slope (a net gain of ap- 
proximately eight firms per year). This rise continues until 1987, the year 
of a historic stock market drop and ensuing recession. At that point, there 
Fig. 1.—Population density of Silicon Valley law firms, 1946-96.


is a drop in the number of law firms, which recovers after 1991, continuing 
on the same slope until the end of the observation period in 1996. Figure 
2 shows the number of law firm foundings and failures over the same 
time period. Again, most of the activity begins after 1966 and increases 
until 1986. In 1987, it appears that the drop in population is due to a 
large reduction in the number of foundings combined with a slight increase 
in the number of failures. The number of foundings recovers by 1992, 
however, and continues to increase. 

Figure 3 is a bar graph showing a count of the number of new hires 
sampled in each cohort (the striped bar). The second (dark colored) bar 
is a count of the subset of new hires that were eventually promoted. Most 
of the new hires sampled represent cohorts from 1966 forward. A relatively 
large fraction of those hired in 1970 were eventually promoted to partner. 
The slopes of the new hires and the subset eventually promoted both 
increase, albeit at drastically different rates. The number of new hires 
increases at a much higher rate than the subset of those eventually pro- 
moted. The fact that the 1990 cohort is right censored implies that the 
subset of new hires promoted is incomplete. This figure does not include 
any new hires promoted after 1996. 

Figure 4 is a graph reporting the number of years to promotion for the 
87 associates that were promoted to partner. While it is clear that most 
of the promotions occur in the fifth to the eighth year, 25.3% of the 
associates were promoted in less than 5 years. It appears that there are 
two peaks. The first peak reveals that some attorneys are promoted as 
early as their second and third years of tenure. The second peak occurs 
Fig. 2.—Foundings and failures of Silicon Valley law firms, 1946-96. Solid line represents
foundings; broken line represents failures. 


during the sixth and seventh year—consistent with legal press reports and 
past studies (Nelson 1988; Galanter and Palay 1991).* 


Associate Promotion Rates as a Function of the Firm Failure Rate 


Table 1 presents the results of the firm failure model estimated using 
transition data analysis (Rohwer 1993). The likelihood of firm failure 
increases with increasing population density (P < 0.01) and decreasing 
age (P < 0.05). Both measures of law firm size reflect that larger firms 
have lower failure rates. Supporting Carroll and Swaminathan’s (2000; 
Carroll 1985) findings, the effect of firm scope is curvilinear (P < 0.05). 
Law firms that practice law in 15% of the possible areas have the highest 
likelihood of failure. The estimated effect of firm status is statistically significant


14 In an effort to understand whether early promotions occurred across cohorts, and 
varied with the size or age of the firm, I examined the raw data. It appears that the 
early promotions are more likely to occur in the earlier cohorts. However, early pro- 
motions are distributed across cohorts, with some occurring in the 1990 cohort. There 
does not appear to be a statistically strong relationship with respect to firm size or 
age. Spurr (1987), using data from 14 large New York law firms from 1969 to 1973, 
found that promotions ranged from 3 years to over 10 years, with fewer early pro- 
motions than appear in my data. He found that law firms that handled bigger claims 
waited longer to promote. 

4 I also tested for density dependence (Hannan and Carroll 1992) but found no effect. 
A priori, there is no reason to expect that Silicon Valley law firms from 1946 to 1996 
would be subject to legitimation effects. By 1946, law firms were in existence and part 
of the cognitive awareness in northern California, and thus legitimated since at least 
the late-1800s (Hubbell Legal Directory 1889). 
Fig. 3.—Associates hired and promoted by cohort, Silicon Valley law firms, 1946-96.
Solid bars represent promotions; shaded bars represent new hires.


(P < 0.01), demonstrating the lower likelihood of failure attributed to high 
status (Podolny 1993; Park 1997). 

Therefore, low competition, older age, larger size, and higher status all 
lower the likelihood of failure. The effect of scope is curvilinear but clearly 
shows that generalists have stronger life chances. In many ways, the 
ecological competition of law firms resembles the contexts of breweries, 
newspapers, hotels, and other contexts that ecologists have studied. Next, 
I test the relationship between these indicators of firm strength and the 
likelihood of associate promotions. 

The models for predicting the likelihood of promotion are presented in 
table 2. If my thesis is correct, the greater the probability of law firm 
failure, the greater the likelihood of associate promotion to partner. I test 
this thesis by examining the effect of the probability of failure independently
of the population and firm-level covariates, and with these covariates
in the model as well. Including the individual-level covariates in
the full model (models 4 and 6) allows the observation of the direct effects
of each variable on the promotion rate, as well as their indirect contribution
to the likelihood of failure. Furthermore, it ensures that the probability
of failure cannot be explained away by including the population
and firm-level variables. I test the proposed relationship between firm life 
chances and associate promotion chances directly by estimating the effect 
of the predicted probability of firm failure on the likelihood of promotion. 

In model 1 of table 2, the y-intercepts for tenure verify that promotion 
chances improve with tenure. Consistent with past research on the pro- 
motion chances by sex, women are less likely to be promoted (P < 0.05). 
Older new hires (P < 0.10), members of honor societies (P < 0.05), and 
Fig. 4.—Graph of the number of years until promotion to partner (1946-96, N = 87).


elite law school graduates (P < 0.05) have higher rates of promotion. These 
models also include controls for the cohort that the attorney was sampled 
from. The cohort controls indicate that attorneys hired in 1970 have the
highest likelihood of promotion with respect to the reference category 
(1946-66). This cohort coincides with the emergence of Silicon Valley. 

Except for population density, model 1 shows that the effect of each covariate
is in the expected direction and statistically significant. While not statistically
significant, population density decreases the likelihood of promotion.
Model 2, which substitutes the logged density of firms that hired
in the immediately preceding cohort, provides a more labor market specific measure
of competition. Showing evidence of oligopolistic power, model 2 reveals
that for those cohorts in which fewer than eight firms hired all of the
associates, there is a negative relation between hiring density and promotion
chances. When hiring density is greater than eight (the average
hiring density is 46.57), there is a positive relation between hiring density 
and promotion chances.'* 

Firm age is represented as a set of dummy variables that match the 
age y-intercepts of the firm failure model (table 1). The reference category 
is firms 31 years and older. Strong support would require that each co- 
efficient be statistically significant, with descending magnitude. In other 
words, the coefficient for firms 0-2 years old should be the largest and 
significant, followed by the coefficient for firms 3-5 years, and so on. The 
model shows a strong effect for firm age. All age dummy variables are 


I ran a similar model with just the linear term for hiring density. The term was
marginally significant, with little change in model fit. 
significant (P < 0.05), with each dummy variable having a magnitude less
than the previous dummy.” 

The two measures of size give opposite findings. Ownership size, given
as the logged number of partners, increases the likelihood of promotion. 
The more traditional measure of organizational size, employee size, de- 
creases the likelihood of promotion. Both measures are significant at the 
P < 0.05 level. Therefore, there is support for employee size, but not 
ownership size. 

Promotion is less likely in generalist firms, though the effect is not statistically
significant once competition is appropriately specified. While not shown here,
a quadratic representation of generalism was not significant. The fact that
promotion is more likely in firms that have individual clients supports
Heinz and Laumann’s (1982) argument that the higher status and more
selective members of the legal profession are the attorneys and firms that
practice law for corporate clients (also see Abbott 1988). Finally, high
status firms are less likely to promote associates to partners, but as
with scope, this effect is not significant in model 2.

Models 3-7 test the central proposition, that there is a negative relationship
between a firm’s life chances and the promotion chances of its
employees. Model 3 introduces the predicted probability of firm failure 
for the employer, as a predictor of promotion. Using each of the coefficients 
from the full firm failure model (table 1), I generate hazard rates of failure 
of each of the firms in the promotion sample, according to the firm and 
population characteristics each firm faces. Thus, for each person-year, 
there exists a hazard rate of failure for the focal attorney’s employer. The 
hazard rate is transformed to the predicted probability of firm failure 
using the following formulation, 


Predicted probability of firm failure = $1 - \exp(-u)$,


where $u$ is the hazard rate of firm failure. This transformation is used
by scholars that have similar approaches (Hannan and Carroll 1992; Carroll
and Harrison 1994; and Lee and Harrison 1998). Interestingly, model
3 reveals that there is no statistically significant linear relationship be- 
tween the likelihood of firm failure and associate promotion to partner. 
This is surprising given the findings in models 1 and 2 that strongly 
support my proposition. 
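
The transformation from hazard rate to predicted probability can be sketched in a few lines. This assumes the conventional conversion of a constant annual hazard rate u into a one-year event probability, 1 - exp(-u). The function name is hypothetical.

```python
import math


def failure_probability(hazard_rate: float) -> float:
    """Convert a constant annual hazard rate u into 1 - exp(-u)."""
    if hazard_rate < 0:
        raise ValueError("hazard rate must be non-negative")
    # -expm1(-u) equals 1 - exp(-u) but keeps precision for small rates.
    return -math.expm1(-hazard_rate)
```

For small rates the probability is close to the rate itself, and it approaches 1 as the rate grows, so the transformation keeps every person-year's predicted value between 0 and 1.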

An examination of the distribution of the probability reveals an important
pattern: more than 98% of the firms have a probability of
failure less than 0.07, while fewer than 2% have probabilities of failure
between 0.07 and 0.99 (see the appendix for the distribution of probabilities).


” The differences in mean are not necessarily statistically significant. My intention is
to merely draw attention to the qualitative pattern.

It is possible that the relationship between the probability of
failure and promotion chances is different for firms with a very high 
probability of failure. Specifically, the “promotion paradox” may not apply 
to firms that are “near death.” 

There are two ways of verifying my contention that the hypothesis does 
not hold for firms that are near death. First, including a quadratic term 
for the probability of failure should clarify the results and provide a better 
model fit. Specifically, the linear term should be positive and the quadratic 
term negative. Second, one would expect that running the models without 
the 2% of the firms that are the most likely to fail should result in a strong 
linear effect for the predicted probability of failure. I use both as a means 
of verification. 
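
The first check can be written out as a sketch. Assuming the log promotion rate includes the predicted failure probability p and its square, an inverted-U relationship has the form below. The symbols γ₁, γ₂, and p* are illustrative notation and do not come from the tables.

```latex
\begin{align}
\log r(t) &= \dots + \gamma_1 p + \gamma_2 p^2,
  \qquad \gamma_1 > 0,\ \gamma_2 < 0, \\
p^{*} &= -\frac{\gamma_1}{2\gamma_2}.
\end{align}
```

Under these sign conditions the promotion rate rises with p up to the turning point p* and falls beyond it, which is the pattern expected if near death firms are less likely to promote.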

Models 4 and 5 examine the associate to partner promotion rate with 
the quadratic specification. Introducing the quadratic term does clarify the
relationship between firm life chances and employee promotion chances. 
In model 4, both the linear and squared terms have strong statistical 
significance (P < 0.01 and P < 0.05, respectively). Model 5 reveals that 
while attenuated, the linear term remains significant (P < 0.05) after the 
inclusion of the population and firm-level variables. The quadratic term 
is now marginally significant. Interestingly, model 4 also indicates that 
several of the population and firm-level variables have direct effects on 
the probability of promotion, independent of their contribution to the 
likelihood of failure. Strongest among these covariates are the growth 
rate, ownership size, and employee size. The dummy variable for an
individual client focus, while attenuated, remains significant as well. The
effect for firm age largely disappears once a firm’s likelihood of failure is 
controlled for. Independent of the likelihood of failure, firms ages 10-30 
years are the most likely to promote. This result is interesting since many 
of the firms 10-30 years old were founded between the years 1966-86. 
This period, while profitable, was also highly uncertain, with law firms 
pioneering risky fee arrangements with high-tech start-up clientele (Wool- 
ley 1992; Chanen 2000). The hiring density remains unchanged from 
model 1. This is not surprising since the predicted probability of failure 
is calculated using the entire population density, and not the subset mod- 
eled in model 5. 

Models 6 and 7 examine the promotion to partner likelihood after ex- 
cluding the 2% of firms with the greatest likelihood of failure (the near 
death firms). The results are consistent with models 4 and 5. Except for 
near death firms, there is a statistically strong relationship between the 
predicted probability of firm failure and the likelihood of promotion (P 
< 0.01). Therefore, both strategies to validate the near death explanation 
support this modification to my original thesis. 
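
The exclusion step behind models 6 and 7 can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the estimation code behind table 2: the helper name `drop_near_death`, the nearest-rank percentile rule, and the example probabilities are all hypothetical.

```python
import math


def drop_near_death(probabilities, top_share=0.02):
    """Return (kept, cutoff) after trimming the highest-risk cases.

    The cutoff is the nearest-rank (1 - top_share) percentile of the
    predicted failure probabilities. Hypothetical helper for illustration.
    """
    if not probabilities:
        raise ValueError("need at least one predicted probability")
    ordered = sorted(probabilities)
    # Nearest-rank percentile: the smallest value with at least
    # (1 - top_share) of the observations at or below it. The inner
    # round() guards against floating-point noise before ceil().
    rank = math.ceil(round((1.0 - top_share) * len(ordered), 9))
    cutoff = ordered[rank - 1]
    kept = [p for p in probabilities if p <= cutoff]
    return kept, cutoff


# Made-up predicted probabilities: 98 low-risk cases and two
# "near death" cases with very high predicted failure.
example = [0.001] * 98 + [0.40, 0.95]
kept, cutoff = drop_near_death(example)
print(len(kept), cutoff)
```

In this made-up example the two high-risk cases fall above the cutoff and are dropped, leaving 98 observations on which a purely linear specification could then be estimated.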

Overall, the final model supports the proposition that the greater a 
firm’s risk of failure the more likely it is to promote. However, my thesis
does not hold for firms with the highest probabilities of failure. These 
near death firms are less likely to promote. Therefore, the evidence for 
my central hypothesis requires an important caveat. That is, the greater 
the probability of failure, the greater the likelihood of promotion, except 
in the extreme cases (2%) for which the likelihood of failure suggests that 
the firm is near death. 


DISCUSSION 


Taken together, the results allow us to address the thesis that is central 
to this article. There is strong evidence of a negative relation between 
firm life chances and employee promotion chances. Law firms that are 
young, small, narrow in scope, or lower in status have a higher likelihood 
of failure but are also contexts where the likelihood of promotion is high- 
est. Moreover, there is a clear relationship between a firm’s likelihood of 
failure and the promotion chances of individuals that work within the 
firm. Whether through the decision of firms during the tenure of an as- 
sociate, or the decision of attorneys in their selection of an employer, firms 
with weak life chances have less bargaining power. This lack of bargaining 
power results in better promotion chances for employees. 

It is likely that near death firms lack the power to retain any of their 
employees. At some point, one’s employer is so unattractive that employee 
exit is the most likely outcome. In this case, the likelihood of observing 
a promotion declines quickly as it becomes clearer that promotion will 
have little value. Therefore, while my theory accurately predicts the re- 
lationship between the likelihood of failure and the likelihood of pro- 
motion for 98% of the cases, a caveat is necessary. The promotion paradox 
is relevant in every case except for those firms that are near death. 

My prediction with respect to firm age was supported. Silicon Valley 
law firms suffer from a liability of newness. Younger firms are the most 
likely to fail. This finding is consistent with age dependence in auditing 
firms, another type of professional service firm. Boone, Brocheler, and Carroll
(1998) found that Dutch auditing firms from 1896 to 1970 suffered from 
a liability of newness. Their finding for age dependence is less clear after 
1970, however, when supply and demand regulations were instituted. The 
lack of strength due to its “newness” places a young employer at a dis- 
advantage, resulting in higher promotion rates. A young firm does not 
have access to all sources of labor, nor is it as readily equipped to
replace lost employees as are more established firms. Furthermore, this 
firm is more likely to desire input from new hires concerning the culture 
and direction of the organization than would an older firm. The inclusion 
of the likelihood of failure into the promotion model significantly diminishes
the effect of firm age, giving stronger reason to consider the “liability
of newness” argument with respect to firm bargaining strength. 

Interestingly, the effect for firms between 10 and 30 years old remains 
statistically significant once the probability of failure is included. This 
suggests that there remain aspects of bargaining power not incorporated 
into the law firm’s probability of failure. Given that many of the firms 
in this category were founded during the birth and initial rise of Silicon 
Valley, there are two potential sources of unobserved heterogeneity. First, 
the initial rise in Silicon Valley was characterized by unique fee structures 
pioneered by Silicon Valley law firms. These law firms took equity in a 
young firm in lieu of standard fees. While becoming increasingly more 
common, the strategy is high risk (relying on the success of the new 
venture) and was viewed by many more established firms (e.g. New York 
and Chicago firms) as unethical. This increased risk may have led firms
within the 10-30-year age category to lose bargaining power without 
having that loss reflected in the firm’s likelihood of failure. Second, 10-30 
years of age is the time when the law firm's original founders reach 
retirement age (the average founding age is approximately 42 years old). 
It is possible that the uncertainty surrounding the departure of a firm's 
founders reduces the firm’s bargaining power. Since the founder’s age 
was not included in generating the probability of law firm failure, I may 
have overlooked this source of unobserved heterogeneity. Future research 
should examine those populations in which older firms are more likely to 
fail, to determine whether the promotion paradox is robust with respect 
to firm age.

The findings for organizational size provide fruitful insight. Employee 
size, the more traditional interpretation of organizational size, supports 
the promotion paradox framework. Large firms are more likely to survive, 
and because of their stronger position, they are less likely to promote 
associates to partners. Independent of its contribution to the firm’s like- 
lihood of failure, there is a direct effect for the number of associates. This 
direct effect is in part due to the increased competition between associates 
that occurs for coveted partnership positions (ceteris paribus). Indepen- 
dent of the effect of size on the firm’s likelihood of failure, the greater 
the number of associates, the greater the competition that each associate 
faces, thus lowering the likelihood of promotion. 

Firms with many owners are less likely to survive and are more likely 
to be the contexts for promotions. This result, and its resilience once the 
firm’s probability of failure is included, is consistent with other findings 
in vacancy chain and graded ratio research (Stewman and Konda 1983). 
One difference between my conceptualization and the graded ratio re- 
search, however, is in the underlying mechanism. Stewman and Konda 
(1983) would model the ratio of partners to associates as the firm structure
that modifies the creation of vacancies through growth and attrition (see 
p. 649). I examine the number of partners, associates, attrition, and growth 
independently, emphasizing the number of partners and associates as in- 
dependent indicators of firm strength and the propensity to promote rather 
than factors that modify the creation of vacancies. Despite this difference,
the observed outcome is similar: the more partners a firm has (controlling 
for the number of associates), the greater its propensity to promote. 

One plausible interpretation of this finding involves the cost of pro- 
motion to the original set of partners. Partnership size may reflect the 
cost to each partner when a new partner is promoted. For example, if a 
firm with only two partners promotes an associate to partner, each original
partner’s share of profits decreases by 1/6, or 0.167 (1/2 minus 1/3).
However, if a law firm with 20 partners promotes an associate, each
partner’s share decreases by only about 0.002 (1/20 minus 1/21). Therefore,
the greater the number of partners, the lower the cost to the average 
partner for promoting an associate to share in the profits as a partner. 
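
The arithmetic above generalizes to any partnership size. The following sketch assumes, as the example does, that profits are divided equally among partners; the function name `share_lost_per_partner` is introduced here for illustration only.

```python
from fractions import Fraction


def share_lost_per_partner(partners):
    """Drop in each existing partner's equal profit share after one promotion.

    Assumes an equal split: each of n partners holds 1/n before the
    promotion and 1/(n + 1) afterward.
    """
    if partners < 1:
        raise ValueError("a firm needs at least one partner")
    return Fraction(1, partners) - Fraction(1, partners + 1)


# The two cases discussed in the text: a 2-partner and a 20-partner firm.
for n in (2, 20):
    loss = share_lost_per_partner(n)
    print(n, loss, round(float(loss), 3))
```

Because the loss equals 1/(n(n + 1)), it shrinks roughly with the square of partnership size, which is why the cost of an additional partner falls so quickly as the partnership grows.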

In support of past research, generalists and high-status employers are 
both more likely to survive and less likely to promote (although neither 
is statistically significant in the final model). In the context of law firms, 
a service industry heavily dependent upon human capital, generalism may 
have a different meaning than in past research contexts, such as news- 
papers (Carroll 1985). There are two significant differences. The first (a 
conceptual difference) is that areas of practice within the legal profession 
have a strong component of supply-side social construction. Rich social 
processes are involved in determining the value and boundaries of various 
specialties of law (Heinz and Laumann 1982; Abbott 1988). Second (an 
empirical difference), from 1946 to 1996, the number of distinguishable 
areas of law in Silicon Valley increased from eight to approximately 79. 
Therefore, what it means to be a generalist necessarily changes. While I 
attempted to capture this increased differentiation by operationalizing 
generalism relative to the total number of distinguishable areas in a par- 
ticular year, understanding the proliferation of practice areas requires 
further exploration. In short, future research should examine the role of 
generalism in more detail. 

The findings on firm status support the notion that high-status firms 
have meaningful competitive advantages (Podolny 1993; Park 1997). 
High-status law firms have a lower likelihood of failure. Whereas most 
in the legal press assume that size and status are interchangeable, I
conceptualize and model each factor separately. Indeed, separating these
variables yields insights into law firms that have previously been
overlooked.

Contrary to expectations, the results indicate that, although the effect is
not significant, the greater the population density, the lower the likelihood of promotion.
I expect that the result in model 1 reflects the fact that population density 
does not capture labor market competition as well as product market 
competition. For the promotion to partner model, the hiring density more 
easily captures the competition in the labor market. The finding for the 
hiring density suggests that when only a few firms hire associates, these 
firms possess oligopolistic (or oligopsonistic) power (Blair and Harrison 
1993). In other words, the set of employers may collude with respect to 
their hiring and promotion practices in order to avoid interfirm competition
for labor. Once there are more than eight employers (solving for
the turning point of the linear and quadratic terms), collusion is no longer
possible, and the competition for associates more closely resembles the 
neoclassical case of perfect competition. Thus, the majority of associates 
in my sample have increased promotion chances due to competition be- 
tween firms for labor. 
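
The turning point referred to above follows from setting the slope of the quadratic to zero. Because hiring density enters the models in logged form, the solution has to be exponentiated to recover a count of employers. The sketch below assumes a predictor of the form b1*x + b2*x**2 with x = ln(hiring density); the coefficient values are hypothetical, chosen only so that the result lands near eight, and are not the estimates reported in table 2.

```python
import math


def turning_point_count(b_linear, b_quadratic):
    """Number of hiring firms at which a quadratic in ln(count) turns.

    For the predictor b_linear * x + b_quadratic * x**2, the slope is
    zero at x* = -b_linear / (2 * b_quadratic); exp(x*) converts the
    logged value back to a count of firms.
    """
    if b_quadratic == 0:
        raise ValueError("no turning point without a quadratic term")
    x_star = -b_linear / (2.0 * b_quadratic)
    return math.exp(x_star)


# Hypothetical coefficients (not taken from the paper's tables).
b1, b2 = -2.08, 0.5
print(round(turning_point_count(b1, b2), 1))
```

With a negative linear and a positive quadratic coefficient, as assumed here, promotion chances fall while the set of hiring firms is small and rise once it grows past the turning point.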

The negative relation between law firm strength and associate pro- 
motion chances remains when controlling for demographics (age, race, 
and sex), human capital (elite law school, honor society, etc.), being one 
of many law offices, firm growth, firm attrition, and whether the firm 
seeks individual clients.'* The controls for growth and attrition provide 
an interesting method of testing for direct vacancy chain effects. Firm 
growth increases the likelihood of promotion, as expected. This finding 
verifies past research on the relationship between organizational growth 
and promotion chances, while serving as a reminder of the irony of the 
promotion paradox. Alongside the advantages due to growth, there is a 
negative relation between firm strength and the likelihood of promotion. 
Not only may law firm growth increase job opportunities, but it may be 
one of the factors that increases the bargaining strength of the firm as 
well. Consistent with past work, the results show that firm growth is a 
positive and significant predictor of promotion. 

These findings are not inconsistent with vacancy chain effects. On the 
contrary, they complement vacancy chain processes. Firm bargaining
strength is the result of the relationship between the firm and its envi- 
ronment. This relationship is one mechanism that determines when va- 
cancies are opened and closed. Vacancies are created when the employer 


“ It is important to note that the coefficient for individual clients remains significant 
across all models in table 2. Subsequent analyses determined that this effect was largely
due to firms practicing criminal law. Criminal law firms, beyond having a higher
likelihood of failure, have other characteristics that lower their power:
(1) associates have direct client contact (Smigel 1969), and (2) criminal
law firms face isolation from referral networks and thus from sources for
recruiting and collusion (Heinz and Laumann 1982).
lacks the bargaining strength to deny rewards in the form of promotions
to its employees. 

Given that the actual decision to partner an associate is largely unob- 
servable from such a macroperspective, constructing precise measure- 
ments of the relationship between firm strength and promotion chances 
poses a serious challenge. Despite this difficulty, the evidence amassed 
thus far for promotion patterns in 12 cohorts across 50 years of law firms 
is compelling. Not only is it clear that individual determinants of law firm 
failure lead to increased promotion chances for associates, but predictors
of firm life chances also predict promotion opportunities. 


Scope Conditions and Limitations 


There are two main limitations to this research, both related to generalizability.
The uniqueness of law firms in Silicon Valley suggests that
the promotion paradox be evaluated within a set of scope conditions that 
guides future research by organizational sociologists. While examining the 
Silicon Valley legal profession has clear advantages, it is not clear how 
far one can extrapolate the findings to organizations that have different 
organizational structures and employment systems. Furthermore, it may 
be unreasonable to expect that socioeconomic characteristics of the Silicon 
Valley labor market exist in other markets. 


Law Firm Employment 


First, the legal labor market is a unique market. Most law firms, as 
partnerships, have atypical organizational forms. In partnerships, there 
is a clear distinction between the firm and the employees. The firm is the 
set of partners, and the employees are the associates. Promotions to partner 
are largely made from within, rather than laterally. Associates that become 
partners are then members of the firm, responsible for making promotion 
decisions in the next round. 

In contrast, the traditional work arrangement is characterized by a more 
continuous relationship between the firm and its employees. Employees 
are thought to move up the rungs of a job ladder, where increases in 
status and pay associated with each additional rung rarely constitute an 
increase on the order of making partner in a law firm. In fact, some 
organizations employ symbolic promotions, which are characterized by a 
change in job title, but little else. Moreover, the entity making the pro- 
motion decisions is often another superior that may not use the firm’s 
strength in their decision. Thus, applying the promotion paradox to other 
organizational forms may be premature. The evidence demonstrates the 
promotion paradox’s relevance to up-or-out partnerships. Future research 
must determine whether the promotion paradox exists in other organizational
and employment settings.

At the same time, these findings do directly apply to the career patterns 
of professionals within many organizations: accounting firms, consulting 
firms, and venture capital firms. An increased understanding of the pro- 
fessional service industry is a substantial advancement to organizational 
and career theories. Indeed, this research is directly relevant to attainment 
within knowledge-based organizations where promotions are coupled 
with rewards and there is a clear distinction between the firm and its 
employees. To the degree that the United States becomes an information-intensive,
service-based economy, the labor context examined here increases in
its applicability.


Silicon Valley as a Unique Context 


Another unique aspect of my study is its location in Silicon Valley. For
both employers and employees, Silicon Valley is a high growth region. 
The dynamics of competition and market entry, and thus firm strength, 
may be different in more mature industries. Silicon Valley is also a region 
characterized by an exceptionally low level of unemployment and high compensation.
During 1997, the Santa Clara County unemployment rate was
3.1%, the lowest in the United States (national rate: 4.9%). Furthermore, 1996 income
data reveal that average annual earnings in Silicon Valley were $43,510
compared to $28,040 nationally (Hoffman 1997). Employees, especially 
professionals, fear loss of employment or income due to transfer less often 
than do employees in other regions of California or the United States. 
The great number of exits in my study (recall that only 87 of 463 new 
hires were promoted) draws attention to the uniqueness of the law firm 
as an employer. Caution must be taken in generalizing results to contexts 
that do not reflect these patterns. This may influence the shape of the 
bargaining power dynamics in ways that do not exist in most employment 
relations. 

An interesting aspect of studying Silicon Valley is the opportunity to 
investigate the employment relationship within one of the most econom- 
ically influential regions in the entire world. To my knowledge, this is the 
first in-depth, rigorous analysis of firm competition and career advance- 
ment within Silicon Valley to address these issues across a 50-year period. 
Thus, while Silicon Valley is unique, it is also critically important for both 
scholars and practitioners to glean insight from studies of a region that 
has set the technological pulse and direction of the United States and 
much of the world. 
CONCLUSION


This study is an important contribution to organization theory, as well 
as research on stratification and career mobility. First, it conceptually and 
empirically bridges the void between research on organizations as em- 
bedded in environments and the career advancement opportunities of the 
organization’s employees. Second, the richness of the data allows one to 
test alternative hypotheses while exploring new ideas in a way that has 
not been possible in past research. For example, while some may argue 
that growth or attrition are the covariates of primary relevance, I find 
that the promotion paradox exists whether or not these better-known 
engines of mobility are controlled for. This is important, for it suggests 
that the influence of the firm’s environment is largely independent of the 
usual indicators of promotion opportunities. Third, this research advances 
the understanding of professional career mobility (attorneys) in
human-capital-intensive service industries (legal services).

One of the most important contributions lies at the theoretical under- 
pinnings of this thesis: that there is a significant relationship between an 
organization’s life chances and its bargaining strength. Without taking 
this relationship into account, along with its implications for organizations 
and their employees, we are led to an incomplete interpretation of strat- 
ification and career advancement. This research opens more possibilities 
for the applications of organizational theory to understanding an indi- 
vidual’s structure of opportunity (Baron and Bielby 1980). While this 
article focuses on the contributions of organizational ecology and popu- 
lation dynamics, there is a larger theme of understanding how the fate 
and life chances of firms influence the rewards and opportunities of
individuals that has only begun to be explored. 

The findings of this study have particular implications for stratification. 
Traditionally, the synthesis of organizations and stratification research has
given us the insight that organizational and environmental attributes drive 
variation in individual attainment. Much of this work has examined dif- 
ferences in opportunity as a function of the firm’s economic sector/industry 
and whether the firm has formalized employment structures (Baron 1984). 
Extending this research, this article suggests that an associate’s opportunity
structure is also a function of the law firm’s competitive strength.
The stratification of Silicon Valley law firm associates is influenced by 
the distribution of firm competitive strength within the population. More 
mobile (or less stable) careers are positively associated with populations 
dominated by law firms with strong life chances. To the extent that a 
population of employers has weaker life chances, opportunity structures 
become more stable while the rewards of partnership become more likely 
to be attained. 
The results of this study offer new insights into the study of law firms
in particular. My analyses suggest that models of law firm promotion
systems as up-or-out firms (O’Flaherty and Siow 1995), professional
labor markets (Wholey 1985), or arenas for tournament-based promotions
(Galanter and Palay 1991) are credible but incomplete representations. A
more complete understanding of organizational influences on attainment 
and mobility requires us to better understand the role of the organization’s 
environment in influencing bargaining strength. 
APPENDIX
Minimums, Maximums, Means, and Standard Deviations

TABLE A1

DESCRIPTIVE STATISTICS FOR 4,108 ORGANIZATION-YEARS (ALL 512
SILICON VALLEY FIRMS), 1946-96

Variable              Min     Max     Mean    SD
Firm failure          0       1       .08
Firm age              1       51      9.84    9.59
ln(density)           1.79    5.34    4.78    .69
ln(partners)          .00     4.47    1.29    .75
ln(associates)        .00     5.33    .90     .96
Scope                 .01     .62     .15     .10
Individual clients    0       1       .81
Branch                0       1       .24
Growth                .00     4.33    .11     .30
Attrition             .00     .92     .06     .17
Firm status           .00     1.00    .20     .22

Note: N = 512.

TABLE A2

DESCRIPTIVE STATISTICS FOR 1,775 PERSON-YEARS (443 SILICON VALLEY
NEW HIRES), 1946-96

Variable                 Min     Max     Mean    SD
Tenure (yrs)             1       11      2.97    1.82
Promotion                0       1       .05
Attorney’s age           25      55      31.57   4.63
Female                   0       1       .35
Elite law school         0       1       .18
Magna/summa              0       1       .09
Honors society           0       1       .23
Clerk/extern             0       1       .04
Firm age                 1       49      15.75   9.53
ln(density)              2.08    5.34    4.99    .45
ln(hiring density)       .00     4.09    3.71    .64
ln(partners)             .00     4.47    2.34    .94
ln(associates)           .00     5.33    2.47    1.20
Scope                    .01     .62     .23     .42
Individual clients       0       1       .84
Branch office            0       1       .57
Growth (%)               .00     3.00    .20     .27
Attrition (%)            .00     .56     .07     .14
Firm status              .00     1.00    .23     .19
Probability of failure   .00     .99     .01     .08

OF FAILURE

Percentile       P
5%               <.0001
25%              <.0001
50% (median)     .0003
75%              .0021
95%              .0225
98%              .0650
99%              .1300

Neoclassical Sociology: From the End of
Communism to the End of Classes¹
American sociology marked the triumphalism of the immediate postwar
period with its emblematic “end of ideology” thesis. Class struggle for an 
alternative socialist order was ruled an anachronism because capitalism 
and liberal democracy could effectively deliver expanded freedoms and 
improved living standards. America was as good as it gets while “communism”
was the despised, totalitarian “other.” Protagonists of “the
end of ideology” (the most famous being Seymour Martin Lipset, Daniel
Bell, and Philip Selznick) had themselves started out as unrepentant
socialists in the 1930s. Their drift toward complacency, culminating in 
1950s “functionalism,” was itself overtaken by the successor radicalism 
of the 1960s, a radicalism that pointed to the seamy side of U.S. capitalism 
and the limits of its “democracy.” This revolt against the end of ideology 
and its concomitant “anticommunism” inspired such commentators as 
Iván Szelényi and David Stark in the 1970s and 1980s to develop alter- 
native class critiques of actually existing “communism.” They reconcep- 
tualized communism in more positive terms as “state socialism,” pointing 
to its potentialities as well as its limits. Szelényi came to his (new) class 
analysis of state socialism by joining critical sociology drawn from the 
West to the critical theory of the Budapest school, while Stark’s interest 
in Marxian-inspired labor studies gave rise to research into the possibilities
and limits of worker democracy, first in Yugoslavia and then in Hungary. 

In their most recent books, however, Szelényi and Stark turn away 
from their past to merge with other contemporary theoretical cur- 
rents—currents that espouse a second round of the end of ideology, em- 
bracing capitalism in its multiple guises, losing touch with sociology’s 
critical powers. While the first round of the end of ideology still recognized 
a working class—even if it focused on its supposed pathologies—the sec- 
ond round, nearly half a century later, seems to have abandoned class 
altogether. The basis of this new sociology is no longer, as it was in the 
1950s, the threat of totalitarian others—either from without (communism) 
or from within (working class)—but the commemoration of their disap- 
pearance. Nowhere has this new sociology found more fertile soil than in 
the former socialist countries of Central Europe. 


CENTRAL EUROPE: A LABORATORY FOR THE STUDY OF 
COMPARATIVE CAPITALISMS 


In Making Capitalism without Capitalists Gil Eyal, Iván Szelényi, and 
Eleanor Townsley call the new paradigm neoclassical sociology. If the 
classical sociology of Karl Marx, Max Weber, and Emile Durkheim 
achieved canonical status by engagement with the problems and possi- 
bilities of the 19th-century transition to modern capitalism, then neo- 
classical sociology is the analogous engagement with the capitalist tran- 
sitions at the end of the 20th century. Whereas classical sociology was 
haunted by the prospect of a socialist alternative to capitalism, neoclassical 
sociology marks the demise of that alternative. In place of the classical 
concern with the origins, dynamics, and reproduction of a singular cap- 
italism, neoclassical sociology is concerned with capitalism’s diversity, “a 
mosaic of the most diverse socio-economic structures and institutions” 
(Eyal et al., p. 16). It is no longer a matter of studying the origins of a
single order but the multiple origins of plural orders. Whereas classical 
sociology dwelt on the uniqueness of capitalism through comparison with 
noncapitalist societies—the feudal past or a socialist future—neoclassical 
sociology compares capitalism with capitalism. The contemporary world, 
in particular the burial of socialism, calls for a research program of com- 
parative capitalisms. 

In a similar vein David Stark and László Bruszt, in their Postsocialist 
Pathways, argue that the eclipse of communism redirects the sociological 
focus and sociological imagination to the different paths to multiple but 
always capitalist futures. “When we stop defining capitalism in terms of 
socialism, we see that, in our epoch, capitalism as a construct is analytically
interesting only in the plural: Capitalisms must be compared vis-à-vis
each other” (p. 3; emphasis in the original). Like Eyal et al. they
regard East and Central Europe as natural laboratories for the new so- 
ciology. Both books try to grapple with the continuities and discontinuities 
of socialism and capitalism. Both try to come to terms with the eerie
echoes of the past alongside entirely novel economic, political, and social 
forms. Rather than revolutions or evolutions, they think in terms of 
mutations, reconfigurations, or, to use Stark’s prescient concept, 
“recombinations.” 

Their similarities notwithstanding, the books focus on different analytic 
levels of the transition. Stark and Bruszt focus on institutions, the collapse 
of the old and the creation of the new, connected by strategies of extri- 
cation. They call their approach path dependent in that institutional so- 
lutions—forms of democracy and privatization—forged within the mo- 
ments of extrication from socialism, have locked Hungary, the Czech 
Republic, East Germany, and Poland into divergent paths. On the one 
side, Stark and Bruszt distinguish their perspective from the neoliberal 
“transition as imitation” of textbook capitalism, what they call “capitalism 
by design” or one might call future dependence. On the other side, they 
hotly dispute the past dependence of involutionary approaches that treat 
postsocialism as the degenerative product of communist legacies.²

Eyal et al. adopt a similar “path” perspective toward the transition, 
but in contrast to Stark and Bruszt their paths go back deep into com- 
munist and even precommunist days of Hungary, Poland, and the Czech 
Republic. Their paths, however, are not of institutions but of individuals. 
Indeed, they have little to say about institutional transformation—the 
logic of collapse, extrication, and creation. Instead they focus on the tra- 
jectory of various elites—individuals endowed with different forms of 
capital (economic, social, political, and cultural resources) who strategi- 
cally adjust to the exigencies of traversing different orders. They call their 
approach trajectory adjustment: institutions shape the dispositions of in- 
dividuals, who in turn contest and transform those institutions, leading 
to the readjustment of individual trajectories. Both books share a common 
outlook on the transition: they both optimistically celebrate the indeter- 
minacy, novelty, and variety of the transformations—whether of insti- 
tutional patterns or elite alliances—taking place in post-Soviet societies. 

In this essay I would like to suggest that such neoclassical optimism is 
misplaced and unsubstantiated—rooted more in faith than in reality. The 
optimism of these authors is based, I argue, on an overestimation of the 
importance of elites, patterns of privatization, and political democracy, 
and on an underestimation of the importance of capital accumulation, 
class relations,³ and global forces. In other words, in their “comparative


² In a sense their approach is also “past dependent” but the past only goes as far back
as the break with socialism. This is not to say that legacies of communism do not 
enter their analysis, but their key claim is that the consequence of those legacies is 
shaped by the particular mode of extrication. For the clearest elaboration of their 
complex causal logic see Stark and Bruszt (p. 164). 

³ By class relations I mean the relations of domination and exploitation between classes,
which is to say classes have to be studied in relation to one another and class structures 
have to be studied as systemic wholes (see Wright 1997).
capitalisms,” capitalism drops out of the picture. In pursuing openness
and plurality, neoclassical sociology forsakes classical sociology’s elabo- 
ration of the limits of capitalism—limited possibilities for democratic in- 
stitutions and individual self-fulfillment. Thus, Eyal et al. extend Weber’s 
vision of different capitalisms as the project of different bourgeoisies and 
expand Weber’s distinction between class and status (rank) to compare 
precommunist, communist, and postcommunist societies. In the process, 
however, they lose Weber’s critical insights: his iron cage view of capi- 
talism, his tying of capitalist efficiency to bureaucratic domination, and 
more generally his own disenchantment with Western rationalism. 

Stark and Bruszt move away from Weber’s concern with the fate of 
the individual to echo Durkheim’s concern with crafting institutions.’ Like 
Durkheim they are very critical of both market and state pana- 
ceas—markets require both underlying trust and supportive institutions 
just as states can only supply coordination at a distance. Like Durkheim 
they turn to intermediary associations as the fulcrum of an effective cap- 
italism—only for Stark and Bruszt these associations are ownership net- 
works among firms whereas for Durkheim they are occupational asso- 
ciations. Just as Durkheim extolled the enabling power of constraints, so 
Stark and Bruszt laud democratic accountability and policy coherence. 
Yet, entirely absent is Durkheim’s trenchant critique of power inequalities 
and of social injustice that for him, at least, marked the gap between the 
pathological present and the ideal future. Indeed, Stark and Bruszt's 
repudiation of the very concept of transition, because of its implied ev- 
olution toward a normative future, allows them to endow the present with 
multiple potentialities. The past cannot be reborn, the future is uncertain, 
and so the present is triumphant. 

The use of “neoclassical” to describe the new research program suggests 
parallels with neoclassical economics, and indeed parallels there are. Both 
emphasize strategic action in the deployment of capital, only neoclassical 
sociology extends the meaning of capital to include social and cultural as 
well as economic resources.* Both downplay oppression and social justice. 
Both have optimistic assessments of the potentialities of capitalism as the 
end of history. In one fundamental respect, however, neoclassical sociology 


* While Eyal et al. are very clear about situating themselves within a Weberian research 
program, Stark and Bruszt do not acknowledge any connection to Durkheim. The 
parallels, however, are unmistakable, especially as Stark and Bruszt draw so widely 
on Durkheimian-influenced economic sociology, such as Mark Granovetter’s focus on 
the embeddedness of markets, Charles Sabel’s idea of developmental associations, 
Wolfgang Streeck’s notion of associative orders, and more generally on the neoinsti- 
tutionalism of John Meyer, Walter Powell, Paul DiMaggio, and others who highlight 
the normative constraints on the form and operation of organizations. 

* One is struck by the parallels with the 1950s renaissance of sociology in the form of 
structural functionalism which derived from Talcott Parsons’s theory of social action, 
itself an explicit attempt to make neoclassical economics a special case of a broader 
sociology. In claiming to synthesize neoclassical economics and path dependency, Eyal 
et al. are also proposing a quite general theory of social action (see, e.g., pp. 36-45).
diverges from neoclassical economics, namely in replacing the latter’s
singular normative model of capitalism with multiple capitalisms derived 
from different origins. But, as I shall argue, this last claim remains unproven,
resting as it does on the diversity of superstructural manifestations
of capitalism (social origins of elites, property forms, and political de- 
mocracy) rather than on an underlying diversity of economic forms of 
production and corresponding class relations. 

Ironically, in their earlier pioneering analyses of state socialism both
Szelényi and Stark were anything but “neoclassical” in their orientation. 
To the contrary, they were heavily influenced by Marx—an influence that 
has left traces in their accounts of postsocialist capitalism even though 
economic processes, class relations, and alternatives to capitalism have 
disappeared. Waving farewell to the now-calumniated socialism, which 
magically and unexpectedly evaporated, they lose their critical sense to- 
ward emergent, peripheral capitalisms. Even in their common appropri- 
ation of Pierre Bourdieu, they deploy his theory as an all-purpose tool kit 
for the analysis of resources (capital), dispositions (habitus), and strategic 
action (in social space and fields), but in the process sacrifice the political 
dimension of Bourdieu’s writings—the analysis of the reproduction and 
mystification of class relations. In a paradoxical twist of history, it is as 
if it is not the beginning but the end of socialism that spells the end of 
classes! 

In what follows I discuss the move away from the study of subaltern 
groups and class relations—so present in their early works—to the focus 
on ruling elites, ownership patterns, and forms of democracy. I then in- 
terrogate the conceptual basis for the research program into comparative 
capitalisms, namely their origin driven analyses—whether this be Stark 
and Bruszt’s path dependency, which starts with the breakdown of socialism
and highlights discontinuity between socialism and its aftermaths,
or Eyal et al.’s trajectory adjustment, which offers greater historical sweep 
and continuity. I put both frameworks to the challenging Russian test and 
show how their conceptualizations minimize the constraints on elites by 
nonelites, politics by economics, the national by the global. I propose an 
alternative approach, namely to suspend the study of origins for the study 
of class relations in the present. As against neoclassical optimism I propose 
a postsocialist theory, analogous to postcolonial theory, that restores a 
critical focus based on the limits to change at the same time it recuperates 
subaltern alternatives. So, finally, I propose not only a recovery of class 
but also the excavation of embedded socialism. 


* Thus, Eyal et al. “claim to have reconstructed Bourdieu’s theory, shifting it from a
static explanation of social reproduction into a dynamic, comparative-historical theory
which is able to explain social change” (p. 187). 
THE ROAD TO NEOCLASSICAL SOCIOLOGY


Iván Szelényi is the leading sociologist of the former Soviet world. He is 
best known for the classic book, The Intellectuals on the Road to Class 
Power (1979), which he wrote with György Konrád in the middle 1970s
when both were under surveillance of the Hungarian state. State socialism, 
they argued, had a special place for intellectuals as architects and legit- 
imizers of the “rational redistribution” of resources in a planned economy. 
Intellectuals formulated and justified the plan as serving the needs of all. 
To be sure, intellectuals had not yet arrived at their appointed destination, 
but they were a class in statu nascendi, a class opposed to the workers
in whose name they would rule and from whose labor they would organize 
the extraction of surplus. From this two-class model of socialism much 
else followed, not least its possible democratization through intellectuals’ 
responsiveness to the working class. 

Szelényi (1983) concretized this theoretical framework in his study of 
Hungarian urban inequalities, specifically the distribution of housing, 
which showed that supervisors, technicians, white-collar workers, and 
managers, in short the new class, had privileged access to subsidized 
resources (apartments). This meant, he further argued, that under state 
socialism the working class had an interest in the expansion of markets 
so that they could bypass the biases of “rational redistribution” and engage 
in entrepreneurial activity, such as obtaining materials to build their own 
homes. Markets, he argued, countered the inequalities of the administra- 
tive distribution of resources. He, thereby, laid the foundation of the tran- 
sition debate two decades later: whether the market transition would 
exacerbate or undermine previous inequalities of the socialist state.’ 

In the 1980s it seemed as though the ascendancy of intellectuals had 
been halted. Working with his former Hungarian students, Szelényi dis- 
covered a second new class making its way upward—peasant entrepre- 
neurs who were able to take advantage of the opening up of the market 
in the still-socialist Hungary. In Socialist Entrepreneurs Szelényi now 
argued that the old bourgeoisie, whose life had been cut short by the 
advent of communism and who had parked their resources in the edu- 
cational system or in the party, were now exploiting the new opportunities 
of Hungary’s mixed economy of the 1980s. Early communism, therefore, 
marked a period of interrupted embourgeoisement that was now reas- 
serting itself in late communism. However, once more history overtook 
Szelényi’s theory. State socialism collapsed. Although there were elements 
in the Hungarian Democratic Forum, the first postsocialist government, 
that were attracted to the entrepreneurial Third Road, reality did not 
treat indigenous capitalists too well. Certainly, they did not become a new 
ruling class. 

In his latest collaboration with Gil Eyal and Eleanor Townsley, Szelényi 


7 See, e.g., the transition debate in the American Journal of Sociology, vol. 101, no. 4
(January 1996).
returns to The Intellectuals on the Road to Class Power. Drawing on
surveys that oversampled elites as well as their own observations and 
interviews, the authors argue that postsocialist Hungary, as well as Poland 
and the Czech Republic, have spawned a new ruling elite, dominated by 
the intelligentsia. In effect, Szelényi and his collaborators are returning 
to the thesis of intellectuals on the road to class power, only now intel- 
lectuals realize their mission after rather than during the period of state 
socialism. This is what they call the second Bildungsbürgertum, the second
arrival of a cultural bourgeoisie. The first Bildungsbürgertum arose in the
19th century when Central European intelligentsia pioneered a modern- 
ization project in partnership with professional civil servants. At the end 
of the 20th century the intelligentsia again envisioned a capitalist utopia, 
but this time in association with the socialist managerial elite—a coalition 
of propertyless agents who thus “make capitalism without capitalists.” 
Just as the first Bildungsbürgertum was a reaction against the rank order
of feudalism, so the second Bildungsbürgertum is a reaction against the
rank order of communism. Just as the first ascendancy of the intelligentsia 
proved transitory, Eyal et al. argue, so will the second. Indeed, at the end 
of their book they present new data, suggesting that a new economic 
bourgeoisie may be in the process of formation.

Even though he seems to be returning to his earlier ideas, Szelényi’s
vision now lacks the critical perspective that so marked The Intellectuals 
on the Road to Class Power. Konrád and Szelényi approached state so- 
cialism from the standpoint of its exploited working classes, just as So- 
cialist Entrepreneurs (Szelényi 1988) began with a study of peasant work- 
ers. But the latest analysis has shifted away from class relations to the 
“pacting” of elites." We can detect a similar shift in the work of David 
Stark, whose pioneering Hungarian social research made important con- 
tributions to the study of workplace politics. Like Szelényi, Stark began 
his career with a focus on the working class, showing how shop floor 
participation both contributed to and challenged the reproduction of class 
domination. His 1980s research into Hungarian factory regimes exposed 
the forging of all manner of informal arrangements as workers adapted 
to the exigencies of an administered economy. In his most celebrated article 
of that period, Stark (1986) showed how the development of internal 
subcontracting systems acted as a pseudo-market within the socialist en- 
terprise, compensating for the dysfunctions of the plan, just as in modern 
capitalism the internal administration of labor countered the dysfunctions 
of the external market. Hierarchies require markets, just as markets re- 
quire hierarchies. Capitalism and state socialism were each other’s mirror 


* Elite studies may note the separation between elite and mass, but they do not see 
the source of that separation in a specific relation of (class) subordination. Thus, Eyal 
et al. do note (p. 35) that the transition to postcommunism has led to a polarized social
structure, but they still insist on an elite analysis because capitalism is being made 
from above (pp. 159-60). A class perspective argues that “made” from above means 
“shouldered” from below. 
image. State socialism, therefore, could not be reduced to a totalitarian
model of a command economy but, at least in its mature form, required 
all sorts of subordinate organizations operating along lines that contradict 
communism’s dominant logic. This offered Stark (1989a, 1989b) the conceptual
tools for a richer, variegated portrait of state socialism.

Joining forces with the Hungarian political scientist László Bruszt, 
Stark now transposes his earlier interest in the diversity of socialisms into 
a plurality of emergent capitalisms. Curiously enough, however, Stark 
and Bruszt connect postsocialist diversity not to diverse socialist legacies 
but to the autonomy of the political, to choices made in the democratic 
transition.’ In the first part of Postsocialist Pathways they chart the dif- 
ferent paths of “extrication” shaped by the strategic interaction of (a) 
hardliners and reformers and (b) ruling elites and the opposition. Thus, 
Hungary developed the most perfect democracy in the region because the 
opposition was weak while the ruling class was divided, whereas in Poland 
the opposition, organized around Solidarity, was much stronger, thus com- 
promising liberal democracy. Bulgaria, Romania, and Albania were at 
the other extreme. There the opposition was so weak that communist 
parties had merely to rename themselves to regain power. In the second 
part of their book Stark and Bruszt link the different privatization strat- 
egies of Germany, the Czech Republic, Poland, and Hungary to their 
different paths of extrication—reunification in Germany, capitulation in 
the Czech Republic, compromise in Poland, electoral competition in Hun- 
gary. In the final part they drive home the thesis that, far from being 
incompatible with capitalism, democratic deliberation and dense associ- 
ation are its necessary conditions. Extended accountability, as they call 
it, promoted a vibrant capitalism in the Czech Republic, whereas more 
limited accountability produced weaker capitalisms in Hungary and 
Germany.” 

The underlying parallels between these two books are striking. Both 
stress historical pathways or trajectories from past to present, so that 
where we end up bears some relation to where we start out. The plurality 
of origins gives rise to a plurality of capitalisms. To be sure, their histories 
start at different points, and so they come to different conclusions about 
the present, but both books take the commonsensical view that origins 
matter. This may be common sense, but it is not necessarily sociological 
sense. Classical sociology may have been interested in origins, but its 
overwhelming theoretical thrust was to argue that modern capitalism was 
so all-encompassing that it erased its origins. To understand modern cap- 
italism, therefore, was to understand it as a system, made up of badly or 
not so badly integrated parts. The crucial questions concerned the repro- 


9 Here the influence of theorists of the transition in Latin America (Philippe Schmitter,
Terry Karl, Adam Przeworski, and others) is palpable. They too emphasize the openness
and opportunities presented by the democratic transition.

10 One might note that Czech success has become rather tarnished in the last two or
three years, with multiple political scandals and desultory economic performance.
duction of a system of relations, especially the economic relations that
defined capitalism. In concentrating on origins, neoclassical sociology fails 
to do what its classical forbears did so well, namely to study the new 
order as a system of interdependent parts—relations of economy to polity, 
of elites to nonelites. 

For classical sociology—and here I am referring to Weber and Durk- 
heim as well as to Marx—it was a mystery how capitalism survived its 
internal tensions and contradictions, how it absorbed tendencies toward 
dissolution and disruptive transformation. But for neoclassical sociology 
questions of capitalism’s continuity are replaced by the study of origins. 
It is as if, with the eclipse of the socialist alternative, there is no longer 
a counterpoint, an external position from which to understand capitalism’s 
defining systemic and contradictory character. Thus, Eyal et al. study the 
formation of new elites out of old elites, while Stark and Bruszt, although 
closer to a systemic analysis, nonetheless study the origins of ownership 
patterns and corporate networks. Both books study the economy at a 
distance, as a taken-for-granted abstraction and not as a concrete set 
of productive relations. Whereas in their earlier writings they focused on 
the ingenuity of the subaltern classes in coping with socialism, the way 
workers and peasants challenged and transformed state socialism in the 
microprocesses of everyday life, Szelényi and Stark now turn to the elites 
engineering embryonic capitalisms. Their analyses exclude subordinate 
classes, which in effect become the bewildered—silent and si- 
lenced—spectators of transformations that engulf them.” 


THE DURKHEIMIAN REPRISE: POSTSOCIALIST PATHWAYS


Postsocialist Pathways is a collection of essays that were originally written
over a period of seven years, starting in 1990, with themes spanning the 
phases of the transition itself: political extrication, privatization, and new 
forms of network capitalism. Stark and Bruszt try to bring the essays 
under a single rubric: the relation between property transformation and 
the consolidation of political democracy (pp. 1, 129). They ask whether 
the twin transformations are mutually contradictory. If property transformation
and economic restructuring call for popular sacrifice, will not
this be blocked as democracy extends veto power to the suffering classes? 
Stark and Bruszt argue that a peculiar form of capitalism, residing neither 
in markets nor states but in networks of deliberation, has emerged to 
foster an economic growth that will benefit all. In short, in the postsocialist


11 It should be noted that Szelényi has retained his interest in the popular classes in
his as-yet-unpublished study of the transformations of a Hungarian gypsy village over
the last 150 years. He is also at work on a multinational study of poverty and ethnicity.
In this review, however, I am confining my attention to Making Capitalism without 
Capitalists. 
world, the transition to capitalism and democracy can be mutually
reinforcing. 

But where does this comparative advantage of postsocialism come from 
if not from socialism itself? Indeed, that is exactly what Stark and Bruszt 
suggest: postsocialist states have neither strong markets nor coherent 
states but “they have decades of experiences with strong networks” (p. 
122). These economic networks, they conjecture, provide the foundation 
for the new form of capitalism that is neither market driven nor state 
driven. To be successful—and now the measure of success has shifted to 
policy coherence—these networks must be both deliberative and associ- 
ational: that is, they must bind enterprises together in a process of joint 
and extensive participatory decision making.” It turns out that Hungarian 
networks are associational but not deliberative, German networks are 
deliberative but not associational, while the Czech networks offer a su- 
perior combination of association and deliberation. 

The accuracy of these claims about novel arrangements concerns me 
less than the theory of their origins. Given Stark’s earlier work, which 
made much of the specificity of Hungarian socialism, one might have 
expected the authors to trace the origins of postsocialist pathways to 
divergent socialisms. Far from it. Instead they make a decisive analytical 
distinction between the dissolution of the old order and the creation of 
the new. On the one hand, they present the dissolution in a singular 
fashion: the communist party state, based on “delegation by usurpation,” 
spread its tentacles throughout society and eventually dissipated itself by 
dissolving into its environment. It collapsed through overextension and 
absorption. On the other hand, they stress the multiple paths of extrication, 
which cannot, therefore, be explained by a singular past. Rather, they 
argue, the diverse ways of reworking the past spring from diverse political 
conjunctures in the moments of dissolution. In Hungary—the perfection 
of its democratic consolidation notwithstanding—power is concentrated 
in the executive, leading to policy swings that are disastrous for its econ- 
omy. The Czech Republic is the most successful in developing a delib- 
erative relation between the executive and parliament, and its economic 
networks supposedly reflect this success. Again the empirical basis of these 
connections is thin, but the argument is interesting. It is as if each country 
started with the same (crumbling) foundations of state socialism but, for 
complex political reasons, each designed and built very different houses 
on those foundations. Once the scaffolding goes up, the trajectory is con- 
strained. So path dependency is past dependency, it is just that for Stark 
and Bruszt the “past” starts with the breakup of communism. As we shall 
see through a brief discussion of the Russian case, this is too arbitrary a 
starting point. 

Stark and Bruszt pay virtually no attention to Russia in their com- 


12 They are here drawing explicitly on the ideas of Charles Sabel, in which a democratic
capitalism based on flexible specialization and deliberative associations can dissolve 
conflict and contradiction. 
parative analysis, but it is worth testing out their scheme on this exceptional
terrain. There can be no doubt about the importance of associational
ties among enterprises, even though these ties are less the product of 
privatization and more a function of economic interdependence based on 
barter or the links that bind enterprises into the so-called financial in- 
dustrial groups. But are the associations “deliberative” in the sense of 
being accountable to diverse actors? Like the Hungarian enterprise net- 
works, only more so, the Russian networks are accountable to themselves 
alone and insulated from external pressures. Like the Hungarian asso- 
ciations, they are continually being bailed out by the state, leading to the 
asymmetric distributions of assets and liabilities. The Russian state must 
surely be regarded as the prototype of what Stark and Bruszt call the 
“antidevelopmental state.”” Indeed, turning to the political field one can 
see in Russia’s so-called superpresidentialism an exaggeration of the Hun- 
garian concentration of power in the executive. 

It all fits very nicely—too nicely! No two economies diverged more in 
the socialist period than those of Hungary and Russia, and yet we see a 
postsocialist convergence—at least in terms of their network properties 
and their political fields. Surely this demonstrates the importance of post- 
socialist extrication? Yes and no. For all their supposed convergence, these 
national economies are worlds apart: in the one a dramatic and unprec- 
edented disintegration, while in the other stuttering growth; in the one a 
retreat to primitive barter relations and subsistence existence, while in 
the other the consolidation of a money economy; in the one criminalization 
of the economy, while in the other the emergent rule of law; in the one 
an enormous concentration of power in oligarchs who control media, 
natural resources, and banking, while in the other a diversified economic 
structure. So, if there are certain convergences that can be explained by 
the paths of extrication from communism, at the same time there are 
deeper and more significant divergences that go back much further in 
time. In short, there may be path dependency, but where does the path 
begin, which crystallizing event determines which future, how many such 
events might there be? It is far from obvious that the most significant 
divergence of historical paths begins when the party state is replaced by 
the formal trappings of liberal democracy.” 

To be sure, certain phenomena are affected by the mode of extrication
from socialism, but they may be only the more superficial ones. Ultimately 
the data Stark and Bruszt offer to demarcate different capitalisms in 


13 The antidevelopmental state siphons resources out of the economy rather than inspiring
economic development (Stark and Bruszt, pp. 151-52). The most astute analysis
of the Russian state and its inability to implement economic reforms is to be found in 
Woodruff (19995). 

14 Anna Selenyi (1999a, 1999b), e.g., argues that the new political order is decisively
shaped by the forms of opposition and experimentation of the old order. It made a 
difference that Hungary was able to experiment with economic reforms for two decades 
whereas the experimentation in Russia was much more limited. 
Central Europe concern patterns of ownership. They take the importance
of these ownership patterns for granted. But how much does it matter 
that Hungarian companies own one another, that Czech companies are 
more likely to be owned by banks, that Polish companies are likely to be 
owned by investment companies, or that German companies are likely 
to be owned by the Treuhandanstalt? What difference do the recombinant
forms of property, founded between state and market, make? What
evidence is there that they have any particular effects on economic per- 
formance, on capital accumulation, or even on inequality? Although they 
claim to be interested in how democracy threatens economic reform by 
giving power to those who suffer its consequences, we hear nothing of 
the armies of rural and industrial workers, of the potential disrupters of 
reform. We read much about privatization schemes but not about dis- 
possession, unemployment, and immiseration. In these pages, in short, we 
discover little about the economic or political consequences of property 
reform. 


THE WEBERIAN REPRISE: MAKING CAPITALISM WITHOUT 
CAPITALISTS 


Making Capitalism without Capitalists takes a different approach to the 
transition. First, its theoretical originality lies not in the analysis of in- 
stitutions, their collapse, recreation, dynamics, and so on, but in the way 
individuals adjust their trajectories to the exigencies of institutions. Sec- 
ond, its empirical focus is not the variety of ownership patterns within 
Central Europe but what Hungary, Poland, and the Czech Republic have 
in common, namely a form of capitalism defined by the absence of a class 
of proprietors. Without an economic bourgeoisie, so Eyal et al. argue, the 
cultural bourgeoisie that formed under and in opposition to socialism 
becomes the dominant elite in partnership with a managerial elite. Third, 
the book’s comparative method lumps contemporary Central Europe into 
a more or less homogeneous category to be compared with its communist 
and precommunist past, as well as with alternate forms of capitalism.“ 
If their historical inquiry is breathtaking in scope, its arguments are less 
falsifiable than Stark and Bruszt’s provocative claim that diversity within 
Central Europe springs from the recent patterns of extrication from 
communism. 

In elaborating the argument for trajectory adjustment Eyal et al. take 


15 In their conclusion (pp. 189-90) Eyal et al. do speculate that the habituses, that is
the ingrained dispositions, inherited not just from communism but from precommunism,
could lead to different capitalisms within Central Europe. But these are just
speculations, and they do not explain how habituses are transmitted from one generation
to the next. More convincing is Eyal’s (2000) “Pastors and Prognosticators,”
which examines the different alliances among elites that composed Slovakia as compared
to the Czech Republic.
over the now-familiar tool kit of Pierre Bourdieu that conceives of social
action as the strategic deployment of different forms of capital (economic, 
social/political, and cultural) within a social space, governed by interests 
and inherited dispositions (habitus). This leads them to map out the al- 
liances among elites in terms of forms of capital in precommunist, com- 
munist, postcommunist, and capitalist societies. Thus, precommunist elites 
subordinate economic capital to social capital; under communism political 
elites subordinate the increasingly important cultural elites; postcom- 
munism subordinates economic capital to cultural capital. All these are 
to be contrasted with Western capitalism, in which cultural elites are 
subordinated to economic elites.'* The authors recognize that they leave 
hanging the question of whether postcommunist capitalism will turn out 
different from any other capitalism, despite its different origins (Eyal et 
al., p. 190). 

In order to comprehend the lasting influence of those different origins 
we have to ask how Eyal et al. understand the relation between past and 
future. How does communism continue to live so powerfully within post- 
communist capitalism as to assert the latter’s lasting peculiarity? Post- 
communist capitalism is the project of elites that originally formed in 
opposition to communism, specifically the project of dissidents who re- 
jected the political order in the name of civil society and reform com- 
munists (technocrats) who rejected the economic order in the name of 
monetarism. The question Eyal et al. pose in their most original chapter 
(the book’s third) is how these two groups, occupying opposed positions 
under communism, could form an alliance in postcommunism. Drawing 
on the analogy of Weber’s elective affinity between the Protestant ethic
and the spirit of capitalism, they show how self-denying dissidents and 
capitalist-spirited technocrats shared a commitment to living within truth 
and opposing communist mendacity, to establishing government from afar 
and opposing state regulation of everyday life, and to supporting the rule 
of law and opposing party despotism. In other words, new postcommunist 
institutions originate from enduring “habituses” inherited from the past. 

It is one thing for elites to have projects, it is another to realize them. 
Eyal et al. recognize that emergent capitalisms are the product of nego- 
tiated compromises. The dissidents and technocrats may be able to rec- 
oncile their own differences, but they still need the support of managers 
who run the economy. They, therefore, recompose their ideologies of civil 


“ Note that Eyal et al. use the different forms of capital to map out alliances among 
elites, but they do not deal with the question of the accumulation of capital and in 
particular of economic capital. Indeed, they regard the task in Central Europe as 
privatization and not accumulation. “Accumulation occurred under state socialism, 
and the task of post-communist capitalism is to individualize it” (Eyal et al., p. 192). 
Primitive accumulation—i.e., dispossessing a large proportion of the population from 
direct access to the means of subsistence—may have occurred, but accumulation is a 
continual process that does not cease with proletarianization. Indeed, we have only to 
look at the Russian economy to know that while privatization can be accomplished, 
it can come at the cost of enormous disaccumulation and even deproletarianization 

society and monetarism into a “managerialist ideology” that gives cohesion 
to the new power bloc. The relative balance of managers, dissidents, and 
technocrats, then, will determine the character and strength of the com- 
munist legacy. 

But why, one may ask, are compromises only struck among elites and 
not between classes? It is just as plausible that the divergent postsocialist 
trajectories have been shaped by the activist Solidarity movement with 
its accentuation of “voice” or by the individualistic Hungarian working 
class that “exited” into the second economy. Or if one wants to lump 
Central Europe together, one might contrast its oppositional working clas- 
ses with the Russian working class that, to overgeneralize, pursued a 
strategy of “loyalty.”” Why are these interclass relations not as important 
as the composition of the power bloc? Indeed, it could be argued that the 
compromises struck between dominant and subordinate classes set the 
prior conditions for alliances among the dominant classes. 

Even if Eyal et al. do not take up the question of class relations, to 
their credit they do take up the Russian challenge. If in Central Europe 
they see the emergence of capitalism without capitalists, in Russia they 
cleverly recognize a case of capitalists without capitalism. That is to say, 
in Central Europe the institutions of capitalism exist without a bour- 
geoisie, while in Russia there is a bourgeoisie, a class of owners recruited 
from the former nomenclatura, but it is running amuck in the absence of 
effective regulation. Here we witness neither revolution nor evolution to 
be sure but involution—an imploding world in which a merchant bourgeoisie 
forged from the old nomenclatura raided the economy to produce 
primitive disaccumulation, a return to peasant society, a retreat to self- 
provisioning, the expansion of petty commodity production and of prim- 
itive barter. As the new bourgeoisie reaches for global hypermodernity, 
they thrust the mass of the population into a premodern quagmire. Here 
extrication from the past becomes, one might say, the revenge of the past. 

The Russian catastrophe convincingly vindicates Eyal et al.’s argument, 
contra Stark and Bruszt, that communist origins are critically important 
to postcommunist outcomes. But there is a paradox. Precisely here where 
the communist habitus was the most strong and the anticommunist in- 
telligentsia most weak, the neoliberal vision has been the most vigorously 
pursued through rapid liberalization, privatization, and then stabilization. 
Of course, neoliberalism is here not an inherited habitus but a borrowed 
ideology that is opportunistically deployed to justify new forms of ex- 
ploitation and dispossession. But that is just the point. By focusing on 
elites, Eyal et al. go only so far in appreciating the peculiarities of post- 
Soviet Russia. They focus on the continuity of the old political class 
transformed into an economic elite, but they miss the radical transfor- 


1 I am here using the terminology of Albert Hirschman that David Stark (1989a) once 
used in connection with the differences between Hungary, Poland, and East Germany. 

mation of society. Trajectory adjustment, in looking for continuity, misses 
the dramatic disjuncture in Russian society. 

If now there are capitalists without capitalism, before, we might say, 
there was socialism without socialists. By this I mean that Soviet society 
was socialist in its redistributive economy and in its centrally organized 
society—an elaborate and overweening social structure that governed 
people’s strategies and rewards. People acted on the basis of their place 
in society, not on the basis of a vision of the future. As a result, especially 
in its last phase, there was no significant class of actors who believed in 
a socialist future. Adhesion to socialism was more a ritual enactment, so 
that when the system faced crisis it could either turn to despotism or 
collapse. There were no “socialists” committed to restoring and rebuilding 
the structure. One might say that Russia has moved from one extreme 
to the other, from a society structured by positions to a society structured 
by assets; from competition for privileged positions to competition for 
asset accumulation; from a society in which individuals occupy places in 
preexisting social relations to a society in which individuals are continually 
constructing social bonds; from a world of social channels to one of social 
anarchy. The constitution of class shifts from relations of vertical domi- 
nation to networks of horizontal interdependence. One might think of 
Russia today as the first truly “poststructural” society, rejecting admin- 
istered visions and investments in the future and held together by a system 
of personal ties. Reaction to the failure of the most modernist of projects 
(the planned society) has led to the most postmodern, egotistical world. 
While Bourdieu’s framework of strategies around convertible capitals 
does indeed capture something about post-Soviet society, it is a mistake 
to project this back into the past, to impose an asset-based theory on a 
position-based society. By universalizing Bourdieu’s theoretical categories, 
Eyal et al. miss their historical and critical import and thereby elide profound 
differences between Soviet and post-Soviet orders. 

I may now summarize. Both works focus on divergent paths to pre- 
sumptively different capitalisms, but where the one traces this divergence 
to individual habituses formed under communism and even precom- 
munism, the other focuses on institutional solutions to the problem of 
extrication from communism. Their measures of capitalism also differ. 
The one looks upon capitalism in terms of ruling elites, the negotiations 
among former dissidents, technocrats, and managers, while the other stud- 
ies patterns of ownership, novel recombinant forms that belie conventional 
models. Both, therefore, assess the phenomenal form of capitalism rather 
than its underlying reality, its superstructures rather than its economic 
base. Elites, ownership patterns, and politics may all diverge and trans- 
form themselves while the social relations of capitalism remain the same. 
Neither book comes to grips with the social relations of the economy that 
define it as capitalist, above all with the relations between classes. 


THE MARXIAN REPRISE: ENDURING CLASS RELATIONS 


Karl Marx offers an alternative model for the study of transitions, one 
that is not based on origins. While modern capitalism may have multiple 
origins (and Marx offers different accounts in his various writings), once 
established capitalism constitutes its own system of class reproduction 
and dynamic accumulation. The systemic logic effectively wipes out or- 
igins.** Even though capitalism may diverge in its expression from sector 
to sector, from country to country, from region to region, these divergences 
are interconnected—the result of common underlying economic processes. 
To study such sui generis economic processes, one must subordinate the 
study of historical paths and trajectories to careful in situ analyses of 
actual social relations. This calls for ethnographic data that will reveal 
the day-to-day world of strategic action bound by changing con- 
straints—the sort of detailed analysis that Marx extracted from the Blue 
Books of government inspectors or the fieldwork that Engels conducted 
in Manchester. 

We already have examples of ethnographic research in post-Soviet so- 
cieties that, instead of searching out the influence of origins, start out by 
examining constraints in the present. For example, in her study of de- 
collectivization in Romania, Katherine Verdery (1999) does not look for 
the source of privatization but shows how property relations become the 
object of struggle within her village. Political and economic relations, 
inherited from the past to be sure, govern the unfolding dynamics of 
property redistribution. The point of departure is the present, and it is 
from this vantage point that the past is understood. The same applies to 
Gerald Creed’s (1999) ethnography of the struggle over the liquidation of 
agricultural cooperatives in Bulgaria. Peasants act here as a bulwark 
against decollectivization, mobilizing the past in defending their lifeline 
to collective and individual existence. Or take Slawomira Zbierski- 
Salameh’s (1999) field research in rural Poland that shows how shock 
therapy strengthened the hold of monopolistic organizations— 
procurement centers, state farms, and banks—over the peasantry who, 
in self-protection, retreated to closed-cycle subsistence production. Again 
the focus is on the here and now of the transition, the struggle over the 
interpretation and deployment of economic reform by different classes. 
David Woodruff’s (1999a) close observation of Russian enterprises, sim- 
ilarly, shows how local forces combine to defy punitive monetization by 
resorting to barter as they contend with distant government edicts. If 


1* Max Weber is interesting in this regard. While he dedicated much of his life to the 
study of the origins of modern capitalism, he did so because its primary peculiarity 
lay in the obliteration of all traces of those origins. In other words, the universal 
significance of modern capitalism lay in its independence from its origins, its capacity 
to spread all over the world. Indeed, one might say that this interpretation of Weber 
inspired Wallerstein’s world-systems analysis. On the other hand, unlike Marx, Weber 
did not have a theory of capitalist dynamics—neither of capital accumulation nor of 
class struggle. 

these ethnographies of everyday life share any common conclusion it is 
that the past becomes a terrain of struggle, manipulated by forces con- 
tending in the present. 

The past is an object and field of conflict, but it is also a reference 
point against which to compare the present. Revisiting anthropological 
studies of socialism is another possible technique of such before-and-after 
evaluation of the transition. In May 1999, János Lukács and I returned 
to the sites of our Hungarian industrial ethnographies of the 1980s (Bur- 
awoy and Lukács 1992). What had happened to my fellow workers in 
the 10 years after the Fall? I was surprised to discover that, despite the 
retrenchment of all but a few thousand of the original 15,000 and with 
the exception of our devoted shop steward, Gyuri, who had retired to his 
home village, all members of the October Revolution Socialist Brigade 
were still working in what used to be the Lenin Steel Works. Shaped 
under late communism, their habituses had indeed endured the transition. 
Csaba was still living and drinking with his father in their one-room 
apartment. Béla, whose leg had been crushed under a steel girder, was 
still grumbling, seemingly untouched by his upwardly mobile children. 
As he had done so successfully in the 1980s, Karcsi was still exploiting 
his handicraft skills and entrepreneurial talents to tide him over work 
stoppages. Bandi and his wife (with friends and relatives) were exhausted 
from building their new home, while Laci sat gloomily in his two-room 
apartment with his unemployed wife and his two barely employed sons. 
The steelworker habitus endured to be sure, but the overwhelming reality 
was status degradation, deplorable working conditions, plummeting 
wages, and imminent unemployment. 

While workers were laid off, on the other side of the class divide a few 
managers made a killing by spinning off their departments as limited 
companies, charging the enterprise with overheads, taxes, transportation, 
and so on. Here is the story Stark and Bruszt tell of recombinant own- 
ership—the enterprise shell, under state ownership, was landed with li- 
abilities while the satellites made off with the assets. No one wanted this 
dinosaur. Through the 1990s it was buffeted by a series of makeshift 
deals, until finally a Slovakian company went halves with the Hungarian 
government in a pretense at resurrecting the mill. To Gabi, our earnest 
young manager, however, it seemed that the Slovakians had only ab- 
sconded with the cash. There was no sign of any new investment, just 
another level of bureaucracy. Now, a year later, an American company 
has taken over the Slovakian conglomerate. The mill could close tomor- 
row, with the last human dregs poured into the street. 

Contrast this with Egér’s Csepel Auto, a division of Hungary’s largest 
truck enterprise, which was bought out by the German corporation ZF, 
one of the biggest gearbox manufacturers in the world. I gasped in disbelief 
as I walked through the aisles of the spotless, silent glass house that had 
been the grimy, dark, whirring machine shop. Young technicians, relaxed 
and scarce, had replaced our oily brigades of ingenious, improvising, petulant 
operators. Tomás, the information systems manager who escorted 
us, was one of the handful of Hungarians to have survived the transition. 
The rest had been weeded out. Half the number of employees now pro- 
duced 30 times the product! Property relations may be diffuse, but the 
consequences are not. Neoclassical sociologists may celebrate indetermi- 
nacy and uncertain futures, but for most this simply means insecurity. 
Capitalism may be made without capitalists but certainly not without 
workers—a small fraction upgraded, the majority disconsolate and de- 
graded. To my fellow furnacemen, capitalism was but the immiseration 
of socialism. It is not clear whether postsocialist capitalism is different 
from any other, at least from the perspective of those who shoulder it. 

As these examples reveal, not only communism but also the forces of 
globalization haunt the present. Stark and Bruszt write how the neoliberal 
programs sowed disillusion, shifted the prevailing winds from West to 
East, from market panaceas to the elixir of the state. They claim that 
both models were “decoupled” from the Central European reality—a re- 
ality of deliberative networks that were neither markets nor hierarchies. 
Stark and Bruszt are right to be skeptical of the inexorable power of 
globalization, but “decoupling” becomes the conceptual excuse for ignor- 
ing global factors altogether. Stark and Bruszt overlook the possibility 
that “recombinant property” was as much a strategic hedge against com- 
petition from or expropriation by international capital as it was a function 
of the balance of forces at the time of extrication from state socialism. 
Similarly, as Eyal et al.’s data show, “managerialism” is now in retreat 
before the encroachment of foreign capital. 

This is where the project of “comparative capitalisms” meets its greatest 
sociological challenge. Russia and Hungary may diverge in remarkable 
ways, but that divergence is as much a product of their differential in- 
sertion into what is a singular world capitalist system as it is of their 
communist origins. Or better, it is a product of the way global capitalism 
combines with antecedent forms of production as these undergo market 
transition. It is difficult to talk of independent national or regional cap- 
italisms—as is implied in “comparing capitalisms”—when the global order 
is so interconnected. We need to understand how the global, whether 
through supranational institutions, transnational connections, or post- 
national discourses, has mediated effects on what has come to be called 
the “local.” 

Just as ethnography offers a nuanced understanding of class constraints 
in the present, so it also offers insight into the significance of the global. 
Lynne Haney (2000), for example, has shown how the (neo)liberal model 
of welfare was latched onto by Hungarian state managers and even so- 
ciologists for the very real material benefits it offered them. Benefits to 
the bureaucratic class, however, came at the expense of welfare clients 
who in turn tried to reclaim their lost assistance by appealing to the 
socialist universalism of the past. Like a lightning rod, the state transmitted 
global pressures, creating a devastated population of women. But 
the global can also strike outside the purview of the state. Zsuzsa Gille 
(2000) shows that Hungary’s flimsy postcommunist democracy put up 
little resistance to multinational incineration capital that would make 
Hungary a backyard for dumping Western waste. The project had popular 
support among villagers who competed for economic crumbs from burnt 
ashes. They were located next to toxic waste that had been deposited by 
Hungary’s biggest chemical factory. For many socialist years, this enter- 
prise had been bribed to produce what was too dangerous to produce in 
Austria. Those rural populations excluded from incineration profits were 
easily mobilized by the Greens to oppose the project, but with little effect. 
Contestation and negotiation there may have been, but the results only 
augured badly for Hungary’s rural population. 

These ethnographic studies offer a postsocialist antidote to neoclassical 
optimism, recognizing the debilitating constraints of historical past and 
global present even as they also recognize ingenuity and innovation. As 
the few manage to reassert their control over property and profits, or- 
ganizational and cultural capital, so the majority stave off increased in- 
security and dispossession. The ethnographic eye exposes the seamy side 
of capitalism—means testing, incineration, and unemployment. As the 
joke goes, Soviet teaching may have been wrong about communism but 
it was right about capitalism. Of course, Western ideologues have their 
responses: the liberal reforms were introduced too slowly (or too quickly), 
Stalinist managers sabotaged (or failed to comprehend) the market tran- 
sition, the people are not ready for capitalism, they are too corrupted by 
communism, and so on. In these perspectives the problem lies with the 
executors of the vision rather than with the vision itself. Just as devel- 
opment theory of the 1960s, which also blamed the victims for their 
suffering, gave rise to dependency theory and then to postcolonial theory, 
so we might expect neoclassical sociology to inspire new postsocialist 
theorizing that moves from the limits of global capitalism to the contes- 
tation of Western transplants by indigenous visions and alternatives. 


THE ROAD TO POSTSOCIALIST THEORY 


Central European intellectuals have found their own ways of coming to 
terms with the failure of postcommunist capitalism. According to Eyal et 
al., these intellectuals of the second Bildungsbürgertum also blame the 
victims of socialism for collaborating—implicitly or explicitly—with the 
corrupt old regime. In order to atone for the past, to redeem their sin- 
fulness, the people of Central Europe need to be more diligent in their 
sacrifice, in their confessions, and in their rituals of purification. Those 
evil habituses formed under socialism still need to be purged. But for how 
long will intellectuals make the past guilty for the violation of the present, 
attribute the pathologies of contemporary capitalism to its socialist 
origins? 

Eyal et al.’s own origin-driven theory dovetails well with this denunciation 
of the past, but it is not the only possible interpretation of the 
contemporary plight. Eyal et al. might have drawn lessons from their 
own account of the demise of the first Bildungsbürgertum—the way Central 
European intellectuals in the early 20th century turned away from 
the capitalist project either to the radical right or to the radical left. Today 
we might well ask, How long will it be before postcommunist intellectuals 
reject those Western prescriptions they had so avidly embraced—liberal 
democracy and free markets? How long will it be before neoclassical 
theory, which condemns socialism and exonerates capitalism, gives way 
to postsocialist theory, which takes a more critical approach to capitalism 
in order to rescue the positive potential of socialism? 

Postsocialist theory could follow postcolonial theory. Just as disillusion 
with “national independence” led postcolonial theory to reject the very 
goals of liberation as themselves too tainted by the oppressor’s ideology, 
so disillusion could lead postsocialist intellectuals to contemplate alter- 
natives to the imported Western models. Just as postcolonial theory turned 
to subaltern studies and the search for opportunities and visions eclipsed 
in the colonial or even precolonial world, so postsocialist theory will per- 
haps exhume alternatives that were rapidly closed off when communism 
began to teeter.'? Postsocialist theory might well return to the subterranean 
alternatives that the youthful Stark and Szelényi did so much to un- 
cover—be they the different market socialisms of the Hungarian second 
economy or the mobilized society of Polish Solidarity. The standpoint of 
such postsocialist critique is not some externally fabricated blueprint but 
a concrete imagination that was not brought to complete fruition—what 
Erik Wright, for example, has called a real utopia (see, e.g., Wright 2000). 

It is probably too soon to revisit state socialism and the possibilities 
that were never allowed to mature. Nonetheless, in time there will be a 
revisionist history that, while not denying communist horrors, will rec- 
ognize socialism's potentialities. Capitalism's early history was no less 
horrific than communism’s, but it managed to cultivate its potentialities 
and handle its contradictions by reconstituting itself. Socialism did not 
have such a chance—for world historic reasons rather than internal 
limits—to refashion itself before it was overrun. Capitalism had the op- 
portunity to do what socialism never succeeded in doing, to contain its 
barbaric tendencies—or, more precisely in capitalism’s case, to export 
them to naïve and expectant nations, most recently those freed from the 
communist vice. The export package has not been the 20th-century mixed 
economy of advanced capitalism but an early 19th-century market uto- 
pianism that today implants a dependent development. After all, how 
many advanced capitalisms can there be in the world? Neoclassical so- 


1 I am thinking here of the subaltern studies associated with such distinguished social 
historians of the Indian subcontinent as Ranajit Guha, Gayatri Chakravorty Spivak, 
Partha Chatterjee, and Dipesh Chakrabarty. In the context of the history of socialism 
Linda Fuller’s (1992, 1999) excavations of the realities and potentialities of working- 
class politics in Cuba and East Germany are exemplary. 

ciology can celebrate a plurality of capitalisms, but they may turn out to 
be no different from core and periphery, development and underdevel- 
opment. Plural origins easily dissolve into a singular world capitalism. 
Neoclassical optimism will give way to postsocialist critique that will 
assimilate not only the lessons of the socialist experiment but also those 
of the capitalist transition. The history of the second Bildungsbürgertum 
has yet to play itself out. 


Michael Burawoy offers three thought-provoking criticisms of our book: 
(1) that we abandon class analysis, (2) that we do not suggest an alternative 
to the present capitalist order, and, therefore, (3) that our “neoclassical 
sociology” abandons the critical vision of classical sociology. 

At this abstract level, we plead not guilty to all three charges. First, 
instead of abandoning class analysis, our book offers a comprehensive 
theory of class structure in state socialism and postcommunism. True, we 
pay far less attention to the working class than Burawoy finds acceptable, 
but this is because we analyze a situation in which the working class is 
far from fully formed: we do not assume that classes, like Pallas Athena, 
issue forth fully armed from Zeus’s head. Second, rather than joining the 
chorus heralding the end of history, we think our book injects historical 
openness into the analysis of postcommunism. We do not assume one 
single capitalism as the last station of history. Instead, we argue for a 
research agenda framed in terms of “comparative capitalisms.” Such an 
agenda addresses the diversity of market economies without ordering 
these forms into a single hierarchy from “advanced” to “backward” or 
from “central” to “peripheral.” Finally, we do not contrast different cap- 
italisms with a utopia, “concrete” or otherwise, but we do approach them 
“critically,” with a measure of Socratic irony. 

Moving beyond these abstractions, we reformulate Burawoy’s criticisms 
empirically, and ask the following questions: 

1. Is interclass struggle the central cause of social change in the transformation 
from communism to capitalism? 

2. Using social scientific tools, can we identify a noncapitalist (or to 
be less shy about it, a socialist) alternative to the “actually existing” 
socioeconomic systems of the postcommunist world? 

3. Is it necessary to have a vantage point “outside” a system to crit- 
ically analyze its mechanisms and dynamics? 

Burawoy answers these questions in the affirmative; our answer to each 
is an emphatic no. 


CLASS STRUGGLE, SOCIAL CHANGE, AND THE TRANSFORMATION 
FROM COMMUNISM TO CAPITALISM 


Burawoy faults us for not studying the working class, but he does not 
explain why he thinks we should study class at all. The best formulation 
we find in his review is “Capitalism may be made without capitalists, but 
certainly not without workers.” We interpret this to mean that to study 
workers is to identify an internal contradiction of capitalism, at the point 
of production, where the disconnected-yet-concerted actions of workers 
can shape the dynamic of capital accumulation. We do not wish to dispute 
the merits of this approach. Instead, we argue that there is another, no 
less significant reason why sociologists study classes, especially the work- 
ing class. Sociologists study classes because they are interested in collective 
actors with the potential to make social change. And, we contend, in the 
present historical moment in postcommunist Eastern Europe, the question 
of collective agency is far more significant than the question of the internal 
limits of capitalism. Indeed, in his rush to denounce us as apologists who 
“forget class,” Burawoy misses the greatest question of the postcommunist 
transition: namely, where is the working-class-cum-collective-actor we are 
supposed to study? At present, there is nothing but a demobilized, disorganized 
mass of workers! We are not looking for “revolutionary consciousness”—“trade 
union mentality” would do, but it is nowhere in 
evidence. 

To analyze this striking feature of postcommunist class structure, we 
begin in the communist period, arguing that interclass struggle was not 
a major force of historical change. Socialist societies were based on “rank” 
rather than “class,” and power and privilege were grounded in the pos- 
session of “political capital” rather than economic wealth. As a result, 
class formation was not very far advanced and interclass struggle was 
relatively insignificant. There were some instances of collective working- 
class action—Solidarity in Poland, for example—but these were short- 
lived and never produced a fully formed class. By the time socialism 
broke down, the socialist working class had already disintegrated and lost 
whatever modest capacity it had for collective action. In this, Burawoy 
reads our analysis’ correctly: the workers “became . . . spectators of the 
transformation.” At the top of the social hierarchy, however, the picture 
was more complex. Socialism, especially in its later years, was disrupted 
by periodic attempts at class formation by the intelligentsia and began to 
develop into a “dual stratification order” based on rank and class prin- 
ciples. In this context, the central dynamic leading to the fall of socialism 
was not interclass struggle, but intraclass struggle over the making of a 
new dominant class. 

Of course, socialism’s breakdown was a complex process, triggered in 
part by global changes such as the technological revolution and a new 
wave of military competition. The story of internal crisis, however, is one 
in which resistance from below played a relatively small part. Socialism 
was not brought down by the “working class.” Rather, the old bureaucracy 
was toppled by an alliance of reform-communist technocrats and the 
liberal intelligentsia. In making this argument, we disagree strenuously 
with Burawoy’s claim that “socialism did not have a chance—for world 
historic reasons rather than internal limits—to refashion itself before it 
was overrun.” For 70 years socialists experimented with various schemes 
from Stalinism and Maoism to reform communism and Yugoslavian self- 
management. Socialism was not “overrun”; it disintegrated because of 
internal contradictions, loss of legitimacy, and economic inefficiency. 

The transition from state socialism to market capitalism was one of the 
most traumatic events in recent human history, but despite the suffering 
and degradation that attended it, society has remained silent. Workers 
and peasants engage in ingenious individual coping strategies, but they 
rarely act collectively. There is not a single instance in the decade-long 
history of postcommunism when the working class pursued collective 
action for a noncapitalist alternative. Burawoy misses this astonishing 
historical fact entirely. We think it is the great puzzle of the postcommunist 
transformation, and our book attempts to solve it. 

Our thesis is that a domestic, propertied bourgeoisie has been slow to 
develop in postcommunist Eastern Europe, and in its absence, the intel- 
ligentsia has constituted itself as a cultural bourgeoisie and undertaken 
the building of market capitalism. Hence our title: Making Capitalism 
without Capitalists. The emerging social system is one in which the main 
source of power and privilege is neither property nor political capital but 
cultural capital. This proposition challenges the received wisdom that 
former nomenklatura members converted their privileges into economic 
resources and became a new propertied bourgeoisie. In fact, we present 
evidence to suggest that a large part of the former nomenklatura has been 
downwardly mobile. It is the middle ranks of the former managerial elite 
who have adopted the most successful strategy. Instead of stealing debt- 
ridden, unprofitable state property, middle-level managers broker deals 
to sell this property to foreign investors and are appointed as managers 
in multinational firms at the same time; this is the strategy of the “com- 
pradore intelligentsia.” 

Our thesis also explains how the working class has been demobilized 
in postcommunism. In a social order dominated by cultural capital, when 
the intelligentsia undertakes to build capitalism “from above,” working-class
formation is hindered. Where the intelligentsia once provided leadership,
it now abandons the workers and elaborates a particularly insidious
justification of capitalism. Moreover, because this cultural bourgeoisie
is an “invisible” foe and because there is no clearly visible propertied 
bourgeoisie, the workers lack a well-defined enemy—an important spur 
to collective action. Finally, we observe that workers in formerly socialist 
countries, unlike some Western academics, know from their experience 
that socialism offers no livable alternative. Thus, another precondition of 
collective action is missing: a compelling vision of an alternative future. 

We observe that the case studies Burawoy cites actually contribute to
our argument, since none of them provide evidence of collective action 
by the working class. Two studies document peasant resistance to decol- 
lectivization, but Burawoy fails to mention that these same peasants re- 
sisted collectivization vehemently only a few decades earlier. If there is 
collective action here, it is not on behalf of collective farming, but to block 
the appropriation of agrarian property by former cooperative management 
until it is possible to secure the conditions for family farming. Citing 
another work on collective protest against welfare reform, Burawoy fails 
to observe that the majority of protesters were women acting as mothers, 
not as workers. While we agree that neoliberal welfare reform was mis- 
conceived, we argue that its socialist alternative was, if possible, even 
worse. The decade before the fall of communism witnessed one of the 
sharpest declines in life expectancy in demographic history, so it is not 
surprising that there was no concerted working-class reaction to welfare 
reform. Similarly, the study of Hungarian workers teaming with the 
Greens to fight the dumping of Western waste certainly documents mes- 
merizing local action. Nationally, however, the Greens failed to get enough 
votes to secure even one parliamentary seat. This is amazing when one 
recalls that the Greens are heirs to the mass-based environmental move- 
ment of the 1980s. Finally, we observe that even Burawoy’s own stories 
provide evidence that, despite grim macroeconomic conditions, ordinary 
Hungarian workers pursue individual survival strategies rather than col- 
lective ones. Taken together, these ethnographic accounts document the 
demobilization of the working class: the loss of solidarity, the failure of
collective organization, and the absence of an attractive alternative. 
Burawoy has some nice things to say about the earlier work of one of 
our number, and his generosity is appreciated. However, he grossly ex- 
aggerates the “epistemological break” between The Intellectuals on the 
Road to Class Power (1979), Socialist Entrepreneurs (1988), and this latest 
book. Making Capitalism without Capitalists is the conclusion of a trilogy 
and constitutes, in essence, the third installment in a line of thought and 
research that has unfolded over two decades. In the first volume, at the 
height of reform communism, Konrád and Szelényi considered whether 
the intelligentsia would make a historic compromise with the bureaucracy, 
to “rationalize” state socialism and to constitute itself as a new dominant 
class. In Socialist Entrepreneurs, however, Szelényi offered an autocritical
acknowledgement that by the late 1970s this “class project” had failed. 
Rather than making concessions to the intellectuals, the bureaucracy per- 
mitted the emergence of socialist entrepreneurs, and in this way, demo- 
bilized the working class and kept the intellectuals out of power. In Mak- 
ing Capitalism without Capitalists, we revise this autocritique and close 
the circle. We now understand that the intelligentsia was victorious in its 
struggle against the old-line bureaucrats, but not in the way anticipated 
by Konrád and Szelényi. Instead of pursuing the project of a rationalized, 
humanized socialism, intellectuals now act as a “cultural bourgeoisie.” 
Their new project is building “civil society,” liberal democracy, and market 
capitalism. There are other continuities among the three books. Socialism 
was always depicted as a dual stratification system in which the logic of 
class never overruled the logic of rank. Thus, the terminology of “elites” 
and “intraclass struggle” is not new: the early work used the term “ruling 
elite” (or “ruling estate” in the German and Hungarian editions) to identify 
the main enemy of the intelligentsia’s “new class” project. The earlier 
work also contained an embryonic theory of social structure, now pre- 
sented fully in Making Capitalism without Capitalists. 

So, have we “abandoned class” as Burawoy contends? No. First, we 
never argued that socialist society was class stratified. Classes were an- 
alyzed in statu nascendi, that is, in the making. Second, this latest book 
continues the same mode of analysis because we diagnose postcommunism 
as a transitional form, on its way to becoming a class society. There is 
very little to indicate that interclass struggles played a major role in the 
transition, but there is a great deal of evidence that intraclass struggles
over the making of the new dominant class have been central to the 
transformation. In short, it is struggles to make classes that are the decisive
battles in postcommunist Eastern Europe. 


CAN WE IDENTIFY AN ALTERNATIVE TO CAPITALISM IN THE 
POSTCOMMUNIST WORLD? 


The central assumption in our book is that it is not useful for social 
research to posit “capitalism” as a unitary system or the final destination 
of history. It may once have been fruitful to compare the “logics” of state 
socialism and capitalism, but since 1989 it seems uninteresting to preserve 
the dichotomy simply for its own sake. With the collapse of Cold War 
binaries, it becomes possible to see how diverse this putative “unitary 
system” really is, indeed, how diverse it always has been. Although market 
economies are everywhere defined by private property and integrated by 
price-regulated markets, it is increasingly obvious that there are differences
in institutional arrangements and class relations across capitalist
societies that are deeply consequential for those who inhabit them. 

It is for this reason, and with some degree of immodesty, that we suggest 
the term “neoclassical sociology.” In this, we hark back to Marx, Weber, 
and Durkheim and classical sociology’s central concern with the historical
process of transition to capitalism. Why does capitalism emerge? What 
forces propel or obstruct the transition to capitalism? What kind of a 
society is created by the transition? These were the big questions of clas- 
sical sociology, and we think we should return to them. The difference 
between our agenda and that of the classical sociologists, however, is that 
they tended to see capitalism as a unitary system—as a single (and for 
some, final) destination toward which all societies were converging. This 
cannot be true for neoclassical sociologists at the dawn of the 21st century. 
The idea of a single capitalist logic does not provide any leverage to 
analyze the diverse world we now confront. Except for Cuba and North 
Korea, the world is capitalist, and therefore, we argue, neoclassical so- 
ciology must engage an agenda of “comparative capitalisms.” 

Precisely for this reason (and despite what Burawoy says) neoclassical 
sociology is not akin to neoclassical economics. Neoclassical economics 
(like Burawoy himself) does operate with the notion of a single capitalist 
logic, while neoclassical sociology formulates a wide-ranging comparative 
empirical agenda. In this, we admit, our proposal for neoclassical sociology 
is not radically original. Institutionalist economic sociology has explored 
a similar research program for some time. The term “comparative capi- 
talisms” comes from Neil Fligstein, and Bruszt and Stark’s book derives 
from this tradition of economic sociology, as do the works of Andrew 
Walder and Victor Nee, who analyze the unique capitalist forms emerging 
in China. “Comparative capitalisms” also encompasses research on welfare 
state regimes by scholars like Gösta Esping-Andersen and Bruce Western,
the story of Japan’s particular capitalist development in research by Chal- 
mers Johnson and Ronald Dore, work on Latin America by Peter Evans, 
Juan Linz, and Philippe Schmitter, the burgeoning empirical literature
on globalization by analysts such as Arjun Appadurai and Neil Fligstein, 
and many more examples that we have no space to mention. Of course, 
Max Weber was the first to research capitalism comparatively, even 
though he concluded (classically) that ancient capitalism and Prussian 
capitalism were nonviable dead ends. In light of this collective work, then, 
we should curb our immodesty and claim only that our contribution to 
the idea of neoclassical sociology is to recommend a broader research 
program, which considers not only economic organization but culture, 
ideology, and, in particular, class structure. 

Burawoy offers two criticisms of this program. First, he suggests that 
a single iron logic of capitalism rules across the globe. “Russia and Hungary
may diverge in remarkable ways but that divergence is . . . a product
of their insertion into what is a singular world capitalist system.” In
other words, Burawoy contends, the diversity of destinations we purport 
to study are only the difference between core and periphery. We disagree 
profoundly. Ours is a globalizing world, but not a single global capitalist 
system. How homogenizing globalization is—how diverse national, re- 
gional, and organizational responses to its challenges are—is an empirical 
question. We pursue this question in our book, analyzing the importance
of foreign capital to explain the delayed development of a domestic prop- 
ertied bourgeoisie. We find that former communist cadres did not steal 
state property, but preferred to be managers in the multinational firms 
that invested in East European economies. Indeed, the best indicator of 
a successful transition, so far, has been a degree of openness to interna- 
tional capital markets. Countries with more foreign investment are also 
the countries where the “cultural bourgeoisie” is the strongest—Hungary, 
Poland, and to a lesser extent the Czech Republic—and they are now 
undergoing rapid economic expansion. In contrast, countries that engage 
in autarkic policies and do not attract foreign investors—which are also 
the countries where the old nomenklatura is thriving (Russia, Ukraine, 
Bulgaria)—are still in the grip of a postcommunist transitional depression. 
These outcomes, which are predicted in our book, are precisely the op- 
posite of those invoked by Burawoy’s world-system theory, with its em- 
phasis on the benefits of autarky and buffers from world markets. 

Burawoy’s second point is that we do not consider socialist alternatives 
to the hegemonic capitalist order, and he offers his own version of “post- 
socialist theory”: “How long will it be before postcommunist intellectuals 
reject those Western prescriptions . . . liberal democracy and free markets. 
. . . Disillusion[ment] could lead postsocialist intellectuals to contemplate 
alternatives to the imported Western models. . . . It is probably too soon 
to revisit state socialism and the possibilities that were never allowed to 
mature. . . . In time there will be a revisionist history, that . . . will recognize
socialism’s potentialities.” For failing to present a socialist alternative, we 
admit we are guilty as charged. Of course Burawoy does not provide any 
clear alternative either, nor does he suggest how a social scientist might 
study such an alternative. We do not know when or if postcommunist 
intellectuals will reject liberal democracy and free markets, and we suspect 
Burawoy does not either. We do not even know whether they should
reject liberal democracy. We are certain though, that even if a few intel- 
lectuals reread Marx and Trotsky, rethink the lessons of state socialism, 
and devise a new socialist “real utopia,” not much will change about 
postcommunist societies. Should the efforts of such intellectuals be taken 
as evidence that there is such a utopia, such an alternative? We think 
not. Alternatives exist not in theorists’ minds but, as Karl Marx taught 
us, in the actions of classes, collective actors. Neither we, nor Burawoy, 
can point to any such noncapitalist alternative materializing in collective 
action in contemporary Eastern Europe. 


IS IT NECESSARY TO HAVE A VANTAGE POINT “OUTSIDE” A 
SYSTEM TO CRITICALLY ANALYZE IT? 


We were taken off guard by the accusation that our analysis was uncritical 
and that we have joined the triumphalists celebrating “capitalism as the 
end of history.” We thought we wrote a vitriolic critique of the practices 
and ideologies of the postcommunist power elite. Thinking further, we
realized that we simply disagree with Burawoy about what constitutes 
critique. Burawoy insists that “comparing capitalism with capitalism” is 
merely apologetics. He equates critical analysis with positing a “real uto- 
pia” outside capitalism. He also accuses us of abandoning our earlier 
critical approach. He is wrong on both counts. 

Our practice has always been “immanent critique,” and we are suspi- 
cious of critique that is “teleological.” Thus, in Konrád and Szelényi’s 
earlier analysis of state socialism, the critique was not formulated from 
the “external” perspective of market capitalism but explored the field of 
possibilities inside “actually existing” reform communism. Similarly, in 
this book, we offer an immanent critique of capitalism without opposing 
it to a more desirable form of “actually existing socialism” or some “real 
utopia.” It is sufficient to demonstrate, as Foucault does, that the phe- 
nomena under investigation are arbitrary, without proving they must be 
replaced by a more rational form. In this way, by analyzing capitalism 
as a diverse set of alternative destinations, each pregnant with its own 
conflicts, each originating in contingent circumstances, we supply the ac- 
tors with a broader and more open sketch of the field where they can act 
upon their interests. After all, solutions to the social problems of postcom- 
munism can only come from the critical imaginations of those who live 
under the social conditions analyzed, not from the analyst measuring 
reality against an imagined alternative. In this, we think the kind of
teleological critique Burawoy advocates fails to submit the role of the 
theorist to critical scrutiny; it falls short of a critical sociology of intel- 
lectuals. When Burawoy calls for a “revisionist historiography”—when 
he recommends revisiting the history of state socialism to uncover its 
potentialities and to imagine an alternative to liberal democracy and free 
markets—what is this if not an apologetics for state socialism? 

Immanent critique sketches a whole range of alternatives but does not 
recommend or celebrate any particular one. Instead, it offers an ironic 
view of history to provoke doubt in the minds of those who believe they 
know the truth, who think they know what is desirable or inevitable. 
True, irony and doubt do not offer quick fixes for the problems of the 
world, but they are likely to make us more reflexive about who we are 
and what we do. Can social science achieve much more? 


ONE WAY?


While we were writing our book, Postsocialist Pathways, during the mid- 
1990s, a friend in Budapest told us about a board game he had played 
as a child during the socialist period. Prior to the Second World War, 
Hungarians had played Monopoly, which they knew as Kapitaly. But the 
competitive game of capitalism was banned by communist authorities, 
who substituted another board game, Gazdálkodj Okosan! or “Economize
Wisely.” In this goulash communist version of political correctness the 
goal was to get a job, open a savings account, and acquire and furnish 
an apartment. Our friend was too young to have had a Kapitaly board, 
but his older cousins from another part of the country knew the banned 
game and taught him the basic rules. You did not need to be a nine-year- 
old dissident to see that Monopoly was the more exciting game. And so 
they turned over the socialist board game, drew out the Kapitaly playing 
field from Start to Boardwalk on the reverse side, and began to play 
Monopoly—using the cards and pieces from Economize Wisely. But with 
the rules only intermittently regulated by the older cousins, the bricolaged 
game developed its own dynamics, stimulated by the cards and pieces 
from the “other side.” Why, for example, be satisfied with simple houses 
and hotels when you could have furniture as well? And under what 
configurations of play would a Prize of Socialist Labor be grounds for 
releasing you from or sending you to jail? 

The notion of playing capitalism with noncapitalist pieces strikes us as 
an apt metaphor for the postsocialist condition. The political upheavals 
of 1989 in Eastern Europe and 1991 in Russia turned the world upside 
down. Assuming that they were “starting from scratch,” Western advisors
from the International Monetary Fund (IMF) and university-based in- 
stitutes issued instructions for new “rules of the game.” But the ruins of 
communism were not a tabula rasa, and so the new hybrid game was 
played with institutions cobbled together partly from remnants of the past 
that, by limiting some moves and facilitating other strategies, gave rise 
to a bricolage of multiple social logics. If from these coexisting and over- 
lapping principles they are building a distinctively postsocialist capitalism, 
they share with all modern societies a common feature that the social 
fabric is woven with multiple, discrepant systems of value (Stark 1996, 
p. 995; 2000).

Neoliberals, whose free market doctrines guide the IMF, would not be 
fond of this metaphor. For them, the fall of communism is nothing more 
or less than the triumph of the market. Capitalism has a single logic, 
pervasive now around the globe and increasingly throughout every sphere 
of society. The notion that the citizens of poorer countries might seek to 
chart distinctive paths to development is folly at best. There is no viable 
Third Way, much less a fourth or fifth, but only a single path, one best 
way. 

Michael Burawoy’s review essay suggests that he would also not be 
fond of this metaphor and that the basis of his discomfort shares much 
with his neoliberal counterparts. Like the neoliberals, Burawoy’s capi- 
talism is a “single order” (p. 3), and his statement that “plural origins 
easily dissolve into a singular world capitalism” (p. 27) could easily have 
been written for an IMF position paper. Where neoliberals know the one 
best way, Burawoy knows that to step to the market is to be locked into 
a one-lane, one-way street to degradation, for with them he shares the 
view that multiple paths are a delusion. No less confident that he knows 
the “underlying reality” (p. 22), he similarly dismisses efforts to understand 
the shape and dynamics of political institutions as just so much 
noise—mere “superstructure” in his technical parlance—when even di- 
vergences across cases can be explained as “the result of common un- 
derlying economic processes” (p. 23). Whereas the neoliberals argue that 
there is no alternative, Burawoy claims that there is an alternative—but 
only one, socialism. Not, of course, the failed “actually existing socialism” 
but some never-existing socialism, a transcendental “external position” (p.
13) outside history. 

We wrote Postsocialist Pathways, in part, as a critique of neoliberal 
orthodoxy just at the height of its ascendancy in academic and policy 
circles. We argued that neoliberalism was a poor guide to policy because 
its rejection of diverse types of capitalism threatened to shut the door to 
social exploration almost immediately after it had been opened. Our task 
was to develop middle-range concepts capable of translating this social 
experimentation, of understanding its problems and prospects, and of 
charting diversity in the patterns of interacting political and economic 
institutions across the countries of East Central Europe. Given the similarities
between neoliberal orthodoxy and Burawoy’s Marxist orthodoxy,
we are not surprised that he argues that these efforts were theoretically 
misplaced and politically misguided. 


NOTHING’S CHANGED? 


Because Burawoy’s starting premises are so different from ours, we will 
address his specific criticisms only after outlining two major areas of 
disagreement. 

How do we carry out the comparative analysis of capitalism after the
fall of socialism? For decades, capitalism was defined vis-à-vis socialism
and vice versa. Their systematic comparison enriched our understanding 
of both. But the world has changed, and the demise of socialism poses a 
challenge for comparative institutional analysis. Burawoy thinks other- 
wise. He insists on comparing a nonexistent socialism with a unitary model 
of capitalism, arguing that it is only with a standpoint outside capitalism 
that one can gain analytic leverage to understand its dynamics. 

By contrast, we study actually existing capitalisms. As sociologists with 
a disciplinary disposition to exploit variance, we see real analytic leverage 
in taking the diversity of capitalisms as an object of study and comparing 
capitalisms vis-à-vis each other. Taking experiences with economic and 
political liberalization in East Asia and Latin America as the comparative 
backdrop, we identified the distinctive starting point of the East European 
transformations as the simultaneous extension of property rights and cit- 
izenship rights. That is, what sets the postsocialist transformations apart 
is that they created a class of property holders at the same time that they 
extended political rights to the propertyless. To study whether and how 
these dynamics were leading to a distinctively East European variant of 
capitalism, we undertook the task of explaining variation within Eastern 
Europe. 

What is a critical sociology of capitalism after the fall of socialism? For 
decades, capitalism was criticized from the standpoint of socialism, and 
Marxism (in one variant or another) was synonymous with critical soci- 
ology. The economic failures and authoritarian features of real socialism, 
together with criticisms of capitalism from the standpoints of race, gender, 
sexuality, the ecology, and other perspectives, broke that monopoly. The 
demise of socialism offers an opportunity for critical engagement released 
from the old dichotomies. Burawoy thinks otherwise. Before, during, and 
after socialism, nothing has changed: he insists that the standpoint of 
class, as defined by Marxist criteria, is the only way to do critical sociology. 
Marxists, too, can play Monopoly. 

We did not pose the question of a critical sociology in our book, but it 
would seem that if “critical sociology” is to mean anything, it must be 
critical of any a priori efforts to foreclose discussion about what, who, 
and how to be critical. We are equally critical of all the “sciences of the 
not yet” (as labeled in our book) whether they be the designer capitalism 
of the neoliberal vanguard or the imagined socialism of the neo-Marxist
vanguard. Whereas each of these understands science as confirming cer- 
tainties about received truths, we understand science as research that 
yields uncertainties by investigating how social actors explore possibilities 
(see Latour 1998). The central problem for postsocialist societies is not 
how to enact the “best” policy but how to develop institutions that are 
capable of learning—a problem ruled out by ideologues left or right be- 
cause they already know the answers and the questions. A critical task 
for sociology is to examine how configurations of political, economic, and 
social institutions vary in their capacity to facilitate or inhibit decentral- 
ized learning and social reflexivity. 


FROM THE REVENGE OF THE PAST TO PATH DEPENDENCY 


Turning to specific criticisms, Burawoy argues that our attention to the 
period of the breakup of communism in which political and economic 
institutions were reconstructed is “too arbitrary a starting point” (p. 15). 
It certainly must be if, as he urges, the task is one of “recognizing the
debilitating constraints of historical past and global present” (p. 28). But 
we started from a different premise, recognizing that history is the product 
of social actors. Burawoy’s notion of “the revenge of the past” (p. 20)
was precisely the kind of fatalism that we argued against in Postsocialist 
Pathways. Just as we did not assume that the political actors we were 
studying were condemned to blindly follow the recipes, formulas, and 
therapies of their uninformed Western advisors, neither did we assume 
that they were condemned to follow an involutionary trajectory deter- 
mined by the overwhelming weight of the past. Instead, we chose to follow 
the actors, observing what they did in the face of extraordinary uncer- 
tainties and documenting the patterns of conflict and alliance that re- 
shaped institutions. 

We found the concept of path dependence useful not because it signaled 
a generalized “history matters” but because it forced us to be rigorous 
about identifying critical junctures in which the temporality and sequenc- 
ing of particular events are consequential (Abbott 1995; Sewell 1996). But 
we were also concerned that a narrow reading of path dependence along 
the now-famous QWERTY keyboard model would overstate contingency 
by assuming that political change occurs on a blank slate (see Thelen 
[1999] for a useful discussion). In our view, innovations, especially or- 
ganizational and institutional innovations, are not de novo but occur in 
the context of an array of resources. As we indicated in Postsocialist 
Pathways, “Actors who seek to introduce change require resources to 
overcome obstacles to change. This exploitation of existing institution- 
alized resources is a principal component of the paradox that even in- 
stances of transformation are marked by path dependence” (p. 83). In our 
view, path dependency is a theory neither of determinacy nor indeterminacy
but a method for grasping the recombinant character of social
innovation (Stark 1996). 

Given our goal of tracing the path through which postsocialist political 
and economic institutions evolved, it seems far from arbitrary that we 
chose the period of breakup and reconstruction as a “starting point.” But 
if that period occupies a strategic place in our analysis, it is not the case, 
as Burawoy implies, that we ruled out any discussion of institutions and 
events prior to 1989. To the contrary, our detailed discussion of Hungarian 
developments steps back into the era of János Kádár to understand just 
why it was that compromise seemed inevitable to the actors at hand while 
confrontation seemed inevitable in Poland. Only by doing so could we 
grasp just how remarkable it was that Poland became the case of com- 
promise whereas Hungary was that of confrontation. To do so, part of 
our analysis of eventful alliances and conflicts turned on the contentious 
meanings of “1956”—not because it was the “revenge of the past” but 
precisely because the actors whom we were interviewing were using styl- 
ized reconstructions of 1956 to model the events of 1989 as they were 
taking place. Similarly, our comparative account of changes in property 
regimes in Germany, Hungary, and the Czech Republic demonstrated how 
the differing patterns of network properties were shaped by institutional 
patterns in the pre-1989 period. In this we were methodical—in the sense 
of method as well as rigor and detail. Ruling out references to a generalized 
past, our method specified that we could invoke historical events or ex- 
isting institutions only if we demonstrated how they were being used as 
resources during the period of reconstruction. Thus, contrary to Bura- 
woy’s assertion (p. 15), path dependency in our usage is not past depen- 
dency—because it is not from a generalized (and hence ahistorical, because 
uneventful) “past” that actors reenact, recognize, redefine, and recombine 
resources. 


FROM DEPENDENCY THEORY TO DEMOCRATIC THEORY 


Burawoy’s second major objection is that, in analyzing plural forms of 
capitalism, we place too much emphasis on “superstructural manifesta- 
tions” such as “political democracy” (p. 6). Whereas the neoliberals hold 
that economic issues have a prepolitical status that should not be exposed 
to political deliberations, Burawoy says that political institutions do not 
matter. Insertion into the singular world capitalism is the road to under- 
development. Combining his notion of revenge of the past with depen- 
dency theory of the 1970s (“the debilitating constraints of historical past 
and global present”), Burawoy creates a new kind of theoretical hybrid: 
the past dependista. 

But the stakes are serious. Burawoy is arguing that the kinds of struc- 
tures and processes central to our book—the presence or absence of dem- 
ocratic accountability, their distinctive forms, and their interactions with 
economic institutions—are ephemeral. They evaporate in the grand 
scheme of things where the laws of history rule. And to think that we
were so audacious as to give them equal status with that hard logic of the
underlying reality! 

Do democratic institutions matter? Burawoy asks for evidence. Perhaps 
the best place to start is with the concluding paragraph of his review 
essay. Unlike capitalism, which had the opportunity to contain (or export) 
its barbaric tendencies, Burawoy writes, “socialism did not have such a 
chance—for world historic reasons rather than internal limits—to refash- 
ion itself before it was overrun” (p. 30). But, after registering the almost 
poignant desperation in this passage (socialism would have had a radiant 
future), we have to ask: What explains the difference? Why was socialism 
unable to be remade while capitalisms flourish? Burawoy’s explanation, 
“world historic reasons,” evokes the same logic of globalization that he 
shares with the neoliberals, and his claim that capitalism managed to 
“handle its contradictions by reconstituting itself” would not be surprising 
to any economist with faith in self-correcting markets. 

There is another explanation. It involves the robustness of the very 
institutions that Burawoy dismisses: Unlike the economic institutions of 
capitalism, socialism never coexisted with the political institutions of de- 
mocracy. Capitalism (market competition, the profit motive, capital ac- 
cumulation, private property in productive assets, choose your descriptors) 
did not reconstitute itself. Its barbaric tendencies were contained by dem- 
ocratic constraints (Streeck 1997). Capitalisms (emphatically in the plural) 
thrive not simply because decentralized markets are more self-correcting 
than centralized hierarchies but also because the clash of the incongruous 
logics of market and democracy repeatedly yields new vitality. Socialism 
failed, in part, because it lacked diversity of organizations; capitalisms 
adapted because democratic institutions facilitated the organization of 
diversity. It was not the single logic of the market that triumphed but the 
heterogeneity of organizing principles. 

Do democratic institutions matter in postsocialism? For some time, the 
prevalent view has been that strong, unconstrained, insulated, political 
executives are more likely to produce effective economic reforms. We 
argued, by contrast, that institutional constraints on political executives 
could be enabling, and we referred to this process as extended account- 
ability. In addition to the vertical accountability of elections, we empha- 
sized horizontal accountability within the state: reformers who were con- 
strained by coalition partners, parliaments, and courts would be more 
likely to adopt moderate policies. Accountability would be further ex- 
tended when reform politicians had to take into account broad societal 
interests as represented by trade unions and other associations. Extended 
accountability extends the time horizon of reformers (because they must 
anticipate and take into account ex ante the social impacts of reform 
policies); and it extends the time horizon of the citizenry (because welfare 
measures and other institutional safeguards make it more likely that they 
will extend credibility to reform politicians). On the basis of our East 
Central European comparative case studies, we concluded that accountability
and authority were not in trade-off: institutionally constrained
executives were more capable of carrying out sustained reform.²

But why be concerned about accountability when immiseration is wide- 
spread throughout the region? Postsocialism has produced impoverish- 
ment, but it has not done so uniformly. During the decade, some societies 
have lost 60% of their national wealth with inequalities reaching Latin 
American levels; others have dramatically reversed course. Accountability 
matters because its variation shapes these differences. Our case study 
findings were preliminary, but a recent study (EBRD 1999), drawing on 
surveys of 24 postsocialist countries in Eastern Europe and the former 
Soviet Union, indicates that the concept of extended accountability is 
robust. With variables corresponding to our concepts, it finds an inverse 
correlation between the power of political executives and progress in ec- 
onomic reforms. Level of democracy, coalition governments, and the de- 
gree of political competition are all positively correlated with economic 
reforms (EBRD 1999, pp. 104–13). Moreover, inequality has increased far
more rapidly in countries that have less accountable political executives. 
From 1988 to 1994, for example, the percentage change in the Gini co- 
efficient (a standard measure of inequality) was 10% in reforming Hun- 
gary but almost 100% in Russia (EBRD 1999, p. 111; Hellman 1998). If 
you are interested in patterns of poverty and inequality, attention to po- 
litical institutions is not misplaced. 


CONCLUSION 


How is postsocialism a laboratory? Burawoy makes it clear that he sees 
postsocialism as a laboratory to test existing theories, and he is emphatic 
that the theoretical terrain is well mapped. In fact, it was already mapped 
almost a century ago—for his only named places are Marx, Weber, and 
Durkheim. Yes, it is the 21st century, yet only “classical” social theorists 
need apply.³

But the social transformations after the fall of communism are so mo- 
mentous that they are unlikely to leave existing theories intact for simple 


2 Because Burawoy ignores our discussion of extended accountability, his caricature
of how we might compare Hungary and Russia is entirely misleading (see Bruszt [in 
press] for a brief treatment of the Russian case). 

3 Lucky for us that we exhibited some Durkheimian influence or we would not even
merit the inapt “neoclassical” label! While we do not deny this Durkheimian influence
(e.g., the importance of deliberative associations and the enabling features of con- 
straints), if we must trace intellectual roots to the turn of the previous century, we 
probably share more with Weber and the American pragmatists. Like them, we adopt 
not an ethics of absolutes and ultimate ends but an ethics of responsibility in which 
actions are judged by their consequences (hence the importance of accountability in 
our analysis). For similar recombinations, see especially the work of the new French 
pragmatists such as Bruno Latour, Luc Boltanski, and Laurent Thevenot. 
testing. Postsocialism is a laboratory—for developing new concepts and
new theories.⁴ Instead of a game in which my Durkheim trumps your
Weber or my Marx trumps your Durkheim, we need new hybrids, with 
theoretical combinations not limited to the trinity. In our work, extended
accountability, for example, combines Peter Evans’s (1995) concept of 
extended embeddedness and Guillermo O’Donnell’s (1994) horizontal ac- 
countability. Similarly, we bring ideas from property rights theory together 
with network analysis in arguing that the structure of property has net- 
work properties in which the unit of property (as holdings) is a network 
with varying network analytic features (topology, connectivity, density, 
extensivity, etc.). And we draw on organizational ecology and complexity 
theory in arguing that adaptability is a function of the organization of
diversity, a capacity to interweave diverse evaluative principles. 

Within the firm as well as within the polity, the challenge of postso- 
cialism is the coordination of heterogeneous interests where public and 
private are mixed, the boundaries of firms are blurred, and legitimating 
principles are not bound to discrete domains. These are the problems of 
transforming economies—no less for sectors of market volatility and rapid 
technological change in the developed West than in the inchoate social 
orders of Eastern Europe and the former Soviet Union. 

If the successful Hungarian manager must be as skilled in the language 
of debt forgiveness as in the language of negotiating with a prospective 
multinational partner, the CEO of a start-up firm in biotechnology might 
well survive only if she is as adept at writing grant proposals to federal 
agencies as she is in making the pitch to prospective venture capitalists. 
We need not travel to Eastern Europe to encounter difficulties in assessing 
the value of firms, when debates about the principles for evaluating “new 
economy” stocks fill the front pages of our newspapers. For problems of 
horizontal accountability we can study processes of lateral accountability 
in the emerging collaborative structures of American firms large and small. 
We are not strangers to the problems of parsing public and private, for 
we need look no further than the complex proprietary arrangements be- 
tween private firms and public universities in the fields of computer sci- 
ence, biotechnology, new media, and engineering. And the search for a 
mutually comprehensible language across the cultures of science, politics, 
and business in the human genome project offers no less acute problems 
of public and private accountability. Postsocialism is a laboratory—not a 
place where we run experiments, but a site where social experimentation 
produces lessons applicable for our own challenges in this epoch of or- 
ganizational transformation. 


4 Too much of the work in our discipline is policing boundaries and adjudicating claims.
In the field of economic sociology, e.g., we would trade five articles that adjudicate
between rational choice, institutionalist, and organizational ecology (name your camps) 
for a good article that draws from different theories and methods to develop fresh 
conceptualizations of emerging problems. 
Postcommunist Capitalism, Culture, and
History¹


Michael D. Kennedy 
University of Michigan 


Michael Burawoy is justly renowned for his ability to pose among the 
most interesting questions and challenges for sociology. His invitation to 
postsocialist theory is no exception, but his opposition between Marxist 
and neoclassical sociology may get in the way of recognizing a critical, 
historical, and cultural sociology beyond Marx. 

In this discussion, I like Burawoy’s description of the difference between
neoclassical and classical sociology. Neoclassical sociologists Eyal, Sze- 
lényi, and Townsley (1998) and Stark and Bruszt (1998) are more inter- 
ested in exploring capitalism's possibilities through its plurality than in 
elaborating the classical concern for capitalism's systemic distinction. The 
neoclassical sociologists' interest in capitalist variety also attends insuf- 
ficiently to the global articulation of national capitalisms. Burawoy es- 
pecially uses the common concern of Marx, Weber, and Durkheim for the 
individual’s estrangement from capitalism to cast postsocialist theory’s 
critical difference with neoclassical sociology. 

Burawoy’s postsocialist theory shares a cultural disposition with the 
classics and the authors he reviews that limits sociology’s critical capac- 
ities, however. Classical sociology does not just agonize over anomie, 
alienation, and meaninglessness. Wuthnow (1987, p. 27) also explains how 
this sociology treats culture as a “reflection” of something more real, which, 
through the “strategic application of skeptical scholarly knowledge,” can 
conquer “self-estrangement primarily by extending its knowledge of the 
world.” While Eyal et al. (1998) and Stark and Bruszt (1998) may not 
share Burawoy’s normative concerns, they are closer to his cultural dis- 
position, and to that of the classical tradition, than what some significant 
currents in critical social theory would recommend. 

Calhoun (1995, p. 11) suggests that critical sociology might be organized 
around “theory that is self-conscious about its historicity, its place in di- 
alogue and among cultures, its irreducibility to facts, and its engagement 
in the practical world.” The starting point for any sociology, Pierre Bourdieu
(1990, p. 177) argues, should be its own sociology: “All the propositions


1 Direct correspondence to Michael Kennedy, International Institute, 1080 South
University, Suite 2660, University of Michigan, Ann Arbor, Michigan 48109.
that this science enunciates can and must be applied to the subject
who practices the science.” For these authors and others, sociology’s crit- 
ical cultural capacity depends on an extended engagement not only with 
the consciousness of history’s subjects or the symbols of their world, but 
with the constitution of the problem that motivates theory and practice. 
Burawoy’s review suggests one approach to this sociology of knowledge 
production, but his commitment to a Lakatosian scientific project in the 
Marxist tradition (1990, 2000) leads him to miss significant challenges to, 
and accomplishments in, the study of postcommunist capitalism.²


KNOWLEDGE PRODUCTION IN THE STREAM OF HISTORY 


Burawoy opens his essay with reflections on the “end of ideology” in 
American sociology both after World War II and after communism’s collapse
in Europe and Northern and Central Asia. He identifies the intellectual
transformation of two individuals with this transformation of
American scholarship he finds troubling. Szelényi and Stark are no doubt 
leaders in the study of class and economy after communism, but Bura- 
woy’s introductory frame anticipates a recurrent problem in Burawoy’s 
sociology. His theory focuses on the link between the ethnographic mo- 
ment and a systemic tendency, in this case the scholar’s biography and 
the ideological trend in American sociology. That theoretical strategy fo- 
cuses one’s gaze on the problem of the broader trend, and the actor’s 
adaptation, or resistance, to it. Postsocialist theorists resist that ideological 
trend, while the neoclassical sociologists are trapped in capitalist hegem- 
ony. Burawoy’s critique does not speak of any false consciousness, but 
he is not far away when he suggests that their optimism is “rooted more 
in faith than in reality” (p. 5). Here, however, I am afraid that Burawoy 
misses the reality of knowledge production by focusing on two individuals 
and their implication in his critique of ideology without attending to those 
mediations that structure their theory and practice. 

Burawoy misses the coauthors, collaborators, informants, institutions, 
and alternative publics shaping the work under review. These conditions 
of knowledge production are at best backstage, disappearing behind the 
tale of American sociology’s retreat from class. East Central Europe is 
posed as “fertile soil” for growing an American sociology he dislikes, rather 
than as a site where history, and scholarship, has been made. It would 
be quite unfortunate to read these two books through the prism of Amer- 
ican sociology at work. They are far more than that. The two volumes 
represent international collaboration in articulation with the practical engagement


2 The terminology varies. Stark and Bruszt (1998) and Burawoy and Verdery (1999)
refer to those societies formerly ruled by communists as postsocialist, while Eyal et
al. (1998) and others prefer postcommunist capitalism. The semantic difference is
consequential, but deserves more discussion than I can provide here. For elaboration
of this and other themes raised in this essay, see Kennedy (2002).
of East Central European and especially Hungarian intellectuals
in social change. I am sure Burawoy does not intend to reinforce
the American ethnocentrism Hughes (1961) highlighted decades ago, but 
by missing the global articulation of knowledge production in making 
communism’s collapse and postcommunist capitalism’s alternatives, he 
misses an extraordinary opportunity for extending critical sociology. 

In other contexts, Burawoy recognizes the importance of practical en- 
gagement in scholarship, most notably when he found one reason for 
Trotsky’s superior theory of revolution to rest in his explanation from 
within the stream of history (Burawoy 1989). The authors whose work 
he examines here are also writing from within the stream of history, but 
apparently they are standing in the wrong part of the stream. Burawoy 
is distressed by Stark’s and Szelényi’s apparent abandonment of class, or 
at least their loss of grounding in or acknowledgement of the working 
class or subaltern. He does not like their apparent ignorance of social 
justice and their acceptance of capitalism. But in order to magnify the 
tragedy of their implicit optimism, he winds up writing a tale of individual 
transformation whose narrative is shaped more by the ideology he cri- 
tiques than it is by the intellectual biographies he laments. 

Although Szelényi et al. (1988) focused on peasant innovation, Szelényi's 
world-reputation-making book with György Konrád (Konrád and Szelényi
1979) was not grounded in the deeply subaltern. It theorized an
intelligentsia on the route to class power. Its ancestry owed more to Hun- 
garian populism than to Marxism (Kennedy 1994), but Szelényi fit with 
the then-blooming interest in the critical sociology of intellectuals and 
knowledge (Gouldner 1979). Szelényi could thus be recognized as a part 
of Marxist inquiries (Burawoy and Skocpol 1982), even while both he and 
Gouldner were better characterized as “ridge riders” between Marxism 
and sociology. Stark’s (1982) dissertation focused on new class configu- 
rations in capitalist and state socialist societies, but his later work on labor 
markets (1986) already anticipated his subsequent manifest institution- 
alism. I recognize far more continuity than does Burawoy in the work of 
these individuals, but I think Burawoy is quite right on another score. 
Class is not where it used to be, and socialism is not the counterculture 
to capitalism it once was (Bauman 1976). The intellectual field around 
social change has shifted dramatically, in part because Eastern Europe is 
far more important to American sociology than it ever was. But one misses 
the significance of that shift by focusing on what has happened to class 
analysis and socialism. 

One of the most dramatic changes in sociology has occurred in the 
explosion of opportunity to conduct ethnographies in Eastern Europe. 
Communist rule severely restricted this kind of scholarship (Verdery 1996). 
In fact, one reason Burawoy wound up working in Hungary under com- 
munist rule was because he could not gain access to factories in Poland. 
Today, however, ethnographies are among the most important methodologies
available for addressing the character of postcommunist capitalism.
And ethnography is not limited to manufacturing sites or subaltern
communities. The ethnographies in Burawoy and Verdery (1999) also 
address courtrooms, fiscal policies, and libraries. Verdery (1996) has ar- 
gued elsewhere that one of the greatest challenges for understanding so- 
cialism and its sequel is the ethnography of the state. While some critical 
sociologists might be content to transport sensibilities of social justice 
directly to the postcommunist world, ethnographers have cautioned us 
that politics, from elections to gender, should not be read through a West- 
ern lens (Stukuls 1997, 1998; Gal and Kligman 2000). Given the hegemony 
of nationalism as a discursive formation (Calhoun 1997), especially in 
Eastern Europe, critical sociology also risks a great deal when it fails to 
address its articulation with the nation (Suny 1993; Suny and Kennedy 
1999). 


WHOSE POSTCOMMUNIST CAPITALISM? 


Both sets of authors under review and Burawoy minimize the challenge 
of this national articulation. Of course the authors and Burawoy are 
writing about different national spaces, but they treat them primarily as 
sites for comparison rather than as different grounds for knowledge pro- 
duction, much less subject formation. Consider, simply, Hungary’s cen- 
trality. This was Michael Burawoy’s first country for learning about com- 
munist rule. David Stark began in Yugoslavia, but he has worked mostly 
in Hungary since the late 1980s. Iván Szelényi is one of the leading figures 
of Hungarian social science. László Bruszt is not only the former acting 
rector of Central European University in Budapest, but was himself a 
participant in the Hungarian Round Table negotiations of 1989. If one 
considers a broader sociology of postcommunist change, one is struck by 
the degree to which Hungarian research, Hungarian data, and Hungarian 
intellectuals in diaspora or at home, shape the debate in the United States. 
János Kornai’s (1992) political economy provides the point of departure 
for many assessments of socialism. József Böröcz, Martha Lampland, Ákos
Róna-Tas, and Szonja Szelényi, all leading figures in the American sociology
of postsocialism, are grounded in Hungary.

Burawoy challenges the centrality of Hungary in discussion by intro- 
ducing Russia. But why Russia? Its size, military power, and position as 
the home of the October Revolution certainly elevate its significance, but 
it is also a place that encourages deep pessimism about capitalist futures. 
As Burawoy and Krotov (1992) argued after the Soviet Union's collapse, 
this was mercantile capitalism at best. Perhaps Burawoy is drawn to a 
more “critical approach to capitalism in order to rescue the positive po- 
tential of socialism” when capitalism is so badly done (p. 30). At the same 
time, however, Russia hardly produces much optimism about socialist 
possibilities either. While its communist party may call for a return to the 
rights of state socialism, they have little sense of how this could be 
achieved in the new economic and political environment (Cook, Orenstein, 
and Rueschemeyer 1999). But perhaps we should not be looking at socialism
in terms of feasibility. Perhaps we should look at it in terms of
resistance and look to the country best known for mobilizing opposition. 
Poland is the nation in the world once ruled by communists with the 
greatest tradition of labor movements. Solidarity, its independent self- 
governing trade union of 1980-81, was an emancipatory movement em- 
bodying all sorts of alternatives that were themselves “rapidly closed off” 
when martial law was imposed (Kennedy 1991). When we seek to exhume 
socialism, we cannot forget that it was communist state violence that 
destroyed alternatives—some of them socialist—many times, most notably 
in Hungary in 1956, in Czechoslovakia in 1968, and in Poland in 1981. 
This history matters, especially in Poland. It forms part of a tradition 
that is recalled to shape the politics of protest, the politics of parties, and 
the politics of the labor movement itself (Kennedy and Stukuls 1998; 
Ekiert and Kubik 1999). And these different histories affect the plausibility 
of theories that overlook the articulation of the nation in their argument. 
Konrád and Szelényi’s (1979) anticipation of intellectuals on the road 
to class power was grounded in Hungarian social conditions and history, 
even though its reference was to state socialism (Kennedy 1992). When 
Eyal et al. (1998) write that this class project has been realized, made 
manifest in the coalition government between the parties descended from 
the dissident cultural intelligentsia and communist party reformers, they 
elevate Hungarian experience in generalization. Poland has not (yet?) 
managed such a coalition at the national level, and only very occasionally 
at the local level. Indeed, the controversy attending the individual decision 
by the prominent Solidarity activist Barbara Labuda to work in the cab- 
inet of the very popular president Aleksander Kwasniewski, a former 
communist, suggests that Polish cultural politics remain very different 
from those in Hungary. While we might find evidence for the success of 
this fourth new class project in Poland, it is not a question that Polish 
scholarship would emphasize. The enduring significance of “us and them” 
in the popular, and sociological, imagination suggests that the communist 
past remains very much a part of the present, at least in Poland. 
Burawoy may counter that I commit the same mistake as the authors 
he reviews. I focus too much on the various origins of communism’s 
collapse and too little on the relations that are defining capitalism. After 
all, as Burawoy notes above, capitalism’s “systemic logic effectively wipes 
out origins,” which then calls for study that subordinates “the study of 
historical paths and trajectories to careful in situ analyses of actual social 
relations. . . ethnographic data that will reveal the day to day world of 
strategic action bound by changing constraints” (p. 23). In this framework, 
national articulations appear hardly relevant given the globalism of cap- 
italism and its emergent class dynamic. Marx thus can be reborn in so- 
cialism’s wake. That optimism, however, requires another kind of faith, 
and a compatible set of historical interpretations. Burawoy writes, “Capitalism’s
early history was no less horrific than communism’s but it managed
to cultivate its potentialities and handle its contradictions by reconstituting
itself. Socialism did not have such a chance—for world historic
reasons rather than internal limits—to refashion itself before it was over- 
run” (p. 30). 

In order for Burawoy to retain that optimism, he has to work within 
“Marxism as Tradition” (Burawoy 2000). For the living tree of Marxism 
to survive, its proponents must cut off the malignant limbs in hope that 
the disease has not spread (Burawoy 2000, p. 156). That, however, is the 
cultural point: he constitutes his science within a tradition of inquiry that 
relies on interpretations of history that preserve its core. Those who view 
Marxism as rotten to the roots, or who find it but one element of a critical 
intellectual tradition, could entertain the idea that “socialism did not have 
a chance” because of its own internal limits. Who, after all, overran so- 
cialism? Certainly in 1956, 1968, and 1981, communist rule overran so- 
cialism’s promise. Confronting this history invites us to consider Marx- 
ism’s “nightmare questions” (Gouldner 1980). 

Marxism can avoid the “nightmare questions” that challenge its core 
presumptions by focusing on those scientific problems, like capitalism’s 
trajectory and class relations, that it does best. And this is quite appro- 
priate for a Lakatosian approach to science (Burawoy 1990, 2000). But 
if Marxism’s core rests in the struggle for socialism itself, as Burawoy 
seems to imply, Marxism’s most critical challenge does not lie in exhuming 
socialism. If Eastern Europe matters to socialism’s viability, Marxism’s 
most critical challenge lies in addressing socialism’s cultural effect. For 
those who have not lived in a society ruled by communists, socialism can 
function as “an ontologically absent but epistemologically structuring de- 
sire” rather than a lived experience with great emotional effect and very 
different national meanings (Kennedy and Galtz 1996). Even after this 
generation passes, the connotations of communist rule will not disappear 
from memory. That memory will affect the ways in which socialism, even 
if based on resistance to capitalism, can be articulated. If socialism is an 
unnamed form of resistance to capitalism using the organizational forms 
left by communist rule, then, perhaps, ethnography unbound can suffice. 
If, however, socialism is a counterculture that requires a postsocialist 
theory elaborated by East Europeans, the meaning of communist rule 
cannot be overlooked. Engaging the history of communist rule matters 
more than Marxism as Tradition appears to emphasize. Understanding 
its implication in cultural politics is critical “to rescue the positive potential 
of socialism.” But what of another nightmare question? What if the focus 
on socialism stands in the way of critical sociology? 

To the extent that Marxism’s theoretical and scientific core remains 
identified with the struggle for socialism, it may face unnecessary barriers 
to developing a critical sociology for, and out of, Eastern Europe. To the 
extent that Marxism, or critical sociology, is about the relationship be- 
tween knowledge and practice, about the empowerment and disempow- 
erment of various social actors, and about the relationship between culture 
and power, the postsocialist world is more than fertile soil or a laboratory.
Some have argued that it is genesis itself (Jowitt 1992), although that is 
a bit more hyperbole than I can manage. But I do find the invitation to 
think with my colleagues from Eastern Europe about social and intel- 
lectual change both compelling and liberating. Calhoun’s (1995) emphasis 
on theory’s place in dialogue among cultures is therefore central to the 
mission of critical sociology, and critical for theory’s relevance to practice. 

In this sense, our authors may represent the critical tradition of sociology 
more accurately than does Burawoy. Socialism is no longer a vision of 
the future, nor much of a guide to action in Eastern Europe. Transition 
from plan to market, from dictatorship to democracy, is hegemonic as it 
never was before. Of course there are alternatives. China’s economic 
growth has been relatively spectacular, and Burawoy (1996) suggests that 
Russia should be viewed through its lens. China’s record on democracy 
and human rights hardly inspires the emancipatory hope that socialism 
did at one time for very many people, however. Our Hungarian authors 
and their colleagues have been at the heart of social change itself, and 
social change in these times is based on variations in capitalism’s config- 
uration, democracy’s institutionalization, and civil society’s mobilization. 
Bauman (1992) argued that critical intellectuals have for the most part 
become the “interpreters” for dispossessed groups, but over the last decade 
in the postcommunist world, they have played an invaluable role in leg- 
islating alternatives for better or worse. Eyal et al. (1998) and Stark and 
Bruszt (1998) were looking in the right place to develop critical sociology. 
Eyal et al. even define their sociology of social change by it. 


THE CULTURE AND PRACTICE OF POSTCOMMUNIST ELITES 


Eyal et al. (1998) proclaim a new sociology based on a theory of the fourth 
new class project of the East Central European intelligentsia, on the one 
hand, and social mobility surveys of elites, on the other. Their sense of 
social change is based explicitly on patterned attributes of economic, po- 
litical, and cultural elites at different points of time, primarily in Hungary 
but to some extent in Poland and the Czech Republic, and to a much 
lesser extent but for stark contrast, in Russia. Their sense of capitalism 
is based on the qualities of these agents and their theorized relationships 
to others, primarily within their class. The authors argue against a quite 
complex theory of political capitalism that they have refined to allow 
evaluation through elite surveys. Action primarily takes place through 
individual adaptation of elites with different forms of capital—when, for
example, socialist technocrats and anticommunist opposition from the 
days of communist rule ally to find a new place as the hegemonic man- 
agerial bloc presiding over postcommunist capitalism. 

This is much less a theory of social change than it is an account of 
changes in elite capital and habitus at different times in recent East European
history. Postcommunist capitalism’s alternative futures are understood
in terms of the dominance of different fractions of the elite—the
technocratic-managers, foreign investors, and their “comprador intellec- 
tual allies” and new entrepreneurs. These are not collective actors in the 
sense of the term most theorists of class struggle or social movements 
know. These are collective actors whose collective action can be found in 
the cumulation of individual strategies. True to Weberian sociology, the 
analyst is the one best situated to recognize classes in the identification 
of common patterns. You apparently cannot rely on their organization or 
coordinated effort to show their class formation, although others have 
offered an alternative account doing just that (e.g., Gianoplus 1999). 

Culture is quite important to Eyal et al.’s (1998) argument. The civi- 
lizing project of the East Central European intelligentsia serves as the 
spirit that travels through history to provide different versions of the 
intellectuals’ mission. Bourdieu’s invocation tempers the Hegelian im- 
pulse, as the authors (p. 17) emphasize the “dialectical interaction between 
agents (their dispositions, habits, biographies, collective memories) and 
their positions (in institutions, class relations, and networks).” The cultural 
bourgeoisie is also building the postcommunist capitalist order. The “spirit 
of postcommunism” (see also Kennedy and Gianoplus 1994) drew on the 
idea of civil society and monetarism to become the power bloc’s ideology. 
That power bloc uses three rituals—purification, sacrifice, and confes- 
sion—initially to purge postcommunists from power and later society from 
socialist pollution to build the rational economy. 

Culture is, however, also marginal to the argument. Following Bour- 
dieu, Eyal et al. (1998) treat culture as a form of capital and use the 
dominance of different types of capital to distinguish social structures. 
But this Bourdieuian notion of capital fails to recognize its mediating 
function in the constitution of actors or in the coordination of action 
(Calhoun 1995, p. 155). Consider, for instance, how the complicated bi- 
ographies one might find in life histories are understood through categories 
of actors. Eyal et al. (1998) distinguish dissidents, whose degrees of op- 
position go unmarked, from socialist technocrats and managers, all with 
different allocations of cultural capital. Their kinship ties, friendship net- 
works, early career trajectories, and personal memories during communist 
rule and its end go unmarked before their classification, whose class base 
in turn shapes the action that gets theorized. We learn about ideologies 
that reflect those class positions and about rituals that help make capi- 
talism without capitalists, but the individual strategies themselves dis- 
appear beneath class. Instead, things happen, based on global imitation 
and path dependence, that alter the value of certain types of capital. In 
their words, “Those who are able to adjust their trajectories to meet social 
change most successfully are those who possess the most diverse portfolio 
of different kinds of capital” (39). The authors do not really explain under 
what conditions the imitation or the path is chosen, but that, apparently, 
is a matter for their individual agents to decide, in a way befitting Bourdieu’s 
habitus. One might recall the words of János Kenedi (1982): “Do it yourself.” 

Indeed, the whole argument, apart from Bourdieu’s invocation, reso- 
nates with the work emerging from Hungary more than elsewhere. The 
data, while initially from Hungary, Poland, the Czech Republic, and Rus- 
sia, are mainly from Hungary in the latter years of the decade. When 
narratives, even with comparison, are elaborated, Hungary tends to provide 
the illustration (e.g., pp. 183-84). There is even a certain style to the 
argument. It reminds me of the argument that Ákos Róna-Tas (1997) 
made about the second great transformation, which was made in small 
incremental steps. I also recall József Böröcz’s (1995) description of the 
dynamics of postcommunist capitalism with “fusion,” the “creative rein- 
terpretation of old and new elements, creating substantively distinguish- 
able qualities of social experience,” in which he uses the musical repertoire 
of Béla Bartók to produce the image. Citing the Hungarian pianist Zoltán 
Kocsis, “Bartók's oeuvre is a perfect synthesis between central east Eu- 
ropean folklore and composed music in the hands of a superbly creative 
individual.” It even reminds me of Burawoy and Lukács’s (1992) argument 
about painting socialism, rather than building an organization like 
Solidarity. While certainly Polish culture also had a sense of załatwiać, 
or to arrange for things, Polish sociology under communist rule, and even 
today, still struggles with a sense of social structure connected to that past 
in which Solidarity made “us and them.” Hungarians hardly have that 
sense in their scholarship, or in their politics (Kennedy 1992). Indeed, 
Stark and Bruszt (1998) argue that this lack of a mobilized civil society 
enabled the Hungarians to leap ahead of Poland in negotiating a more 
democratic transition at the Round Table. 

Perhaps because they rely as much on rather detailed stories of networks 
that produce unfamiliar actors taking the helm of a capitalism whose type 
we hardly know, Stark and Bruszt (1998) develop a sociology of social 
change that depends more on action than on classification. Even their 
categories—recombinant property or deliberative associations—are based 
much more on a model of communication than one based on categories 
of attributes. In this sense, they are much closer to the third generation 
of cultural theory that Wuthnow (1987) describes than are Eyal et al. 
(1998). Despite the latter’s invocation of habitus, culture seems to reflect 
class in the argument’s narrative much more than it creates possibilities 
for action. Stark and Bruszt (1998), by contrast, seek the possibilities and 
identities emergent in new accounts of action (Kennedy 1999). 

Underlying both volumes, however, is a sense of social change that is 
very different from what it was when class struggle and social movements 
were our guide. Rather than making history, narratives are organized 
around adjustment to social change, with social change as the result of 
adjustment. “Making do” was certainly appropriate for describing the 
character of socialist entrepreneurs in the interstices of first and second 
economy, but is this really what elites do? This is certainly not a heroic 
disposition, but it is a sign of the times, and a reflection of the culture 
that is structuring social action. In this, Burawoy is quite right. These 
two volumes reflect the times, not so much of American sociology but of 
East Central European praxis in a global transition culture. 


A CRITICAL SOCIOLOGY OF POSTCOMMUNISM 


Reframed this way, one can appreciate anew Burawoy’s lament. It is not 
just that our authors focus on elites, but that these elites are not inspiring. 
They are only trying to create a normal society, based on some fusion of 
the past and Western examples. They are trying to make a better society 
than what they had. It is only a variation on what exists, however, and 
not an alternative modernity. But then again, they already had such an 
alternative that fell to great, if nationally and socially variable, applause 
within Eastern Europe. And of course another modernity, so far unmen- 
tioned, is not far away. 

I am struck that theories of postcommunist social change can overlook 
the wars that have accompanied it. Neither of the books under review, 
nor Burawoy himself, addresses the wars in Central Asia, the Caucasus, 
or the Balkans. Is it because they are apart from transition? Is it because 
war is a result of nationalism and not of communism’s end? All of the 
authors tend to structure their arguments within national societies en- 
gaging in relatively peaceful transition, but that is a part of the problem. 
Postcommunist capitalism is not only a national system. It is not just a 
global economic system. Transition is also a global cultural formation. 
Elite East Central European collective action is directed toward membership 
in transnational organizations, most prominently NATO and the 
European Union, not only because of Western hegemony but also because 
of a reasonable fear of political instability. Wars on postcommunism’s 
southern tier are a good inspiration for seeking the security afforded by 
membership in the West. Thus, regardless of war’s place in Durkheimian, 
Weberian, or Marxist traditions of sociology, it is absolutely central for 
thinking about the constitution of a culture that guides postcommunist 
capitalism’s making. 

Of course this alternative if barbaric modernity is hardly one that elites 
embrace. However, its existence shapes the culture in which decisions 
about alternative futures and visions of normality are cast. Within tran- 
sition culture, the West is constructed as an alternative to ethnic cleansing, 
mass rape, terrorism, and bombing civilians, and that alternative in turn 
is linked to the unresolved end to socialism and its search for a new 
legitimating principle in nationalism. NATO’s bombing of Serbia has of 
course complicated the normative calculus facing East European leaders, 
but Russia’s second war on Chechnya has not assured them either. I agree 
with Burawoy that we need to look beyond variations on capitalism in 
imagining East European alternative futures. But we also need to look 
beyond the mode of production, and toward fundamental elements of 
human security, in envisioning Eastern Europe. Indeed, against this background 
we might gain a better handle on the remarkable cultural hegemony 
of global capitalism. 

It is not just a question of the number of foreign investors and intel- 
lectual comprador bourgeoisie. Regardless of numbers, the culture of tran- 
sition has itself become hegemonic and so obvious that we hardly discuss 
why elites adjust to fit their societies into a global capitalist future. Al- 
though Stark and Bruszt (1998) note that Hungarian socialists lurched 
from their path as the public’s caretaker in the mid-1990s to the imposer 
of financial discipline, they managed this observation without explaining 
why certain postsocialist pathways seem obvious, and others appear im- 
possible. In this sense, we must attend not only to elites adapting to change, 
but also to how elites sort through the larger set of rules and resources 
that structure the adaptation. The authors all acknowledge it, sometimes 
referring to this sorting as a choice between the national path and the 
Western example. But they treat these phenomena as far more obvious 
than they should. 

As postcommunist capitalism’s elites recombine property or reconstruct 
national path and Western examples in new ways of making do, they are 
not just creating novel combinations out of otherwise coherent cultural 
structures. Instead, these cultural formations are themselves emergent and 
filled with contradictions and possibilities that imaginative elites, and 
entrepreneurial activists, exploit. Agency, Sewell (1992) reminds us, de- 
rives not only from the confrontation of habitus and structure, or inno- 
vation through networks, but through the contradictions of structures 
themselves. And that, it seems to me, is the big challenge facing critical 
sociology. 

Burawoy, in his plea for postsocialist theory, focuses on the search for 
alternatives. I, for one, would prefer to understand how postcommunist 
capitalism came to be so dominant. However, all of our authors, including 
Burawoy, assume that hegemony without explaining how it happened. 
Part of the answer lies in a refined cultural sociology that focuses on the 
constitution of the objects of inquiry and subjects of history in making 
transition across national articulations. It also depends on explaining why 
some cultural formations acquire the power that they do. Alternative 
modernities in the socialist past and in postcommunism’s wars have made 
transition culture seem like a pretty good bet, at least to those who assume 
institutional responsibility. But I suspect that the Western alternative 
would not be so compelling if the world’s commodification were not so 
overwhelming. 

I would find it extremely helpful if Burawoy and others in the Marxist 
tradition would go to their deep roots in critical realism to explain cap- 
italism’s power. Exhuming socialism may be necessary for identifying 
injustice within a Marxist problematic, but one might pursue injustice in 
other terms that can be more consequential. Marxism is, however, a nec- 
essary point of departure for understanding capitalism’s deep structure. 
It would be extraordinarily useful for those interested in understanding, 
rather than assuming, capitalism’s power if critical sociology might work 
a bit to explain the commodity and its fetish within postcommunist cap- 
italism. Hungary’s most renowned Marxist, György Lukács (1971, p. 84), 
put the question succinctly when he asked, “How far is commodity 
exchange together with its structural consequences able to influence the 
total outer and inner life of society?” Martha Lampland (1995) provides 
a critical, historical ethnographic foundation for this inquiry in her ex- 
ploration of the commodification of peasant labor within socialist Hun- 
gary. Hungary is once again at the center, with Marxist inquiry at its 
heart. But that may be where I finish my selection from the Marxist 
legacy. 

Instead of exhuming socialism, one might follow those critical traditions 
of sociology—feminism and pragmatism among others—that develop 
their scholarship in articulation with the language of actually existing 
struggles. One might even turn to a sense of critical theory that identifies 
its intervention with immanent critique. Here, freedom offers incredible 
potential. Transition culture is not just about new forms of property. Its 
normative power rests on its association with freedom. Fortunately for 
that immanent critique of global capitalism, Sen (1999) and others are 
developing a discourse around freedom that puts economic oppression, 
physical violence, and political repression, among others, at the fount of 
freedom’s limitation. Rather than ask how social conflicts reflect social- 
ism’s positive potentials, we might instead organize our inquiry around 
how freedom’s meaning might be extended to assure human security, 
broader democratic deliberation, and deeper social justice. Instead of hav- 
ing to argue against transition we might argue that freedom is what 
transition should be about. By constituting the problem of freedom’s 
extension in transition culture, a critical, historical, and cultural sociology 
could help to make a different space for inquiry and practice around 
emancipation. And that space will not reproduce the opposition between 
socialism and capitalism that helps to limit freedom to its bourgeois sense 
in transition culture. 

In Making Capitalism without Capitalists, Gil Eyal, Iván Szelényi, and 
Eleanor Townsley (1998) bridge a vast distance by linking some findings 
of a 1993 synchronic multicountry stratification survey of Central and 
Eastern Europe to a neo-Weberian, comparative-historical argument 
about social change. In Postsocialist Pathways, David Stark and László 
Bruszt (1998) embed the highlights of Stark’s earlier work on institutional 
change in the macropolitics of policy choice and the micropolitics of man- 
agerial clique action. Michael Burawoy’s essay offers a largely metalevel 
critique of those two book-length sociological reflections on social change 
after state socialism.² Szelényi, Stark, and Burawoy are leading American 
sociologists who conduct comparative-historical work on the transfor- 
mations of that part of Europe formerly controlled by the Soviet Union. 

This discussion allows us to open a more generic debate about com- 
parative-historical sociology’s models of social change. This is important 
because the comparative-historical approach offers an opportunity to con- 
sider social change in the present without the high-modernist assumption, 
burdening much of sociology, that present modernity has in effect “erased” 
(Burawoy, p. 1106) the past. It makes it possible to assess the role of 
external influences in social change and allows brisk movement among 
institutional effects of different levels of abstraction, thus promising analysis 
of a key issue of our time, the impact of global processes on the local 
and vice versa. 

1 I am grateful for comments on earlier versions by Judit Bodnár, Eric Kaldor, and 
Caleb Southworth. Direct all correspondence to József Böröcz, Department of Sociology, 
Rutgers University, 54 Joyce Kilmer Avenue, Piscataway, New Jersey 08854-8045. 
E-mail: jborocr@rc.rutgers.edu 

2 The acceptability of Burawoy’s commentary hinges on a two-part question: How 
does sociology intersect with class politics, and how should it do so? As the second 
part is a matter of taste, I will refrain from addressing it. Instead, I will focus on the 
problem of class politics, scholarship, and their separability. The North American 
reader’s sense of irony might be stimulated in knowing that in Central Europe there 
exists a similar practice of political pigeonholing—only with more complex patterns 
and much more irony. Iván Szelényi—a prominent member of the democratic opposition 
forced into political exile during the 1970s and an intellectual known to be very 
closely tied to the liberal party—has been labeled both a closet right-wing populist 
(Dupcsik 1994) and, along with Stark’s coauthor László Bruszt and other Hungarian 
sociologists, a lefty (Bozóki 2000). 

First, I address two specific difficulties that comparative-historical so- 
ciology faces as it enters the context of Central and Eastern Europe; then 
I discuss generic concerns of social change with a special emphasis on 
historicity. It is best to read my contribution in conjunction with Ákos 
Róna-Tas's (1998) poignant summary of the “path dependence” and “cap- 
itals” theories in the post-state-socialist transformation. The availability 
of that well-crafted summary allows me the luxury of not reiterating the 
essence of the two interlinked sets of arguments here. 


THE “EAST” IS WEST 

Even our geographical focus is elusive. Difficulties emerge on two levels: 
naming and grasping. 

Naming.—As the intellectual historian Larry Wolff (1994) articulates, 
the idea of “Eastern” Europe has been a prime example of disdained and 
patronized alterity at least since the Enlightenment.³ The 19th-century 
imperial-scholarly exercise of Orientalism (Said 1978) inaugurated the 
“West's” grand underlying qualitative contrast to an exoticized Other in 
the “East.” Wolff portrays an Enlightenment thought that created, pos- 
sibly even before the onset of Orientalism, another form of alterity: a 
quantitatively inferioralized sense of otherness located within the “West” 
itself. The label of a capitalized “Eastern Europe” denotes a set of societies, 
separate and unequal, that are undeniably Western in their ways but 
inferior in their achievements. Economic history suggests that the key 
dimension of that inferiority lies with those societies’ subordinate position 
in the world economy—so much so that the very concept of 
(semi)periphery was devised partly to refer to their experience during the 
emergence of capitalism in Europe (see, e.g., Wallerstein 1974, p. 103). 

Because of the sense of inferiority connoted by the label, the symbolic 
geography of the region has been subject to extended debates among social 
historians and other specialists.⁴ The label “Eastern Europe” evokes an 
ambiguity of meaning depending on which of the two notions of “east- 
ernness” is launched: either “non-” or “inferior” Europe. In this context, 
the adjective “central” often ends up serving as a mere euphemism. 

This double (qualitative and quantitative) notion of “east-west” alterity 
dominated the intellectual world of Western Europe at the birth of the 
social sciences. Much of the oeuvre of the Central European—German, 
Italian, Austrian, Polish, or Hungarian—classics of sociology can only be 


3 In keeping with the traditions of AJS, the adjectival portion of names of regions is 
capitalized. The reader is asked to read them in a nonreified manner. 

4 For a useful recent summary and review, see, e.g., Kontler (1999). 

5 Observe, e.g., the following index entry in Wallerstein (1974, p. 396): “East-central, 
see Europe, eastern.” 
understood as reactions to the striking mismatch between the streamlined 
Western theoretical imageries, interpreted as universal patterns of mo- 
dernity, and the dizzying maze of their societies’ empirical patterns that 
were supposed to be mapped on each other in this new, generic—that is, 
nonnational—science of the social form. Were sociology a truly compar- 
ative-historical social science—instead of knowledge production locked, 
as if by default, into methodological nationalism, Western solipsism, or 
both—sociologists would focus on social practices outside the narrow 
limits of West Europe and North America. The fact that, so far, main- 
stream Western sociology has failed to do so contributes greatly to its 
severe problems of relevance outside of the Western core (see, e.g., Alatas 
1995, 2001). A similar argument applies to the way in which sociology 
has sadly “retreated into the present” (Elias 1987). 

As a result, more than 10 years after the collapse of the Iron Curtain, 
the idea of a shared, global field of reflexive practice addressing the variety 
of modern capitalist experience that includes both the eastern half of 
Europe and North America, as well as other parts of the globe, is utopian. 
Eastern Europe is widely thought of, in North American and West Eu- 
ropean academia, as an external area. It is never the subject, and rarely 
even the object, of any comprehensive, comparative-historical discussion 
of the social form. This exclusion of the circumstances outside the Western 
core of the capitalist world economy renders sociology’s claims for universality 
and genericness sadly untrue.⁶ 

Had Eyal et al. linked their justification for a neoclassical sociology to 
an explicit critique of the oversimplified conceptualizations and the West- 
ern focus of our discipline—instead of positing the second coming of 
capitalism in the impure sociological laboratory called Eastern Europe—I 
suspect their call for a neoclassical sociology would have prompted an 
enthusiastic embrace from Burawoy. In that case, the analogy with neo- 
classical economics (a charge that is, in my reading, a nominalist mis- 
understanding of Eyal et al. and an exaggeration for Stark and Bruszt) 
would have also been avoided. A specifically sociological kind of neo- 
classical interest, not tainted with the influence of economics but empha- 
sizing the analytical primacy of the social, can be defined as that enterprise 
which seeks to provide more inclusive formulae for the fundamental ques- 
tions of modernity first addressed so imperfectly by the classics (for an 
attempt in this vein, see Böröcz [19976]). Most pertinent to the subject 
matter at hand are such issues as the necessity and directionality of change, 
the dialectics of change and constancy, and the contingency and dimen- 
sionality of institutional change. If, as a shorthand, we take the term 
“classic(al)” to denote a body of scholarship that cannot be avoided, a 
neoclassical approach means addressing explicitly the fundamental, formative 
concerns of such scholarship using the empirical materials of the 
present day. 

6 Contributing to a Central European unease about this traffic in ideas is the fact that 
current academic salary discrepancies between U.S. institutions and various countries 
of the eastern half of Europe (10-25 times) can be much bigger than proportions in 
per capita GDP (3-10 times). 

Grasping.—Because of the complex social-institutional patterns in 
which they have articulated structures, agents, ideologies, and practices 
of capitalism, nationhood, statehood, and identity, the societies of this 
unnameable part of Europe have always called for sociology in the com- 
parative-historical mode. The institutional variation in late-state-socialism 
is the basis of Stark and Bruszt’s justification for their choice of a strategy 
that contrasts the Central European societies’ “paths of extrication” from 
state socialism. That is, however, in apparent logical conflict with their 
refusal to conceptualize social change in generic terms, a shortcoming that 
weakens the power of Stark and Bruszt’s well-taken insistence on a com- 
parative approach to capitalisms. This comparative strategy reifies East 
European regionality and obscures the similarities of the Central Euro- 
pean processes to other experiences of social change. By insisting on the 
differences in the paths of extrication within Central and Eastern Europe, 
Stark and Bruszt implicitly problematize the usefulness of more far-rang- 
ing comparisons (e.g., the use of the Hungarian experience of the late 
1960s and the early 1980s as a theoretical key to understanding contem- 
porary China and the easy hops between Hungary and Russia in some 
of the literature on post-state-socialism, including Burawoy’s essay).⁷ Even 
more significant, this variety of state socialist social practices has the 
potential of diminishing the very raison d’être of the entire enterprise of 
post-state-socialist sociology by blurring its point of departure, the solid, 
singular image of “the” (state) socialist (communist, etc.) status quo that 
serves, in turn, as Stark and Bruszt’s main justification for comparing 
only post-state-socialist instances of social change to each other. 

As Judit Bodnár (2001) notes, the comparative-historical sociology of 
state socialism has faced a severe collinearity problem by dealing with 
societies that were state socialist, thus representing a modernist version 
of high modernity whose institutional patterns were somewhat (although 
not entirely) at odds with patterns of Western capitalist societies; histor- 
ically specific and thus culturally somewhat, although not completely, at 
variance with their Western counterparts (which were of course quite 
varied themselves); and globally located, that is, very much part of an 
overarching project of global capitalist, great-power, imperialist modern- 
ity that featured some Western societies in a prominent, core position. 

These three dimensions overlap so much that the decomposition of their 
effects is impossible by empirical methods. Furthermore, the oppressive- 
ness of political practices in these countries was foregrounded, while their 
achievements (e.g., centripetal mobility in the world-system, increases in 
collective consumption, impressive extensions of social policy, etc.), and 
the effects of historical-cultural and global-macrostructural factors were 
systematically suppressed in the “totalitarianism” perspective dominant 


7 For a balanced review, see again Róna-Tas (1998). 
in “East European” area studies until the mid-1980s. Compounding the 
difficulties is the fact that the societies of this area, although subject to 
imperial domination, have never, in contrast to much of the contemporary 
Third World, been subject to modern colonial rule.⁸ Nor have their states, 
unlike those of Western Europe, ever had any overseas possessions. 


CONTINGENT CLOSURE 


Conceptual collinearity has haunted much of the sociological study of the 
current transformation: decomposing the effects of the region’s 
(semi)peripheral position in the world economy, its peculiar sociocultural 
heritage, and its immediate state socialist past is extremely difficult. 
Clearly, Central Europe’s historical experience is both quite relevant to 
what ought to be the mainstream of sociology—theoretically inquisitive 
comparative-historical analysis—and slightly “off” from the Western 
mainstream in its morphological pattern. Here I outline three dimensions 
of a framework that could illuminate the Central European experience, 
with special attention to the historical dimension. 

My analysis can be organized around social closure, that is, the institutional 
process “whereby one group monopolizes advantages by closing 
off opportunities to another group” (Murphy 1988, p. 8).⁹ What ties the 
two books and Burawoy’s review together is precisely their treatment of 
different modalities of closure: 

1. Stark and Bruszt focus on transformations in the rules that regulate 
property, a centrally important, complex institution of closure under 
capitalism. 

2. Eyal et al. devote their analysis mainly to understanding the selection 
mechanisms that structure the placement of individuals in managerial 
positions (those in positions that exercise control over productive assets), 
appended by a treatise on “managerialism,” a macro-institutional-level 
theory. 

3. Burawoy’s essay above criticizes both works for ignoring what he 
sees as the privileged site of closure under capitalism: class relations 
viewed from the position of the exploited. 


The Past Is Now 


Some of the earliest sociological work on post-1989 social change in Cen- 
tral Europe reproduced the tendentious ahistoricity of many economic 


8 Imperial dynamics, though they display different “tensions of empire” (Cooper and 
Stoler 1997), are just as significant for the production of (semi)peripherality, exclusion, 
and oppression as are colonial practices. Elsewhere (Böröcz 1992, 1999), I have proposed 
the notion of dual dependency to capture some of the intricacies of institutional 
change in imperial relations under Stalinism and thereafter. 

9 See also Parkin (1974), Manza (1992), and Böröcz (19978). 
and political science models by transposing the overtly teleological concept 
of “transitions” (to “market” or “democracy” and sometimes both). The 
concept of “transitions” was abandoned gradually, first with such inten- 
tionally unbalanced metaphors as the “transition from socialism,” (e.g., 
Róna-Tas 1994; Hillmann 1994) echoed in Stark and Bruszt’s “paths of 
extrication” (1998), later in favor of the nonteleological language of “trans- 
formation(s)” (Róna-Tas 1997; also Stark’s multiple works) and the “un- 
certain transition” (Burawoy and Verdery 1999). “Path dependence” was 
useful in helping the analysis of social change move away from brash 
teleology: the term accounts for the idea that “the past matters.” That 
statement is only surprising, of course, if at any point we entertained the 
idea—ludicrous to comparative-historical sociology—that it may not. Yet, 
path dependence was a useful antidote to the venom of teleology in 
transition. 

Just a small step away from the ahistoricity of economic models, path 
dependence offers a very restrictive model of social change: plainly stated, 
it suggests simply that the state of an institution at any time is influenced 
(or, in more sanguine variants, it is determined) by the state of that institution 
at a preceding time.¹⁰ It argues that, once a lock-in mechanism 
is in effect, change is difficult. It says nothing about how difficult it is 
and why, whether it is equally difficult for all institutions and types of 
social organizations, how the difficulty of change is reduced or amplified, 
why change occurs at all, or how we recognize an effective, path-depen- 
dence producing lock-in mechanism when we see one (other than labeling 
examples of institutional resilience to change “path dependent” in retro- 
spect). Comparative historiography, theoretical history, historical anthro- 
pology, and even historical sociology have done much to develop sensitive 
conceptual tools to capture the many possible roles of the past in present 
social change; the absence of such considerations from this literature is 
quite remarkable. 

Read in these terms, much of David Stark’s work on post-state-so- 
cialism can be summarized as a move away from an emphasis on path 
dependence toward the contingency of social change and the examination 
of politics as a terrain in which locking in and change are acted out (cf. 
Stark 1992 with Stark and Bruszt 1998). Just as institutional economist 
János Kornai’s (1992) verdict on the economic system he calls, inter- 
changeably, “socialism” and “communism” is based on the binary contrast 
of formation theory (“capitalism” vs. “communism-socialism”), Eyal et al.’s 
argument of post-state-socialist “managerialism” considers the specific in- 
stitutional legacy of state socialism as the only theoretically significant 
variable that explains a post-state-socialist outcome (thus shielding both 
pre-state-socialist historical, cultural, or political legacies and synchronic 
external effects as possible contributing causes). In that sense, even though 


* Arthur, Ermoliev, and Kaniovski explain the economic idea of “path dependence”
with respect to choices regarding firm location (1994, p. 35).

they do not identify with path dependence, the internal structure of their
argument is reminiscent of it: What matters most in the current trans- 
formation of the object (say, post-state-socialist Hungary) is the imme- 
diately preceding state of that object (state-socialist Hungary). That is 
especially visible in Eyal et al.’s causal linking of managerial success 
during the first phase of property change to the intellectuals’ New Class 
project of a good two decades before (see Konrád and Szelényi 1974; 
Szelényi 1986-87; Böröcz 1989, 1993). This is noticeable especially if con-
trasted with Eyal et al.’s analysis of the complex legacies of Bürgertum
and Bourdieuesque-Colemanian capitals, in which they employ a much 
more complex sociohistorical toolkit than path dependence. A similar 
notion is expressed in David Stark’s suggestive metaphor of the new 
institutions being built not just on, but with the ruins of socialism (Stark 
1996), the main idea behind the suggestive concept of “recombinant prop-
erty” (Stark and Bruszt 1998, chap. 5).”

Eyal et al.’s theoretical move, of embedding the “capitals” framework
in a sociohistorical argument developed from a Weberian notion of Stand, 
evokes two generic considerations related to the analytical consequences 
of conceptual streamlining. Eyal et al. explicitly reject the moderniza- 
tionist framework: “The distinction between tradition and modernity is 
unhelpful in understanding what socialism was” (1998, p. 67). Their point 
of departure is a double equation: “Weber is a historical sociologist who 
describes two distinct logics of stratification: rank order [Stand] and class 
stratification [Klasse]” (p. 67). From this they argue that rank and class
always embed and contextualize each other, with some elements of their 
individual-based features, modeled as “capitals,” dominant. They see 
“bourgeoisness” (Bürgertum) divided into property-based (Besitzbürger-
tum) and education-based (Bildungsbürgertum) components that have, in
the “belated” part of Europe, represented not just analytical categories 
but distinct social groupings. They argue that a longue durée pattern of 
the predominance of the education-based bourgeoisie is detectable in the 
results of a comparative multisociety survey and explains the success of 
the late-state-socialist managerial elites in retaining their positions at the 
top of enterprises after 1989. 

“Rank,” however, is quite a reduced reading of Weber’s analysis of 
Stand. Depending on the theoretical intent of the analysis of social closure, 
the concept of Stand can be read with at least three different sets of 
meanings. It can be interpreted, in the context of a historicist-idiographic 
analysis, as “estate,” denoting a multiplicity of overlapping legal and social 
distinctions, such as the medieval estates, denominational groups, pro- 
fessional groups, and various constellations of ethnic distinctions, patterns 


In a subsequent, uniquely comprehensive analysis of the ownership network struc-
tures of the 500 largest Hungarian companies, Hungarian sociologist Balázs Vedres
(2000) has confirmed the existence of the conceptually crucial, and intuitively com-
pelling Starkian recombinants but found altogether six of them (one of which is an
example offered by Stark), hardly an overwhelming proportion.

of geographic “rootedness” and mobility, and linguistic practices. On the
positivist-nomothetic extreme, we find the received notion of “status” so 
predominant in the North American sociometry of stratification, which 
assumes free movement among these positions and models empirical pat- 
terns of inequality in such movement. The area proper for comparative- 
historical sociology, the conceptual midpoint between the two extremes, 
is occupied by the notion of “contingent closure”—that reading of Stand
in which new institutions are created by the mixing of elements that are 
supplied by history but in which neither the past, nor any other single 
dimension of power, overdetermines the outcome.” 

“Rank” reproduces one side of the two-sided coin of Stand. As his key 
passage on ständische Lage testifies,’ Weber’s model of Stand includes 
clear references to relational features (such as the social organization of 
marriage or even communal eating) as much as such ranking mechanisms 
as prestige and privilege, along with life conduct that can be read as both 
a network and a ranking mechanism; I offer an illustration of these in 
figure 1. In it, I added the distinction of conflict/cooperation under “net-
work mechanisms” and introduced the distinction of formal/informal
mechanisms to allow for examination of the claim, raised by a number 
of scholars, concerning the predominance of informality, especially infor- 
mal network assets, in the transformation.“ The reduction of Stand into 
“rank” makes us focus only on the right-hand side of this graph. The 
“rank” reading loses a sense of sociability and groupness that is very much 
part of the original Weberian notion." 

Bourdieu's and Coleman’s notions of capitals (as in cultural capital, 
social capital, political capital, etc.) have resonated in mainstream soci- 
ology because they can be used as replacements for the positivist notion 
of status while allowing a modicum of analytical historicity and sociability. 
The notion of “conversion mechanisms” preempts criticisms, often raised 
against stratification research, that its evidence consists of mere statistical 
artifacts. Instead, the notion of conversion posits actual social mechanisms 
that transpose forms of capital while creating space for a historical analysis 
of the accumulation of those forms of capital. 

Capital clearly represents a step away from the abstract formalism of 
stratification analysis. The trouble is that, as students of political economy 
will no doubt notice, this metaphorical use of the term capital forgets 
about one set of institutional-level complications—the distinction between 


Elsewhere (Böröcz 1997) this argument is developed in detail.

4 Weber ([1920] 1978, pp. 305-6). 

“ For a discussion of this claim, along with some background on how it emerged, see 
Böröcz (2000a). 

3 To be also noted is that formal educational attainment—the shared content of the
notions of intellectualness, managerness, and cultural capital—can be seen to have
meaningful bidirectional causal ties to each of the 10 cells of the two tables in figure
1. In other words, formal education itself contains both network and ranking mech-
anisms. It is orthogonal to the distinctions presented in this model.

[Figure 1: two tables under the heading “contingent closure.” Columns: relational (network) mechanisms (conflict, cooperation) and ranking (hierarchical) mechanisms (prestige, privilege, life conduct). Rows: formal, informal.]

FIG. 1.—Typology for analyzing contingent closure, based partly on Weber’s model of
ständische Lage.


wealth and capital. That distinction rests on the idea that not all forms 
of wealth are relevant for capitalist accumulation: wealth needs to become 
capital and be valorized, which involves its subjection to a specific social 
institution called, depending on the context of the analysis, the production 
process, the capital-labor relation, exploitation, or the appropriation of 
surplus labor. The valorization of capital represents a very narrow strip 
of institutional alternatives in which accumulated wealth can be ex- 
pended. Much of what happens with accumulated value is final con- 
sumption or thesauration—outside of the realm of capitalist accumulation 
per se. 

The metaphors of cultural, political, or social capital provide no theory 
of valorization. They do not distinguish among various uses—profit- 
making versus wastefulness—of the capitals they denote. To be noted also 
is the fact that the core social content of at least two of those capi- 
tals—social network ties and knowledge as a reflexive and creative linkage 
to the world—are widely valued and enjoyed in their noninstrumental 
form, in and for the sake of themselves, without their being launched in 
purposive-instrumental social action in the interest of seeking social po-
sition. Another shortcoming of the language of capitals—which allows its
use in stratification analysis—is its frequent reduction to a trait of indi-
viduals rather than collectivities (Portes 1998; Portes and Landolt 2000).

Finally, all three of the metaphorical capitals are different from eco- 
nomic wealth/capital when they are launched, even under the most 
rational-instrumental conditions. When cultural, social, or political capital 
is activated, there is no direct, “accountable” loss to the actor who applies 
it. Meanwhile, social and political capital can be lost or radically devalued 
in social mechanisms that have nothing to do with their valorization. In 
fact, nonuse is likely the most frequent cause of the depletion of social, 
cultural, and political capital. In addition, cultural “capital,” and also some 
forms of political and social “capital,” are “sticky”: they may be quite 
difficult to pass on. As a result of a combination of nondepletion on being 
launched, the general lack of interchangeability among the three capitals, 
their resilience to transitivity, and the absence of a general measure of 
exchange value, they fail to lend themselves to any sense of rational 
bookkeeping or calculation. Thus it appears that capitals are an elegant 
but oversimplified economism. My sense is that Eyal et al. rely on this 
language for two reasons: first, they need to provide a link between the 
survey-based evidence regarding personnel changes at top managerial 
positions (a set of individual-level observations) and their institutional- 
level argument of historical change; second, because their “rank” reading 
of Stand lacks a relational component, which they supply through the 
notion of social capital. 

Burawoy criticizes both volumes for their insufficient emphasis on class 
relations. In Marx, the term “class” is used in at least three, mutually 
incompatible meanings, depending on the context. The dichotomous- 
antagonistic notion of class struggle of his political theory differs from 
the tripartite, descriptive political-economic idea he uses for some de- 
scription of 19th-century capitalism (in which the working class, capi- 
talists, and the rentier class are grouped according to the main source of 
their livelihood). In his narratives about past societies, Marx refers to a 
large number of “class” locations in a highly idiographic-descriptive fash- 
ion. Add to this polysemy Weber’s idea of classes as unities of social 
inequality in discontinuous, uneven social space. Iván Szelényi’s analysis 
of the state socialist new class has never had much to do with Marx’s 
antagonistic-political concept of class; it is best described as a Weberian 
rereading of Marx’s critique of modernity, applied to the post-Stalinist
status quo in state socialist Central Europe. In terms of that interpretation, 
it is difficult to claim—as I think Burawoy does—that Szelényi has moved 
away from a class-based interpretation of “Central and East European” 
reality. Some passages in Eyal et al.’s treatment of the contemporary 
relevance of Stand and Klasse offer an explicit, complex argument re- 
garding the mutual embeddedness of the two. 

David Stark has indeed moved from an initial interest in the com-
plexities of industrial relations in Hungary to disentangling the transfor-
mations of bundles of property. The idea that conceptual work on the
institutional sociology of the massive privatization of public property is
necessarily less relevant to a critical understanding of contemporary cap-
italism than a focus on shop-floor relations is, however, not evident to
me. Reproduction of managerial power, restructuring networks, and the 
shop floor appear to be equally important, strategic points of insertion 
for sociological analysis. 


All Cases Are Linked 


Much of the scholarship on the post-state-socialist transformation tends to
underestimate the impact of external effects. Foreign patterns exert
an effect through the media, foreign education of the political,
economic, and cultural elites as well as through the wide travel experience
of all European populations (e.g., Hungary annually receives four times
as many foreign visitors as it has inhabitants). International tourism is a widely
neglected factor of social change through informalization, the introduction 
of sizable incomes in hard currency, and the creation of extreme consumer 
expectations (Böröcz 1996). 

The relative advantages and disadvantages of privatization policies in 
the Czech Republic, Hungary, Poland, Russia, Slovakia, and what was 
formerly East Germany have been widely compared in public discourse 
and entered into subsequent policy choices and modifications in each of 
those countries. Since 1989, political discourse has commonly made ref- 
erence to highly idealized standards of behavior and the institutional 
legacies of specific West European societies.” 

Probably the single most momentous force in determining the course 
of large-scale social change in Central Europe comes from the European 
Union. The EU is currently considering the applications for accession to 
full membership by 10 states of formerly state-socialist Europe, with each 
applicant already an associate member of the EU. One of the three criteria 
for membership is that the applicant state should, by the time of the 
beginning of negotiations for accession, have transposed and implemented 
the acquis communautaire—the EU’s community laws and regulations. 
This produces a very powerful political, economic, and communicative 
dependence (Böröcz 2000b). How this borrowing, transposition, and im-
plementation takes place is a veritable gold mine for comparative soci-
ology (see, e.g., Jacoby 1999). The NATO membership of three of the
region’s societies serves as another, increasingly important avenue of in- 
stitutional borrowing and recontextualization. 

Because of the significance of recontextualization in institutional 
change, it is surprising how little of the rich anthropological literature on 
recontextualization and hybridity finds its way into the comparative-his- 


“The Hungarian Constitutional Court, e.g., has actively implemented an Austrian- 
German type of judicial review (Halmai 1996-97), contributing to the widespread 
borrowing and recontextualization of institutional elements from those societies.

torical sociology of post-state-socialist social change. The unintended con-
sequence of that absence is a sociology of social change in which all wheels
are constantly (re)invented.


The Glocal Is Everywhere 


The term “glocal” refers to the idea, widespread in work addressing the 
globe as a single entity, that the analytical separation of phenomena of 
different scales (i.e., global vs. local) is increasingly untenable. I use it 
here as a sensitizing tool to the existence of large-scale processes that 
influence the institutional- and individual-level outcomes of social change 
on the local level. 

With the comparative-historical analysis of social change, time (spe- 
cifically, the temporal distance allowed by analysis after the onset of the 
transformation for changes to take shape) is of utmost substantive sig- 
nificance. Had they not closed their data tables between 1993 (e.g., Eyal 
et al. 1998, table 4.13) and 1996 (table 4.15), the foreign direct invest- 
ment—dominated, large-scale privatization of state assets at least in some 
of the region’s societies (most markedly Hungary where, by now, virtually 
all of the state’s alienable assets have been passed on to private, over- 
whelmingly foreign, investors)” would have made it imperative for 
Eyal et al. to conceptualize economic globalization, the region’s 
(semi)peripheral position in the world economy, and the significance of 
transnational actors as central to determining the outcomes of the trans- 
formation—all the more so given the precipitous drops in the post-state- 
socialist economies’ shares in global product, along with the widening 
discrepancies between their GDP and GNP figures caused by the dramatic 
increase in the relative significance of foreign-owned companies in their 
economic output. 

Stark and Bruszt’s chapter 2 does address foreign direct investment. 
It does so, however, only in passing and as a generic dimension describing 
possible privatization strategies derived from the Hungarian debates on 
property change. In the discussion of the binary alternative of “markets 
versus Clan,” as well as in the book’s subsequent treatise on states and 
in such sparkling metaphors as deliberativeness in associations, enabling 
constraints, and extended accountability, there is little conceptual reflec- 
tion on how the most momentous structural force in the transformation 
process—the penetration of foreign capital—has worked to enable foreign 
capital while disabling local society. Equally missing is acknowledgement 
of the fact that in at least some of the region’s societies, the transformation 
took place under severe external debt dependence, causing the fiscal crisis 


Hungary has been, since 1990, the destination of over $18 billion in foreign direct
investment (U.S. Department of State 2000). According to the last annual report of
the Hungarian State Privatization and Holding Company (APV Rt. 2000), of the 1,859
enterprises owned by the state on January 1, 1991, only three were currently owned
by the state by early 2000 (Diczházi, n.d., p. 126).

of the post-state-socialist state (e.g., Campbell 1996). This omission is
surprising, given some literature about the curious presence, and in many
ways quite counterintuitive behavior, of the European state socialist econ-
omies in the capitalist world economy (see, e.g., Frank 1977; Chase-Dunn
1982; Gorin 1985; Kennedy and Smith 1989; Böröcz 1992), as well as the
significance of foreign actors in the economic transformation after state
socialism under great shortages of domestic economic assets (Böröcz 1992
and 1993).

It is difficult to overestimate the significance of global forces in local 
institutional change in Central Europe. The region’s large organizations 
of public authority have metamorphosed from being comprador states 
(the local state within the state socialist status quo whose legitimacy rests 
with the state socialist imperial center, i.e., outside its borders) into auc- 
tioneer states that, instead of democratizing access and optimizing control 
over the collective property they inherited in a very concentrated form 
from their state socialist predecessor, first hand control over to sitting 
management and eventually liquidate all public assets entrusted in their 
care to private, overwhelmingly foreign, investors at a discount (Böröcz
1999). All this took place under the global ideological aegis of monetarist 
neoliberalism and fiscal restriction, pushed by global lenders and their 
organizations. In direct challenge to the modernizationist view of the role 
of aid and foreign direct investment, informality has been ubiquitous in 
the transformation largely because, and not in spite of, the influence of 
foreign actors (Holmes 1999; Böröcz 1993, 20008). 

Conventional macroanalysis pays little attention to, and has been ill- 
equipped to address, local institutional variation, creating a paucity of 
work on the integration of the global and the local. With its emphasis on 
the local, conventional institutional analysis has a proclivity for pushing 
global factors to the background, so little work is done on the effects of 
glocality. The experience of European capitalisms after state socialism 
presents a very fertile context for exploring the cleavage between the two. 


SUMMARY 


I have argued that the two books make important but relatively small 
steps in what is a potentially vast conceptual space for comparative- 
historical research. Although successful on their own terms, they leave 
much of the analytical terrain unexplored. Figure 2 includes a visual 
representation of this space in three dimensions along the three edges of 
a cube, meeting in the origin (marked as “0”). The left-right axis represents 
the possibility to consider the institutional legacies of the past as potent 
forces in the present. This articulates an analytical world in which the 
modernity of an institutional arrangement is not thought of as necessarily 
antithetical to or destructive of the past but one that carries the patterns 
of the past within the present. The “origin-to-casewise” axis depicts the 
dimension in which recontextualizing effects can be addressed, accounting

[Figure 2: cube diagram locating Eyal, Szelényi & Townsley and Stark & Bruszt.]

FIG. 2.—Conceptualization of willingness to consider historical, across-case, and glocal
mechanisms.


for the wide influences of our various societies as “cases” on each other. 
Finally, the vertical axis of “origin-to-glocal” indicates the dimension that 
allows accounting for the glocal phenomena, including the global aspect 
of Burawoy’s interest, an analysis of global class relations. 

The two books are reminders that a truly neoclassical sociology of social 
change would utilize the entire space of the three conceptual dimensions. 
The complexity and significance of social change in Central Europe, and 
the need to reinvent comparative-historical sociology in order to make it 
true to its promise, call for no less. 


REFERENCES 


Alatas, Syed Farid. 1995. “The Theme of ‘Relevance’ in Third World Human Sciences.”
Singapore Journal of Tropical Geography 16 (2): 123-40.

Alatas, Syed Farid. 2001. “The Study of the Social Sciences in Developing Societies: Towards an
Adequate Conceptualization of Relevance.” Current Sociology 49 (2): in press.

Arthur, W. Brian, Yuri M. Ermoliev, and Yuri M. Kaniovski. 1994. “Path-Dependent
Processes and the Emergence of Macrostructure.” Pp. 33-48 in Increasing Returns
and Path Dependence in the Economy, edited by W. Brian Arthur. Ann Arbor:
University of Michigan Press.

APV Rt. 2000. Privatizációs monitor 1999. Állami Privatizációs és Vagyonkezelő
Részvénytársaság. September 24. http://www.apvrt.hu/m5.html

Bodnár, Judit. 2001. Fin-de-Millénaire Budapest: Metamorphoses of Urban Life.
Minneapolis: University of Minnesota Press.

Böröcz, József. 1989. “Mapping the Class Structures of State Socialism in East-Central
Europe.” Research in Social Stratification and Mobility 8:279-309.

Böröcz, József. 1992. “Dual Dependency and Property Vacuum: Social Change on the State
Socialist Semiperiphery.” Theory and Society 21:77-104.

Böröcz, József. 1993. “Simulating the Great Transformation: Property Change under
Prolonged Informality in Hungary.” Archives européennes de sociologie /
Europäisches Archiv für Soziologie / European Journal of Sociology 34, no. 1 (May):
81-107.

Böröcz, József. 1996. Leisure Migration: A Sociological Study on Tourism. Oxford: Pergamon
Press.

Böröcz, József. 1997a. “Social Change with Sticky Features and the Failures of
Modernizationism.” Innovation 10 (2): 161-70.

Böröcz, József. 1997b. “Stand Reconstructed: Contingent Closure and Institutional Change.”
Sociological Theory 15 (3): 215-48.

Böröcz, József. 1999. “From Comprador State to Auctioneer State: Property Change,
Realignment and Peripherialization in Post-State-Socialist Central Europe.” Chap.
10 in States and Sovereignty in the Global Economy, edited by David A. Smith,
Dorothy Solinger, and Steven Topik. London: Routledge.

Böröcz, József. 2000a. “The Fox and the Raven: The European Union and Hungary
Renegotiate the Margins of ‘Europe.’” Comparative Studies in Society and History
42, no. 2 (October): 847-75.

Böröcz, József. 2000b. “Informality Rules.” East European Politics and Societies 14 (2):
348-80.

Bozóki, András. 2000. “Érzékenység és türelem.” Élet és Irodalom, 36. September 25.
http://es.fullnet.hu/0036/index.html

Burawoy, Michael, and Katherine Verdery, eds. 1999. Uncertain Transition:
Ethnographies of Change in the Postsocialist World. Lanham, Md.: Rowman &
Littlefield.

Campbell, John. 1996. “An Institutional Analysis of Fiscal Reform in Postcommunist
Europe.” Theory and Society 25 (1): 45-84.

Chase-Dunn, Christopher K., ed. 1982. Socialist States in the World-System. Beverly
Hills, Calif.: Sage.

Cooper, Frederick, and Ann Laura Stoler. 1997. “Between Metropole and Colony:
Rethinking a Research Agenda.” Pp. 1-40 in Tensions of Empire: Colonial Cultures
in a Bourgeois World, edited by Frederick Cooper and Ann Laura Stoler. Berkeley
and Los Angeles: University of California Press.

Diczházi, Bertalan. N.d. A külföldi tőke szerepe a privatizációban. Számadás a
talentumról. Budapest: Állami Privatizációs és Vagyonkezelő Részvénytársaság és
GJW-CONSULTATIO konzorcium.

Dupcsik, Csaba. 1994. “Korunk hősei.” BUKSZ-Budapesti Könyvszemle (Nyár):
151-54.

Elias, Norbert. 1987. “The Retreat of Sociologists into the Present.” Theory, Culture
and Society 4:2-3.

Eyal, Gil, Iván Szelényi, and Eleanor Townsley. 1998. Making Capitalism without
Capitalists: The New Ruling Elites in Eastern Europe. London: Verso Books.

Frank, André Gunder. 1977. “Long Live Transideological Enterprise! The Socialist
Economies in the Capitalist International Division of Labor.” Review 1, no. 1
(Summer): 91-140.

Gorin, Zeev. 1985. “Socialist Societies and World System Theory: A Critical Survey.”
Nature and Society 49, no. 3 (Fall): 332-66.

Halmai, Gábor. 1996-97. “Free Speech in the New Hungarian Constitutional
Practice.” International Journal of Sociology 26 (4): 66-75.

Hillman, Arye L. 1994. “The Transition from Socialism: An Overview from a Political
Economy Perspective.” European Journal of Political Economy 10, no. 1 (May):
191-225.

Holmes, Stephen. 1999. “Can Foreign Aid Promote the Rule of Law? Drawing Lessons
from Failures in Legal Reform.” East European Constitutional Review 8, no. 4 (Fall):
68-74.

Jacoby, Wade. 1999. “Priest and Penitent: The European Union as a Force in the
Domestic Politics of Eastern Europe.” East European Constitutional Review 8, no.
1/2 (Winter/Spring): 62-67.

Kennedy, Michael D., and David A. Smith. 1989. “East Central European
Urbanization: A Political Economy of the World-System Perspective.” International
Journal of Urban and Regional Research 13, no. 4 (December): 597-624.

Konrád, George, and Iván Szelényi. 1974. The Intellectuals on the Road to Class Power.
Translated from the Hungarian by Andrew Arato and Richard E. Allen. New York:
Harcourt Brace Jovanovich.

Kontler, László. 1999. “Central Europe. Ten Years After.” European Review of History
6, no. 1 (Spring): 9-14.

Kornai, János. 1992. The Socialist System: The Political Economy of Communism.
Princeton, N.J.: Princeton University Press.

Manza, Jeff. 1992. “Classes, Status Groups, and Social Closure: A Critique of Neo-
Weberian Social Theory.” Current Perspectives in Social Theory 12:275-302.

Murphy, Raymond. 1988. Social Closure: The Theory of Monopolization and Exclusion.
Oxford: Clarendon Press.

Parkin, Frank. 1974. “Strategies of Social Closure in Class Formation.” Pp. 1-18 in
The Social Analysis of Class Structure, edited by Frank Parkin. London: Tavistock.

Portes, Alejandro. 1998. “Social Capital: Its Origins and Applications in Modern
Sociology.” Annual Review of Sociology 24:1-24.

Portes, Alejandro, and Patricia Landolt. 2000. “Social Capital: Promise and Pitfalls of
Its Role in Development.” Journal of Latin American Studies 32:529-47.

Róna-Tas, Ákos. 1994. “The First Shall Be Last? Entrepreneurship and Communist
Cadres in the Transition from Socialism.” American Journal of Sociology 100 (1):
40-69.

Róna-Tas, Ákos. 1997. The Great Surprise of the Small Transformation: The Demise of
Communism and the Rise of the Private Sector in Hungary. Ann Arbor: University
of Michigan Press.

Róna-Tas, Ákos. 1998. “Path Dependence and Capital Theory: Sociology of the Post-Communist
Economic Transformation.” East European Politics and Societies 12 (1): 107-31.

Said, Edward. 1978. Orientalism. New York: Vintage.

Stark, David. 1992. “Path Dependence and Privatization Strategies in East Central Europe.”
East European Politics and Societies 6, no. 1 (Winter): 17-54.

Stark, David. 1996. “Recombinant Property in East European Capitalism.” American Journal
of Sociology 101, no. 4 (January): 993-1027.

Stark, David, and László Bruszt. 1998. Postsocialist Pathways: Transforming Politics
and Property in East Central Europe. Cambridge: Cambridge University Press.

Szelényi, Iván. 1986-87. “The Prospects and Limits of the East European New Class
Project: An Auto-Critical Reflection on ‘The Intellectuals on the Road to Class
Power.’” Politics and Society 15 (2): 103-44.

U.S. Department of State. 2000. Background Notes: Hungary. September 24.
http://www.state.gov/www/background_notes/hungary_9908_bgn.html

Vedres, Balázs. 2000. “A tulajdonosi hálózatok felbomlása: A rekombináns
tulajdonformák szerepe és a hazai nagyvállalatok tulajdonszerkezetének jellemzői
a kilencvenes évek végén.” http://www.bke.hu"vedres/pges/PapervTul2000/Tulajdon2000.htm.
February.

Wallerstein, Immanuel. 1974. The Modern World-System: Capitalist Agriculture and
the Origins of the European World-Economy in the Sixteenth Century. New York:
Academic Press.

Weber, Max. (1920) 1978. Economy and Society: An Outline of Interpretive Sociology.
Edited by Günther Roth and Claus Wittich. Berkeley: University of California Press.

Wolff, Larry. 1994. Inventing Eastern Europe: The Map of Civilization on the Mind
of the Enlightenment. Stanford, Calif.: Stanford University Press.


Book Reviews 


Civic Engagement in American Democracy. Edited by Theda Skocpol and 
Morris Fiorina. Washington and New York: Brookings Institution Press 
and Russell Sage Foundation, 1999. Pp. x+528. $49.95 (cloth); $19.95 
(paper). 


David Plotke 
New School University 


The value, quality, and effects of civic life have long been major subjects 
for American sociology and political science. The most recent wave of 
interest in civic life derives from varied sources, including studies of the 
republican elements of early modern political thought and analyses of 
Reagan’s efforts to limit the federal state. 

Theda Skocpol and Morris Fiorina locate their valuable collection amid 
recent debates. The coordinates of this location are given by James Cole- 
man’s use of the term “social capital,” Robert Putnam’s studies of Italian 
political life and American civic decay, rationalistic accounts of cooper- 
ation, and, in the background, Bellah et al.’s Habits of the Heart. This 
collection offers a careful and realistic account of contemporary civic life 
in the United States. The authors are reasonably impatient with loose
jeremiads and casual claims that things are okay. 

The authors use historical records, data from national membership 
organizations, and, to a lesser extent, survey research. They find a con- 
siderable decline in national membership organizations, from the PTA to 
fraternal associations. Yet the number of national lobbies and advocacy 
organizations has increased, and some of these have achieved large mem- 
berships. While American civic life remains relatively robust in compar- 
ative terms, there has been a notable shift toward groups that solicit money 
or try to add names to their membership list as against groups based in 
local chapters of more or less active members. Thus the decline of civic 
life looks less pronounced if one counts organizations than if one gauges 
participation in the new universe of groups. 

This reshaping and partial decline of civic activity includes several 
elements. Race is relevant insofar as legal integration may have had the 
perverse effect of weakening some national civic groups—some whites 
withdraw from settings whose racial character can no longer be guaranteed.
Another factor has to do with changing religious commitments.
Absent the great upsurge in evangelical denominations, religion would 
have surely declined. Yet these newer denominations make a more modest 
civic contribution than do the older variants of American Protestantism 
that they have replaced. 

A third feature of civic action is persistent inequality: people with more
resources are more civically engaged. The level of such inequality is strik- 
ingly high. The steady and cumulatively dramatic decline in unions is 
part of this pattern, and labor’s decline may warrant even more emphasis 
than it receives in this volume. Especially in the decades of their greatest 
strength, unions encouraged and supported other forms of civic action. 
Thus union decline may have had major negative effects on other groups. 

This collection’s main strength lies in its rich and interesting accounts 
of the shape of civic and associational life. These will make the work 
durably valuable. Beyond this contribution, the book raises two hard 
questions: How do we explain this mixed pattern of change and decline 
in civic life? And why does it matter? 

The second question is highlighted by data to which several articles 
refer. With its troubles, civic activity in the United States remains con- 
siderably higher than in most European countries—yet in many of these 
countries voting rates are much higher and crime rates are much lower 
than in the United States. These comparisons pose a real problem for the 
many claims that increasing civic engagement is a crucial means of in- 
creasing political participation or reducing various social problems. What 
are the expected benefits of a major reallocation of time and resources to 
civic action? And how much should we value these benefits against the 
alternatives? 

While the explanatory question is more directly addressed, no con- 
vincing answer emerges. Robert Putnam’s recent Bowling Alone finds a 
far larger decline in civic life than do the authors of Civic Engagement. 
Putnam attributes this decline mainly to television and generation change 
(restating the main lines of cultural and sociological criticism of an emerg- 
ing American mass society from World War I into the 1960s). The authors 
of the present volume are skeptical about this explanation. They also 
believe there is less to explain, as they gauge the overall decline in civic 
activity to be less dramatic than does Putnam. 

Yet it is not easy to say precisely what the authors believe to have 
caused the civic shifts that they do identify and mainly criticize. One 
problem is that many things have changed in a consistent direction during 
the same decades that advocacy groups have tended to replace member- 
ship associations. The number of major league baseball teams has grown 
considerably, as has the frequency of air travel. How can one find efficient 
causes amid this wealth of possible explanations? 

Several lines of inquiry might be pursued further, via the judicious 
treatment (mainly historical and institutional) that such matters get in 
this volume. It would be worth looking more carefully at the development 
of the new advocacy organizations. In several cases, as with NOW, serious 
initial efforts were made to create locally based national membership 
organizations. Why were such efforts not successful? 

One possible reason for the decline of national membership organiza- 
tions, which is only briefly mentioned in this volume, is the growth of 
the national state. This issue is politically charged, and one hesitates to
augment the lazy antistatism that has permeated so much of American 
political and intellectual life. Yet Skocpol’s account (in “How Americans 
Became Civic”) shows that the rate of formation of national multipurpose 
membership organizations fell in the 1930s and never recovered—at the 
same moment that a dramatic and durable increase occurred in the role 
of the national state. 

Changing forms of household organization (which get little attention 
in this volume) may also be significant. For decades complaints of civic 
decline have often been accompanied by stern lectures to women that this 
dire social fate has been tightly linked to their growing interest in work 
and reduced engagement with family and community. This hectoring is 
objectionable, but we should take it for granted that antifeminists will 
hang their hat on every conceivable social trend. Questions about how 
time and resources are allocated to civic life within contemporary house- 
holds merit further inquiry. 

Third, the main shifts in civic activities seem to invite explanations 
based on modernization. Organizations like the Elks Club that were large 
at midcentury may have declined partly due to standard processes of 
modernization. With rising incomes and more choices, people seek to meet 
the needs that such organizations once met through more specialized 
forms—new insurance and savings plans, advocacy groups, charitable 
contributions, golf and swimming clubs, and so on. This shift had a 
sharply evaluative edge. At some point in the 1960s or 1970s the multi- 
purpose organizations that the authors tend to favor gained a wide rep- 
utation as boring and definitely not cool among the sons and daughters 
(and now grandsons and granddaughters) of the strata from which they 
had drawn their most active members. Happily, the present volume lacks 
the comical piousness that often infects such discussions, in which the 
reader is chastised for not belonging to the Kiwanis Club or the Masons 
by cosmopolitan academics who are themselves about as likely to be active 
in such groups as they are to become Mennonites. 

This book belongs on the short list of works produced in the last decade 
or so about civic life in the United States from which one can actually 
learn something. The numerous strong articles repay careful attention for 
anyone interested both in current debates and in figuring out what the 
next round of debates should address. 


Civil Society and the Political Imagination in Africa: Critical Perspec-
tives. Edited by John L. Comaroff and Jean Comaroff. Chicago: Uni-
versity of Chicago Press. Pp. xii+318.


Mahmood Mamdani 
Columbia University 


First written for a May 1996 conference, this selection of papers is now
published with a substantial introduction. The idea of civil society may 
be “Euro-centrically founded” and “fully compromised” as an “analytical 
term,” but the editors claim that it merits anthropological attention be- 
cause this “tool of the social imagination” has become “the metaphor of 
the moment” (pp. x, 2). They aim “to reverse the priorities of much recent 
academic work on civil society”: rather than ask what the idea of civil 
society can tell us about contemporary Africa, they probe what the specific 
set of African cases “tell us about the planetary appeal of the Idea” 
(p. 3). 

The introduction traces the planetary journey of the Idea from its Scot- 
tish birth (1789) to its global appeal (1989). Between the two moments 
lies the journey from the West to the non-West. Following Jean-François
Bayart’s claim that “liberal democratic forms have been variously do- 
mesticated across the continent,” the Comaroffs aim to identify what 
“African hybrids, Africanized modes of civil society, might actually look 
like” (p. 20). To do so, they shed the “Eurocentric tendency to limit civil 
society to a narrowly defined institutional arena” and acknowledge the 
growth of “‘uncool’ forms of African association; forms dubbed partisan, 
parochial, fundamentalist” (pp. 21-22). 

But the recognition of uncool forms of civil life does not sit well with 
the claim of the Idea gaining planetary appeal. Neither does reference to 
the Idea’s Eurocentric origin come to grips with its colonial journey to 
explain why the post-1989 planetary reach of the Idea came to be fractured 
by a key difference. For lack of space, I will advance three propositions. 
First, the Idea of civil society does not exhaust the political imagination of
postcolonial Africa. The “uncool” forms of African association do not
speak the language of civil society, but of tradition. Yet, none of the 
languages rapidly reshaping the public sphere in postcolonial Africa can 
be understood in isolation because all are born of a common history. 
Second, this history—of colonialism—points to the key difference between 
1789 and 1989, one glimpsed through contrasting histories of state for- 
mation that gave rise to the nation-state in Europe and the colonial state 
elsewhere, the former marked by the tendency to homogenize difference, 
the latter by the tendency to reify it. To the extent that the legacy of the 
colonial state undergirds the formation of political identities, discourses, 
and organizational impulses in contemporary Africa, the colonial context 
is key to understanding the institutional and discursive forms of contem- 
porary political struggles. Finally, while the flourishing of “uncool” forms 
does testify to the reshaping and expansion of civil society—understood
analytically—it actually challenges the claimed “planetary appeal” of the 
Idea, understood ideologically and programmatically. 

The editors gloss over the colonial context and contrasting histories of 
state formation, instead choosing to emphasize the Eurocentric origin and 
the planetary appeal of the Idea. Fortunately, not all the contributors 
follow their lead. The problem with democratic categories, Gary Wilder 
points out in chapter 2, is not simply that “they emerge out of European 
histories,” but that they “have been entwined with practices of European 
domination” (p. 64). William Bissell (p. 125) begins chapter 5 on Zanzibar: 
“What shape might civil society assume in social formations that have 
been produced, to a greater or lesser extent, within the context of colo- 
nialism?” His answer both recognizes that race and religion, sect and 
ethnicity, were the very form of civil society under colonial domination 
and explains why they would become the cutting edge of initiatives to 
expand and reshape the public sphere after colonialism. 

In making a clear distinction between the two uses of civil society—the 
analytical and the ideological—these contributors have opted for actually
existing civil society over civil society as the programmatic agenda so dear 
to mainstream social science. But the ecumenical move alone is problem- 
atic. This is evident in chapter 4, where Mikael Karlström discusses how
elective local assemblies and clan associations have structured expanding 
political participation in post-1986 Uganda. But he remains silent about 
the radically different consequences of each in a region where migrants 
constitute anywhere from one-third to one-half of the population. Whereas 
an ethnic basis for reorganizing the public sphere would emphasize de- 
scent as the basis of participation and disenfranchise immigrants, a civic 
basis would emphasize residence and enfranchise immigrants. The clans 
stand for the former principle, the local assemblies for the latter. 

For social science to recognize the legacy of the colonial in the post- 
colonial, and the many initiatives to reshape it, is no doubt an advance. 
But the legacy needs to be problematized. Chapter 6 on Sierra Leone 
suggests a beginning, and chapter 8 on the contradictory nature of Islamist 
movements in Niger presents a fuller and more rewarding effort. 

In sum, the otherwise rich introduction is limited by a double pre- 
occupation: more with the Idea of civil society than with the political 
imagination in Africa, and more with the Eurocentric origin of the Idea 
than with its colonial journey. But the range of perspectives inspiring the 
essays is broader than the title suggests. For this reason, the book—the 
introduction and the essays—takes us a valuable step further in the debate 
on civil society and that on the African political imagination, provided 
we do not insist on collapsing the two. 


State/Culture: State-Formation after the Cultural Turn. Edited by
George Steinmetz. Ithaca, N.Y.: Cornell University Press, 1999. Pp. 
x+433. $59.95 (cloth); $22.95 (paper). 


Thomas Ertman 
New York University 


The past two decades have witnessed not only the emergence of a separate 
interdisciplinary field of cultural studies spanning the social sciences and 
the humanities, but also the spread of culturalist perspectives throughout 
many areas of sociology and anthropology. This “cultural turn” (p. 1) has 
had far less of an impact upon those areas of political science and political 
and historical sociology that concern themselves with the state. This im- 
portant volume, which brings together 12 papers presented at a Wilder 
House conference at the University of Chicago in 1995, seeks to revitalize 
the study of state formation by allowing it to partake of the fruits of the 
cultural turn. In its ambitions this collection bears a strong resemblance
to Peter Evans, Dietrich Rueschemeyer, and Theda Skocpol’s Bringing
the State Back In of 1985, and its impact should be equally great. 

In his stimulating introduction, George Steinmetz contends that state 
theory currently finds itself in an “unsatisfactory condition” (p. 23) char- 
acterized by an unproductive stalemate between Marxists/neo-Marxists 
and Weberians/neo-Weberians over the degree of autonomy that the state 
enjoys vis-a-vis society in general and social classes in particular. At the 
same time, research employing both society-centered and state-centered 
approaches has pointed, often more implicitly than explicitly, to the im- 
portance of political discourse and ideology as key determinants of var- 
iation in the modern Western state, determinants that both Marxist and 
Weberian traditions are ill-equipped to integrate fully into their expla- 
nations because of their slighting of culture as a causal factor. Further- 
more, the rise of rational choice within political science offers no way out 
of this “impasse” (p. 25) because it operates with a rational actor model 
that emphatically rejects cultural particularism. Yet even here, as Julia 
Adams’s exemplary contribution to this volume demonstrates, culture 
performs crucial yet unacknowledged explanatory work in the attempts 
by rational choice theorists like Douglass North to account for state de- 
velopment in medieval and early modern Europe. 

The logical and convincing conclusion that Steinmetz draws from his 
reading of much contemporary work on the state is that this field would 
be greatly enriched, indeed revitalized, if it were to undergo a “cultural 
turn” of its own. Yet how precisely should culture be defined and its 
relationship to the political in general and the state in particular be con- 
ceptualized? Here Steinmetz, in keeping with his goal of “staging [a] dis- 
cussion among the various cultural perspectives” (p. 27), avoids direct 
programmatic answers. For these one must turn to the volume’s 13 essays. 

These fall into two broad categories. First, there are four theoretical 
contributions (Pierre Bourdieu, Timothy Mitchell, John Meyer, and
Charles Tilly) each of which seeks to disentangle the relationship between 
culture and the state within the context of more general analytic frame- 
works. The solutions offered range from Bourdieu’s well-known argument 
that the state and its elites shape national culture (national identity, modes 
of perception) via the educational and judicial systems; through Mitchell’s 
Foucault-inspired counterimage of modern culture and the modern in- 
dividual as the product of broader disciplining mechanisms not subject 
to the instrumental control of particular elites; to Meyer’s picture of na- 
tional cultures and the political and economic institutions that support 
them as embedded in and legitimated through an “exogenous, and more 
or less worldwide, rationalistic culture” (p. 123). It is a shortcoming of 
the volume that these three dense, brilliant pieces largely talk past one 
another: only Meyer addresses the claims of his co-contributors directly. 
In his epilogue, Charles Tilly goes some way toward compensating for 
this shortcoming by drawing on a number of contributions, especially that 
of Meyer, to present his own attractive “relational approach” to state and
culture, which “generates social structures and processes from culturally
embedded interactions” (p. 410).

Second, there are the volume’s nine case studies, which provide a far 
more tangible sense of what “bringing culture back in” to the analysis of 
the state means in practice. Three of these contributions, those of Julia 
Adams, Philip Gorski, and Steven Pincus, focus on early modern Europe 
and emphasize, respectively, the crucial role played by familial patri- 
monialism, religion (Calvinism), and popular nationalism in state for- 
mation during this period. Moving beyond Europe, further essays by 
Andrew Apter and by Nader Sohrabi show how Western political prac- 
tices and models could be successfully assimilated into indigenous
traditions during the nation-building process in Nigeria (Apter) and Iran 
(Sohrabi). Finally, four contributions by David Laitin, Ann Orloff, Mabel 
Berezin, and Bob Jessop return to the modern West. They share a common 
analytic concern with the role of discourse in identity formation and nation 
building in the Soviet successor states (Laitin), in the gendering of the 
Anglo-Saxon welfare state (Orloff), in fascist attempts at nation building 
in Italy (Berezin), and in various critical models of the modern capitalist 
economy and polity (Jessop). While all of these pieces are of uniformly
high quality, those of Adams, Apter, Laitin, and Berezin deserve special 
mention for their methodological creativity. 

In summary, while the editor’s decision not to privilege one under- 
standing of culture has resulted in a certain theoretical diffuseness, the 
richness of this volume’s contributions should provide inspiration to all 
those who seek to reinvigorate the study of the state and state formation. 


Organizing Women in Contemporary Russia: Engendering Transition. By
Valerie Sperling. New York: Cambridge University Press, 1999. Pp.
x+303. $59.95 (cloth); $22.95 (paper).


When socialist states first began democratizing, domestic and foreign ob-
servers placed much hope in the growing power of civil society. There 
was widespread agreement that new local nongovernmental associations 
such as Charter 77 in Czechoslovakia and Solidarity in Poland were the 
building blocks of a new democratic social order. At first, the prospects 
for these groups seemed excellent. With the help and encouragement of 
sympathizers from abroad, these groups spearheaded David and Goliath 
efforts to rein in the power of corrupt government elites. At the time,
they seemed to have won. However, these hopes soon faded. Much to 
everyone’s disappointment, well over a decade later, local public life bears 
little resemblance to a vibrant civil society. Why? Although numerous 
scholars have turned their attention to solving this puzzle, particularly as 
it bears on women’s activism, theories of collective action have remained 
largely absent from these discussions, until now. 

In Organizing Women in Contemporary Russia, Valerie Sperling deftly 
utilizes social movement theory to illuminate the reasons why activists in 
postsocialist societies so frequently pursue mobilization strategies that 
appeal to foreign sympathizers but limit their immediate local impact. 
Democratization in Russia legalized civil disobedience and civic activism, 
and thus created political opportunities for women’s activism. But rather 
than giving rise to a mass mobilized, “contentious” social movement (sat- 
isfying the stringent requirements of scholars such as Tarrow and Tilly), 
it led to the creation of a cautious style of women’s activism, much of it 
dependent on third-party support from abroad. 

Sperling identifies a wide variety of obstacles to locally based conten- 
tious women’s activism. Soviet citizens, men and women alike, were 
obliged to participate in stuffy, hierarchical official organizations like the 
Young Pioneers and the Communist Youth League. This experience has 
made most Russians suspicious of activists and reluctant to join new post- 
Soviet voluntary associations. It has also made it difficult to single out 
women as an aggrieved group. Feminism was for decades a taboo topic, 
and now it is often caricatured. Even women’s activists look with sus- 
picion on feminists who seek to equalize gender relations, for most as- 
sociate such efforts with Soviet state expectations that they work full time 
while single-handedly shouldering a heavy burden of domestic 
responsibilities. 

While Soviet mobilization tactics have created an unfavorable local 
climate for feminism, the Soviet Union’s collapse has attracted foreign 
interest and financial support from abroad for local feminist causes. As 
a result, women’s activism varies considerably across Russia. Explicitly
feminist groups are centered primarily in Moscow and other large cities,
typically have a small membership, and often subsist on foreign assistance 
and encouragement. More numerous in smaller Russian cities are mothers’ 
groups that embrace conventional gender roles, have a larger membership, 
and depend on state funds and local donations. As a result of this division, 
rather than engaging in a common confrontation with the government or 
staging public protests, women’s activists in Russia tend to come together 
at present only for seminars and conferences, typically with Western as- 
sistance. Perhaps over time, this small but divided activist community 
will come together and overcome these ideological divisions. 

This book is well theorized and well researched in addition to being 
highly readable. Organizing Women in Contemporary Russia carefully 
integrates classic questions drawn from social movement theory (whether 
political opportunities and third-party support can overcome the relative 
dearth of indigenous organizations and the underdevelopment of civil 
society) to explain and account for the principal forms of activism within 
this social movement. It draws on abundant research: all told, the author 
conducted dozens of open-ended interviews: 66 with activists representing
50 women’s organizations in three Russian cities (Moscow, Ivanovo, and
Cheboksary) and a series of others with various participants in the
Russian women’s movement, ranging from journalists and academics to 
women government officials and state legislators. It tells a compelling 
story: after providing a comprehensive review of published sources that 
trace the Russian women’s movement’s emergence, Sperling draws on 
her observation of numerous events and presents the results of her in- 
terviews in a fluid narrative that puts her own research to good use to 
illustrate mobilization tactics activists use in various contexts. Her con- 
clusion focuses on the unintentionally divisive effects of foreign aid and 
is followed by a highly useful appendix in which she provides numerous 
summary statistics on activists and organizations. 

Sperling’s monograph should prove of interest to many audiences. It 
advances our understanding of women’s activism during transitions to 
democracy. It sheds light on the role transnational advocacy plays in 
globalization processes. It provides valuable lessons for cross-national 
research on social movements, civil society, and voluntarism. It should 
also prove of interest to scholars interested in the influence of the inter- 
national women’s movement, and of course, it sets a new standard for 
scholarship on post-Soviet civil society. 


Gösta Esping-Andersen’s rich new book analyzes the roots of the crises
of today’s welfare regimes (“regimes” rather than “states,” for it is relations 
among states, labor markets and families that are analytically central). 
These lie not principally in uniform processes of “globalization,” but in 
the disjuncture between existing institutions and risk structures dramat- 
ically changed from those obtaining during the so-called Golden Age of 
the postwar welfare state. These risk structures have been transformed 
by shifts in economies and households: from industrial production dom- 
inance and demand for low-skill workers to service-sector predominance 
and demand for skilled workers, and from the predominance of male 
breadwinner families to dual-earner and sole-parent families and single 
people. The new political economy presents important trade-offs between 
equality and jobs especially, but Esping-Andersen, who wants to defend 
welfare states, argues that they must be changed. (One gets some sense 
of the political and intellectual ground that Esping-Andersen has tra- 
versed by noticing that “decommodification”—an analytic key to welfare 
regimes in his earlier work—has been recast as the “rigidities” of labor 
markets and has been displaced by the greater attention to women’s 
commodification.) However, change will come in country-specific ways: 
while many of the factors challenging present welfare arrangements cross 
the developed world, the logics of liberal, conservative, and social dem- 
ocratic welfare regime institutions shape political and policy dilemmas in 
distinctive ways. 

Esping-Andersen set the agenda for comparative analyses of welfare 
states and policy regimes with his now-classic 1990 book, The Three 
Worlds of Welfare Capitalism (Princeton University Press). The current 
book represents an intervention in an ongoing series of conversations 
stimulated by the earlier volume and therefore takes on a number of 
themes, loosely unified under the rubric of the challenges welfare regimes 
face, given new postindustrial “social foundations.” In part 1, he responds 
to several bodies of scholarship that developed to evaluate and contest 
the claims, concepts, and theoretical assumptions of that earlier book, 
especially around his regime typology and the dimensions of regimes (de- 
commodification, stratification, and the state-market nexus). Here, in re- 
sponse to feminist critiques and research, he has undertaken a thorough 
revision of his notion of regimes—previously defined almost exclusively 
by state-market relations—to incorporate the role of families in welfare 
provisioning. In light of a number of alternatives put forward by com- 
parativists, he also reconsiders and then reconfirms his tripartite typology 
of social democratic, conservative-corporatist, and liberal regimes. He
does allow that the Southern European grouping is notable principally 
(as compared to the larger conservative group) for the degree of its fam- 
ilialism. In part 2, he extends the earlier work on postindustrial employ- 
ment, and again shows—with richer and more recent data—that the dis- 
tinctive institutional characteristics, capacities, and politics associated 
with the three regimes heavily determine how states can and are likely 
to respond to postindustrial economic challenges. In part 3, Esping- 
Andersen finishes with a catalog of the new social risks to which welfare 
states should be—but may not be (politically) capable of—responding. 
Yet he does not offer any answers to the political dilemma of “the deadlock 
of median-voter support for anachronistic modes of welfare production” 
(p. 184). Indeed, the entire volume is oddly apolitical: instead of political 
struggles, we have only the path-dependent workings out of entrenched 
welfare-regime institutions. These do give the political difficulties faced 
by different groups of countries distinctive characteristics. However, one 
wonders about the politics internal to each country: Have partisan dif- 
ferences ceased to matter for welfare outcomes? 

Esping-Andersen’s new work shows the significant scholarly payoff 
from the sustained conversations that have come to characterize com- 
parative welfare regime analysis, including those between feminists and 
others. His revised regime analysis highlights new material on household 
economies, showing that family provisioning remains significant and that 
nonfamilial child care provision—by states or markets—is critical for 
women’s employment. He argues that encouraging men to do more do- 
mestic work is unlikely to affect women’s capacity to work, relying on 
regression analyses of the impact on women’s employment levels of child 
care provision and men’s contribution to domestic work; however, the 
potential impact of men’s contributions (if, somehow, they adopted more 
egalitarian behaviors) cannot be tapped when current levels show so little 
variance that they could not explain the wide cross-national variation in 
women’s employment. Differences in service provisioning across regimes 
do affect women’s employment and fertility, important for their contri- 
bution to welfare-state sustainability and economic productivity. Provision 
of services by state or market makes possible relatively high female em- 
ployment levels and generates employment by increasing demands for 
services, while leaving provision to families has contributed (paradoxi- 
cally) to a low-fertility, low-employment equilibrium that is the least de- 
sirable situation for contemporary states. These findings have made him 
a supporter of policies to promote women’s employment, a welcome al- 
ternative to analysts who assume that promoting women’s employment 
will only reduce the number of jobs available for men. 

Although it does not change his basic typology, Esping-Andersen does 
note service-provisioning differences within regime types identified by 
feminist scholars—France and Belgium differ from other conservative 
continental European countries, while among the liberal group, the United
States differs from Britain. He says we should not expect typologies focusing
on gender to line up with his essentially class-based regime groupings.
However, for those interested in the relationships between gender
and class, this lack of congruence is quite significant: within regimes’
distinctive institutional contexts, gender relations and gender politics
make a difference. The gender division of labor—represented by concepts
such as the male breadwinner/female caregiver model or gender
differentiation—differs within each of the three regimes.

I think Esping-Andersen is missing out by not taking gender on board 
analytically, thus obscuring key social and political sources of the post- 
industrial changes in which he is interested, including men’s and women’s 
mutually influential attempts to secure resources, power, autonomy, and 
identity within the context of gender relations. For example, he has 
adopted feminists’ concept of “defamilialization” to indicate institutionally 
moving services from the family to markets or states, a prerequisite for 
women’s employment. Yet feminists have also been concerned with policy 
strategies that undercut women’s economic dependence on husbands. 
State policies to provide child care may be linked to this goal in that 
employment has been the chief route by which women have pursued 
greater autonomy and power within and outside families, but other av- 
enues (e.g., caregiver allowances) have been supported, too. Conceptually, 
it is confusing to refer to changes in the institutional location of servicing 
(from family to market or state) and women’s escape from economic 
dependency by the same term. But this conceptual confusion reflects his 
lack of attention to men’s and women’s gender strategies and gender 
ideologies (to use Arlie Hochschild’s terms). Esping-Andersen simply relies 
upon taken-for-granted assumptions about what women want—two chil- 
dren, employment (but not careers?), plus child care and elder care sup- 
plied by the state or the market. He does not seriously entertain more 
egalitarian and less employment-oriented visions: “Most households, we 
can assume, would prefer to reduce the necessary unpaid hours for both 
partners” (pp. 59-60). Egalitarian or not, moral concerns about the quality 
of care as well as deeply cathectic gender identities are wrapped up in 
men’s and women’s decisions about care and employment. Can we un- 
derstand the low-fertility equilibrium obtaining across much of Europe 
simply with reference to institutional deficiencies, and not to the contes- 
tations around domestic work and its worth that play out across house- 
holds? (The fact that these equilibria are informally referred to as “birth 
strikes” might suggest the latter!) Similarly, can the rising rates of divorce 
and nonmarital childbirth, made possible by women’s economic inde- 
pendence, be understood apart from gendered understandings of marriage, 
love, and sexuality? Given the significant gender gaps in political support 
to welfare states observed across Western polities—gaps rooted in house- 
hold arrangements and women’s employment—as well as the importance 
of women’s employment and fertility to the fiscal health of welfare 
states and the productivity of postindustrial economies, gender analysis
will be invaluable to understanding current and future social policy
developments. 

Once again, Esping-Andersen has laid out for us an ambitious academic 
and political agenda. This book will certainly be of interest to the com- 
munity of comparative welfare regime analysts, but should be read as 
well by analysts of U.S. welfare reform, for it reveals that many of the 
policy changes these scholars are charting are not confined to America.
Political sociologists, analysts of labor markets and households, stratifi- 
cation researchers, and economic sociologists will all find the book pro- 
vocative and informative for themselves and their graduate and advanced 
undergraduate students. 


Restructuring Caring Labour: Discourse, State Practice, and Everyday 
Life. Edited by Sheila M. Neysmith. Oxford: Oxford University Press, 
2000. Pp. x+253. $24.95 (paper). 


Margaret K. Nelson 
Middlebury College 


The essays in Restructuring Caring Labour do the great service of bringing 
together two bodies of literature that are generally kept distinct—that 
which deals with economic and social restructuring and that which deals 
with caregiving. In doing so they clearly demonstrate the various ways 
in which shifts in the production and distribution of goods and services 
have made more problematic women’s paid and unpaid responsibilities 
for, and daily activities of, providing care for others. 

Each of the book’s three sections focuses on a different aspect of re- 
structuring and a different set of consequences. Four essays in “Part 1: 
The Powerful Presence of a Retreating State in Daily Life” explore how 
cutbacks in social welfare transform women’s lives as service recipients. 
Janet E. Mosher’s essay, “Managing the Disentitlement of Women,” ex- 
amines both material and ideological issues: it argues that the cutbacks 
associated with welfare “reform” threaten the economic well-being of poor 
women and their families while the discourse underwriting these cutbacks 
increases the stigma associated with being a welfare recipient. Karen J. 
Swift and Michael Birmingham’s “Location, Location, Location” considers
“welfare recipients” as well, demonstrating that women differently
situated—as defined by the kind of services received and immigrant
status—were affected by cutbacks in varied and contradictory ways. In
“Single Mothers and the Press,” Patricia M. Evans and Karen J. Swift 
return to ideological issues. They show that newspaper articles in the 
1990s, as compared to those a decade earlier, drew on statistics and expert 
talk to portray single mothers as not just “hapless” but “bad.” Only one 
chapter in this section Jane Aronson, “Restructuring Older Women’s 
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Needs”) breaks ranks with this attention to welfare recipients and single 
mothers to explore the effects of the retreating state on older women. 

The second section—“Of Time, Work, and Voluntary Action”—is especially
innovative in its inclusion of essays that examine how state
restructuring has shaped volunteer and community work. Susan Prentice
and Evelyn Ferguson argue that because women are “never entirely free 
of the motherhood mandate and the social expectations that they ought 
at all times to make their children their first priority . . . identifying when 
mothers’ volunteer participation is ‘freely’ chosen is a bit like chasing a 
chimera” (p. 136). Marge Reitsma-Street and Sheila M. Neysmith (“Re- 
structuring and Community Work”) pick up this same theme when they 
raise the question of whether welfare recipients can find autonomy, em- 
powerment, and a sense of belonging by engaging in community work 
when they are required to engage in these activities. In the book’s only 
historical piece, Carol Baines examines how an earlier transition in social 
work practice (1949-64) affected children and their care providers. 

The three essays in the final section, “Disappearing Jobs Reappearing 
as Invisible Work,” shift the book’s attention both from state policies to 
more narrowly “economic” ones and from women as recipients of social 
services to women in the labor force. With great insight, Marie Campbell 
(“Knowledge, Gendered Subjectivity, and the Restructuring of Health
Care”) demonstrates how cutbacks in the nursing staff of hospitals increase 
the demands placed on those who remain even as the managerial discourse 
underwriting downsizing makes ever more invisible the subtle work 
nurses do to hold their workplaces together and to ensure responsible 
care for patients. The last two articles explore how the unpaid work of 
care is affected by, and in turn shapes, women’s positions in the restruc- 
tured economy. Drawing on data about a rural area in which key man- 
ufacturing jobs had recently disappeared, Belinda Leach (“Transforming 
Rural Livelihoods”) shows that responsibility for the daily care of their 
families so limited the manner in which women sought replacement jobs 
that they were left relatively worse off than their husbands. In one of the 
few articles attentive to issues of race, Roxana Ng demonstrates how 
restructuring of the garment industry has heightened the exploitation of 
immigrant women. 

As this overview has suggested, the essays cover a very broad range 
of topics and issues. Neysmith’s introductory essay (“Networking across 
Difference: Connecting Restructuring and Caring Labour”) wrestles with 
the common themes and analytic concepts that transcend the individual 
foci. These include issues of power and agency (To what extent are women 
“victims” of restructuring forces and under what conditions are they em- 
powered to act on their own behalf?), framing (How, e.g., is the single 
mother constructed in the discourse of welfare reform?), citizenship, and 
social location (To what extent do power relations reproduce class, race, 
and gender inequities?). The essay also highlights two impressive features 
of the book as a whole. The first is the truly collaborative nature of the
endeavor: the contributors to the book were members of an interdisciplinary
group of scholars who came together as a “network” to share their
findings; their writings include more than token references to other essays. 
The second is the deep commitment on the part of all authors to “feminist 
praxis.” As Neysmith accurately notes, each author is committed “not only 
to [making] visible the restructuring costs borne by women but to do so
in a way that will promote social change in the social conditions that 
disenfranchise those who take on caring responsibilities” (p. 13). 

All of these articles are situated in the Canadian context, and in every 
essay the author is admirably attentive to the specific features of that 
context relevant to understanding the issues at hand. U.S. readers will 
find the broad outlines—state retrenchment, the shift from manufacturing 
to services, and increasing privatization—highly (and depressingly) fa- 
miliar. Moreover, because the lessons transcend the specifics, the work is 
of great general interest. Although many of the individual essays make 
for quite dense reading, they are well worth the effort put into them. 


Time and Poverty in Western Welfare States: United Germany in Per- 
spective. By Lutz Leisering and Stephan Leibfried. Cambridge: Cam- 
bridge University Press, 1999. Pp. xvi+379. $74.95 (cloth). 


Tony Atkinson 
Nuffield College, Oxford University 


Time and Poverty in Western Welfare States is a fine example of social 
science research that is theoretically well informed and empirically well 
grounded. The reader learns much about the dynamics of social assistance 
receipt in unified Germany, and the authors skilfully place this subject in 
its wider context. The book has its origins in the German text Zeit der 
Armut published in 1995 and translated into English by John Veit-Wilson. 
The present volume is, however, more than a translation, having been 
substantially revised. Together the authors and translator have, in the 
words of Ralf Dahrendorf’s foreword, “done a truly outstanding job in 
rendering a complex study into an eminently readable text” (p. ix). 

At the heart of the book is section 2, which describes the results of a 
longitudinal study of a 10% sample of social assistance claimants in the 
city of Bremen, interviewed initially in 1989 and followed up to 1994, 
and a parallel study of Halle in East Germany conducted from 1991 to 
1995. This empirical section is preceded by a theoretical introduction (two 
chapters) on the welfare state and the life course approach to poverty. 
The empirical study is followed by section 3 (three chapters) on the his- 
torical and social context of poverty in Germany, and section 4 (four 
chapters) on the future implications. Each section opens with an appro- 
priate woodcut from the 16th or 17th centuries—an interesting contrast 
to the Jackson Pollock on the book jacket.

To understand the findings, it is important to note that the authors
equate poverty with the receipt of social assistance (Sozialhilfe), stating
that “to claim social assistance is a sign of social descent” (p. 22). Not 
only is the resulting group small in size (3% of the German population), 
but this definition means that poverty is defined by administrative pro- 
cedure, rather than an external criterion, with the consequence that its 
dynamics may be influenced by administrative action. Moreover, no ac- 
count is taken of the position of those eligible for, but not claiming, social 
assistance. The authors cite studies in the 1980s finding non-take-up, or 
“hidden poverty,” to be around 50%; more recent studies for the 1990s
show even higher figures. An important part of the poverty story on which 
the present book cannot throw light is the conditions under which people 
have sufficiently low incomes to be eligible for assistance but do not make 
claims. 

Interpreting poverty in this sense, the authors find that “individual 
poverty situations are more liable to change and the people affected are 
more capable of coping than has hitherto been assumed” (p. 239). Between 
a half and three-quarters of all claims last for less than a year. These 
findings are interesting, as are the interviews with a small subsample in 
which claimants were asked for their subjective assessment as to whether 
receipt of assistance was “bridging” or “long term.” One has, however, to 
question the authors’ claim that their findings depart radically from past 
views of poverty. As they themselves recognize, the notion that people 
move in and out of poverty during the course of their lives was clearly 
described in Seebohm Rowntree’s study of poverty in York in 1899. The 
concept of the individual life-cycle has been central to the analysis of 
poverty by economists. While this may in some respects be less rich than 
the life course approach of the present book, it does make explicit the 
role of intertemporal allocation decisions. People are buffeted by all kinds
of adverse fortune, but they have some degree of control. The ability to 
cope is in part a function of past income and resources. To this extent, 
we need to look at the flow of resources in periods both before and after 
receipt of assistance. 

Contrasting the life course view with a static or class view of poverty, 
Ralf Dahrendorf sees the former as sounding “a note of hope and op- 
portunity” (p. x), but there is a trade-off. If there is a higher degree of 
mobility among the poor, then a larger fraction of the population expe- 
riences poverty at some point in their lives. How do we judge such a 
situation? If a now-famous actor spent much of his early years “resting” 
in receipt of social assistance, then this may well be regarded as simply 
an evening out over time. In contrast, the dependence of a single mother 
on assistance, even for a short period, may be of considerable social con- 
cern if it coincides with a crucial period in the development of her children. 

In the space available for this review, it is not possible to do justice to 
the many directions in which the reader is led. A book on social assistance 
claimants in Bremen may not sound like a wide-ranging contribution to
social science, but the authors have used their empirical research as the
platform on which to build a life course analysis of considerable interest. 


The Road to Poverty: The Making of Wealth and Hardship in Appalachia. 
By Dwight B. Billings and Kathleen M. Blee. Cambridge and New York: 
Cambridge University Press, 2000. Pp. xiv+434. $59.95 (cloth); $24.95 
(paper). 


Kathleen Stewart 
University of Texas, Austin 


This book brings us deep into the socially constructed, and far from 
timeless, poverty of central Appalachia. Arguing against both the culture 
of poverty and the internal colonialism models of persistent poverty in 
the region, The Road to Poverty uses court records, census data, land 
deeds, tax records, and ethnographic records of Clay County, Kentucky, 
to uncover the social origins and long-term institutional basis of rural 
poverty. Focusing on the systemic problems, rather than the symptoms, 
of poverty by tracing the economic, sociocultural, and political develop- 
ment of Clay County, it asks how a place grows poor. 

Billings and Blee meticulously research the historical legacy of three 
equally weighted structures: capitalist markets, state coercion, and cul- 
tural strategies. They are able to document how early markets and politics 
shaped the accumulation of wealth by a few Clay Countians who were 
well-positioned in local commerce and manufacturing, how slave labor 
used in salt extraction helped to inhibit economic development and di- 
versification, how the early social dynamics in the patriarchal moral econ- 
omy of agrarian Appalachia left a lasting imprint on the southern moun- 
tains today, and how early political conflicts deformed public life, helped 
to define local government as an instrument for private gain, and blunted 
the capacity of local government to confront poverty and envision alter- 
native futures. 

Arguing that cultural strategies cannot be understood apart from the 
larger context of capitalist markets and state coercion, the authors trace 
the discursive legacy of stereotypes of Appalachian fatalism, tradition- 
alism, and familism. The patterns of slave-ownership, early industrial 
development, and the growth of a commercial economy contradict sim- 
plistic assumptions about the isolation of antebellum Appalachia and the 
primitiveness of its early economy. Far from producing a disregard for 
land as a commodity, the kinship expectations and neighborhood consid- 
erations involved in agricultural self-sufficiency actually mediated the 
purchase and sale of land. Heavy reliance on courts in the 19th century 
to enforce financial obligations and to resolve disputes over debts con- 
tradicts stereotypes of Kentuckians as having little regard for the law. 
The roots of clientelism can be found in many of the earliest social practices
of frontier Clay County, and its intensification and entrenchment
can be traced through the collapse of political authority and other im- 
personal social institutions in Clay County in the 19th century under the 
pressure of social, economic, and political contradictions. The famous 
Appalachian feuds are shown to be an effect not of an ingrained and 
timeless sensibility, but of factionalism and schisms among the elites; elite 
conflicts intruded into public life, paralyzed local political institutions, 
and eventually erupted into intense violence at the turn of the century, 
which forced nonelites as well to decide which side they were on. 

Far from being particular to Appalachia, the cultural strategies and 
social relations of familism and traditionalism began to emerge under 
specific crisis conditions in the late 19th century: slow economic
growth, population increase and the subdivision of family lands, land 
shortage, and soil depletion. The same social patterns had emerged under 
the same crisis conditions in the late 18th century in older regions of the 
country such as New England. 

In Appalachia, long-standing social, political, and economic patterns 
of inequality helped structure persistent poverty in the 20th century. Highly
contentious and exploitative relations among outside capitalists in the 
timber, mining, oil, and gas extractive industries and Clay County’s local 
elites and nonelites fashioned the economic vulnerability that character- 
izes the county today. Entrenched political corruption, stagnation, and 
governmental incapacity to address the systemic conditions of poverty 
are not the peculiar symptoms of an isolated local way of life but the
complex effects of a long history of industrial development, state coercion, 
and the politics of social inequality. 

Finally, Billings and Blee are careful to debunk the notion that Ap- 
palachia is now and has long been a stagnant culture of passivity. They 
point out, for instance, that Appalachia’s traditionalism emerged in part 
as a resistance to wage labor at the turn of the century. They further note 
that Central Appalachia may well lead the nation today in the proliferation
of vigorous citizens’ movements for social, economic, and environmental
justice and that these movements have been fueled by a place-based
commitment to land, family, and historical memory and a strong
emphasis on self-rule and social equality. 

The book ends by suggesting four lessons from this case that are useful 
to basic research on inequality and rural poverty more generally: (1) that 
market-driven development is not a panacea for poor regions; (2) that 
even impoverished regions have rich social capital resources that must 
be preserved and enhanced; (3) that the local state is not simply a passive 
reflection of economic conditions, but a significant factor in its own right 
in shaping the course of rural social development; and, most generally, 
(4) that a long-term viewpoint on institutional legacies is crucial for understanding
the challenge of persistent poverty.


Children of the Land: Adversity and Success in Rural America. By Glen
H. Elder, Jr., and Rand D. Conger. Chicago: University of Chicago Press, 
2000. Pp. xx+373. $35.00. 


Barbara Schneider 
University of Chicago 


The booming U.S. economy has failed to reach several regions of the 
country, most notably the rural Midwest. Already economically con- 
strained by the agricultural crisis of the 1980s, today fewer and fewer 
independent family farmers can anticipate a life of farming or perhaps 
even living on a farm. Glen Elder, who studied the hardships of life for
families and their children in his seminal work, Children of the Great
Depression, and his coauthor Rand Conger examine in this new book
the lives of young people in rural America as they move from
middle school through high school graduation. Following a sample of
over 400 children, their parents, and their grandparents, the authors use 
extensive interviews, surveys, and teacher reports to show that, in the 
midst of severe economic hardship, many of these adolescents are suc- 
ceeding at school, have rich and productive social lives, and display strong 
feelings of self-confidence. Why these young people are resilient and re- 
sourceful when confronted with serious family economic adversities is the 
major theme of this book. 

Focusing on “ties to the land,” the authors show how differences in 
family ties to agriculture convey important variations in how parents are 
involved in their communities and the social investments they make in 
their children. Comparing four panels of families: some who continue to
farm, some who farm part-time, some who work in nonfarm jobs but 
grew up on farms, and some who live in rural areas but with no ties to 
farming, they find it is those families that have some ties to the land, 
even if only through their childhood environment, that tend to promote 
self-confidence, social leadership, and academic excellence in their children.
Nonfarm families who live in agriculturally dependent counties but
have no direct connections to the land and farming are the most vulnerable
to economic swings, and their children appeared to be the least resilient.

What is it about “ties to the land” that influences the development of 
young people? The answers the authors provide are not only analytically 
compelling, but they reveal invaluable insights for solving many of the 
problems facing our urban and suburban school communities as they 
struggle to provide meaningful environments for socializing and educating 
our adolescents into productive adults. Having “ties to the land” fosters 
taking responsibility, a sense of “stewardship” for conserving the land for 
generations to come. It creates a web of connections among members of 
the immediate and extended family and strengthens bonds across gen- 
erations. Children tend to have significant ties to their grandparents, 
which helps them develop a strong sense of self. These connections come
with explicit expectations and obligations that, if not carried out, can
result in life-threatening situations for animals, and ultimately the family 
livelihood. Standards of performance, whether they involve participation 
in worthwhile community or school activities or involvement in chal- 
lenging work, contribute to a sense of personal mastery. 

What seems to dominate the world of these young people are caring 
adults who play a leadership role in their lives. Regardless of education 
levels, income, and residential stability, these farm families are signifi- 
cantly more involved in civic groups, churches, and schools compared to 
nonfarm households. Parents who are socially engaged have children who 
are socially engaged. Even parents with limited time and other resources 
manage their children’s lives so they can participate in many different 
types of activities. This does not go unnoticed by their children, who 
perceive them as efficacious adults and role models. Although parent 
involvement continues to be promoted as a key factor in helping children 
learn, here we see what it is about these networks, how they can potentially 
operate, and what kind of real commitments parents make to ensure their 
child’s success both academically and socially. Religion is also important 
in these teenagers’ lives, and they look forward to and participate in 
services and church-sponsored activities. Peers also share these sentiments, 
and they along with their parents form strong social networks where the 
values of the family, school, and community are constantly reinforced. 

Not everything that happens in these communities is idyllic, and the 
authors show some of the negative consequences to children of working 
too closely with parents or being in small communities that encroach on 
personal autonomy and exercise considerable social control. Yet despite 
some of the constraints these youngsters face in their daily lives, the 
authors conclude that we have much to learn from the children and 
families of the land. And, clearly, that is the case. Here is a detailed,
comprehensive study, expertly analyzed, that presents us with new con- 
ceptual tools for examining adolescents’ lives as they move through high 
school and beyond. Building on earlier work, the authors carefully and 
systematically demonstrate what helps young people cope with adversity, 
which has relevance not only for rural communities, but for our urban 
and suburban communities as well. 


Farewell to the Peasantry? Political Class Formation in Rural Mexico. 
By Gerardo Otero. Boulder, Colo.: Westview Press, 1999. Pp. xiii+186.
$60.00. 


Michael J. Watts 
University of California, Berkeley 


Gerardo Otero returns to the Mexican agrarian question, arguably one 
of the most dramatic and important political theatres of the 20th century, 
which has been at the center of theoretical and political debate in Mexico
since the 1910 revolution. In its classic formulation in the late 19th century,
the agrarian question had three moments: the differentiation of the peas- 
antry, the politics of class formation in the agrarian sector, and the in- 
tersectoral political economy of capitalist industrialization. Otero focuses 
primarily on the first two and does not really provide a systematic treat- 
ment of the latter. He seeks both to transcend the Lenin-Chayanov debates 
(p. 2), which have stimulated much controversy in Latin American agrarian
circles since the 1970s, and to distinguish himself from the multiple var- 
iants of both the campesinismo (focused on struggles over and access to 
land) and proletarismo (accentuating the role of wages) approaches to
Mexican agricultural transformation. 

His treatment turns equally on a different account of the forms and 
character of social differentiation in the countryside and on an alternative 
model of political class formation. The general purpose, says Otero, is to 
question class reductionist assumptions in variants of both Marxism and 
populism through a comparative regional analysis of three case studies. 
All three regions—Puebla, Sonora, Coahuila/Durango—have in common 
the fact that capitalist agriculture was installed during the 1930s, which 
provided the original focus of agrarian struggles (p. 5). Otero shows 
through careful historical analysis and contemporary survey data (coupled 
with fieldwork conducted in the 1980s and 1990s) three differing trajec- 
tories of class formation from ostensibly similar origination points. His 
comparative political economy approach emphasizes process rather than 
structure in order to accentuate how the economic location of rural producers
does not determine political class formation, which is, rather, mediated
by the state, regional culture, and leadership type (p. 21). As he
puts it, class formation cannot be deduced from class position alone but 
is overdetermined by the content of demands and struggles, the character 
of class organizations, and the degree of autonomy of the movements (p. 
24). Underlying this approach is an implicit political project, namely that 
in the wake of the collapse of socialism, reform within the circumference
of capitalism represents the only alternative, and correspondingly that 
there is a space for what he calls “market-oriented but non-capitalist 
culture” (p. 25) given expression through two alternatives to proletarian- 
ization (the wage form): peasant-entrepreneurship (essentially the deep- 
ening of petty commodity production) and “postcapitalist production” 
(self-managed cooperatives and collective forms of organization). Yet in 
rural Mexico, the heterogeneity of economic class positions is capable
of generating a panoply of differing political outcomes depending on state, 
culture, and leadership. 

Farewell to the Peasantry? consists of two major parts. The first (chaps. 
3 and 4) sees postrevolutionary Mexico as the product of a land redistri- 
bution harnessed to bourgeois development and state hegemony. The re- 
forms to the Constitution in 1992 in tandem with neoliberal and agrarian 
policies represent, however, a major overhaul of the original agrarian 
reform program. Chapter 4 provides an analysis of social differentiation
between 1930 and 1990 and confirms the analysis of Alain de Janvry that
(1) the middle peasantry are disappearing (the “double crisis” of capitalist 
agriculture and the peasant economy), (2) the majority of direct producers 
are semiproletarian, and (3) uneven development has fostered marked 
regional heterogeneity. “Depeasantization without full proletarianization” 
(p. 9) raises the question of the direction and content of their class conflicts 
and struggles. 

Part 2 explores this question through the three case studies. In the case 
of Laguna, agricultural workers struggled for “typically proletarian de- 
mands” but received land from the state, and the ejidatarios unleashed 
a program of self-management and democratic production in the newly 
collectivized ejidos. But the state and the agrarian bourgeoisie crushed 
these initiatives, which in turn produced political fragmentation and an 
impoverished semiproletariat. In Atencingo, the collective ejidos were 
dismantled, and individual (i.e., household) production stimulated a pro- 
cess of internal social differentiation from which emerged “peasant en- 
trepreneurs” on the one side and a deeply impoverished semiproletariat 
(dependent on migratory income) on the other. In the Yaqui Valley, Otero 
sees the emergence of postcapitalist struggle in response to aggressive 
capitalist penetration in the region and mestizo in-migration; new demands 
involved not only land but other means of production and the democratic 
self-management of the productive process. Each of these three trajec- 
tories contained differing forms of political discourse and contrasting al- 
liances and solidarities with “progressive and democratic forces” (p. 150) 
in Mexico. 

Farewell to the Peasantry? is in some regards an unfashionable sort of 
political economy reflecting perhaps the influence of Ivan Szelényi and 
Erik Wright, with whom the author worked in the course of the disser- 
tation on which this book is based. But it is a model of rigorous com- 
parative analysis, and the author has provided an exceptionally rich and 
well-argued narrative that successfully explains why class agents that 
started out in similar positions followed differing paths and destinies. His 
sensitivity to the multiple determinants of class struggle and class for- 
mation is exemplary, even though the author is, I think, much stronger 
on state intervention than on either culture or leadership (the latter is 
quite undeveloped). One might also have wished for more on intrahouse- 
hold dynamics (self-exploitation?) and the role of gender, particularly since 
Otero places so much analytical weight on systems of reproduction. But 
this does not alter the fact that Otero has written a compelling and in- 
novative book, attuned to the jagged rhythms, the radical displacements, 
and the multiple trajectories of capitalist transformation in the Mexican 
countryside. The recent victory of Vicente Fox, and the thickening of 
civil society that Otero emphasizes, make Farewell to the Peasantry? a 
prescient text. Pity about the price. 


Crossing Borders, Reinforcing Borders: Social Categories, Metaphors, and 
Narrative Identities on the U.S.-Mexico Frontier. By Pablo Vila. Austin: 
University of Texas Press, 2000. Pp. xiv+290. $42.50 (cloth); $19.95 
(paper). 


David Spener 
Trinity University 


In this timely and pathbreaking ethnography, Pablo Vila describes the 
narrative process of identity formation that influences relations among 
the various regional, ethnic, and nationality groups residing along the 
U.S.-Mexico border. Using a dialogical method that draws upon the Frei- 
rean tradition of concienciación (see Paulo Freire, Education for Critical 
Consciousness [New York: Seabury Press]), Vila bases his analysis on a 
series of 254 focus group interviews conducted in the “twin cities” of 
Ciudad Juárez, Chihuahua, and El Paso, Texas, in the 1990s, in which 
participants discussed photographs taken of scenes in everyday life in 
both cities. These groups were composed of men and women of different 
ages, nationalities, religions, class backgrounds, ethnicities, and migratory 
experiences. Expecting his informants to organize their identities around 
the obvious, axial divide between Anglo-Americans and Mexicans on the 
border, Vila is surprised to find far more complex processes of symbolic 
differentiation at work. In reviewing the transcripts of these discussions, 
the author uncovers a number of recurrent identity “plots” incorporating 
a varied repertoire of social categories and metaphors involving ethnicity, 
nationality, and Mexican region of birth. Vila’s analysis of these plots is 
illustrated throughout the book with vivid excerpts of dialog from his 
informants. 

In Juárez, for example, many long-term residents distinguish themselves 
from newcomers from Southern Mexico, whom they maintain are re- 
sponsible for many, if not most, of the city’s social ills. Many juarenses 
also regard themselves as different from—and superior to—Mexican 
Americans and Mexican immigrants living across the border in El Paso, 
whom they regard as being “addicted” to a U.S. consumer lifestyle and 
as frequently looking down on or even mistreating “real” Mexicans like 
themselves. At the same time, Vila identifies another plot, in which some 
fronterizos (residents of the Mexican border cities) see themselves as being 
more advanced and successful than other Mexicans by virtue of the pos- 
itive influence of the U.S. economy and culture on the border. Upon 
crossing the international boundary into El Paso, however, the distinctions 
juarenses/fronterizos make between themselves and other Mexicans are 
lost on Anglos and Mexican Americans for whom they are “simply” Mex- 
icans. Indeed, one of the aspects of the border that makes it an ideal 
setting for reflecting upon the social construction of identity is that daily 
life there requires residents to routinely move back and forth between 
two societies that feature very different systems of classification of people 
on the basis of race, ethnicity, nationality, and regional origin. 

Similarly, Mexican Americans living in El Paso tell stories about them- 
selves in relation to multiple others, not just Anglos. Of particular interest 
to sociologists concerned with Latinos in the United States, Vila details 
how many Mexican Americans in El Paso do not feel much solidarity 
toward Mexican nationals, whom they associate with the poverty, cor- 
ruption, and lawlessness that they see as emblematic of contemporary 
Mexico. Some of these U.S.-born ethnic Mexicans adopt an assimilationist 
discourse that downplays their Mexican heritage in favor of their “Amer- 
ican” virtues. More interesting, other third and fourth generation Mexi- 
cans Vila interviews express great pride in their Mexican heritage, but in 
doing so hearken back nostalgically to a mythical, virtuous past that 
featured more “authentic” Mexican traditions. Vila’s findings from his 
focus group interviews were subsequently corroborated by the strong sup- 
port garnered from Mexican Americans for Operation Blockade, the 1993 
crackdown on undocumented border crossers launched by the El Paso 
sector chief of the Border Patrol, who was himself of Mexican ancestry. 
As he notes perceptively in his chapter dedicated to the crackdown, “the 
border patrol was doing physically what ... many El Pasoans (both Anglo 
and Mexican American) had been doing symbolically already: separating 
themselves from Mexican nationals in order to construct a valued nar- 
rative identity of people living in the United States” (p. 169). Vila’s ar- 
guments in this regard call into question much of the emphasis on hy- 
bridization and “border crossing” in the literature on the border region 
and its ethnic Mexican inhabitants (see, e.g., Gloria Anzaldúa, Border- 
lands/La Frontera: The New Mestiza [Aunt Lute Books, 1987]; Néstor 
García Canclini, Culturas híbridas [Grijalbo, 1990]; Oscar Martínez, Bor- 
der People: Life and Society in the U.S.-Mexico Borderlands [University 
of Arizona Press, 1994]; Renato Rosaldo, Culture and Truth: The Remaking 
of Social Analysis [Beacon Press, 1989]), since many of his informants, 
rather than embracing a transborder cultural unity among Mexicans, av- 
idly construct symbolic borders that set themselves apart from an “other” 
who is also Mexican. 

Vila’s main orientation is toward the social construction of meaning, 
an issue of considerable interest to cultural sociologists at the moment 
(see, e.g., the symposium, Charting Futures for Sociology: Culture and 
Meaning, Contemporary Sociology [July 2000]), and he exploits the special 
characteristics of the border to their fullest in this regard. Unfortunately, 
the book is organized so that his discussion of the broader theoretical 
implications of this rich ethnography is relegated to an appendix rather 
than integrated into an otherwise helpful introductory chapter. Readers 
who are interested in the Gramsci-tinged poststructuralist theory inform- 
ing Vila’s interpretive stance would do well to read the appendix before 
approaching the regional particulars addressed in the main body of the 
book. 

At the same time, the appeal of Crossing Borders, Reinforcing Borders 
is not limited to scholars of border studies and the social construction of 
meaning. Sociologists and anthropologists interested in issues of race and 
ethnicity, international migration, transnationalism, and qualitative meth- 
ods will also find much of value to them in this book. Readers who are 
familiar with the literature in these varied fields will undoubtedly make 
connections between the findings reported in Vila’s book and the contri- 
butions of other authors who are not mentioned in the text. With regard 
to race and ethnicity, for example, the works of Joane Nagel (1994) on 
ethnic construction, Michael Omi and Howard Winant (1986) on racial 
formation, and David Gutiérrez (1995) on relations between Mexican 
Americans and Mexican immigrants all came to mind as I read this border 
ethnography. 

Finally, readers should note that this book is but the first installment 
of a two-volume set. The second volume, provisionally titled Border Iden- 
tities: Narratives of Class, Gender, and Religion on the U.S.-Mexico Bor- 
der, will be published in 2001. I found this artificial division of the report 
on a single research project into two books to be frustrating since it obliges 
Vila to limit his interpretation of some fascinating narratives to their 
ethnic, national, and regional aspects rather than considering these in 
conjunction with the other important bases for identity formation signaled 
in the title of the second book. Notwithstanding this defect, Crossing 
Borders, Reinforcing Borders constitutes a substantial contribution to the 
sociological and anthropological literature on the border in particular and 
on identity formation more generally. 


The Colonies of Law: Colonialism, Zionism and Law in Early Mandate 
Palestine. By Ronen Shamir. Cambridge: Cambridge University Press, 
2000. Pp. xvi+216. $64.95. 


Lisa Hajjar 
Morehouse College 


In The Colonies of Law, Ronen Shamir focuses on the short-lived (and 
little-known) Hebrew Court of Peace (HCP) to rethink the historic pos- 
sibilities and processes of Jewish nation-building and the establishment 
of the Israeli state. The HCP, launched during the period of the British 
Mandate over Palestine by a group of jurist-nationalists, was an effort to 
utilize law in the revival of a Hebrew nation. Its advocates combined a 
quest for “authentic” Jewish secular laws and legal practices with the 
construction of a modern community-based institution capable of arbi- 
trating contemporary disputes. The HCP was separate from both the 
colonial (British) state and the rabbinical courts and sought to establish 
its legitimacy on the very basis of this separation. But, as Shamir explains, 
the secular Zionist leadership and Jewish religious authorities pursued a 
course of alliance and reliance on the colonial state, which served to 
undermine and ultimately destroy the HCP as an alternative legal system. 
The significance of the HCP initiative and the causes behind its demise 
are the empirical themes of this book. 

Shamir uses the history of the HCP to engage in a sociology of the 
“nation.” In part, this entails an effort to bring the colonial state back in; 
Zionist historiography tends to “forget” the colonial state by narrating the 
emergence of Israel in terms of struggles against and communal autonomy 
from British institutions in Palestine. However, by focusing on the HCP, 
Shamir’s work demands a reevaluation of the relationship between Zi- 
onism and colonialism. The processes of Jewish state building pursued 
by the Zionist leadership gave rise to integration and appropriation of 
political institutions and legal models, namely those of the British. This 
statist disposition ran counter to (and prevailed over) a nonstatist national 
project that the aborted HCP represented. 

This history of the HCP is a study of contrasts between what could 
have happened and what did happen, and why. For example, by the 
1920s most Jews in Palestine were from Europe but were subjects of a 
European colonial authority. Were they colonized or colonizers? By en- 
gaging this question around legal debates and institutions (British, Jewish- 
religious, and “New Hebrew” secular-nationalist), and the various social 
forces within the Yishuv that favored one or another of the options, 
Shamir clears a vista to review the impact of the colonial past on the 
Israeli present. 

The book begins and concludes with discussions of the appeal and 
limits of “organic” lawmaking and legal institution building. Of critical 
importance throughout is Shamir’s emphasis on the need to tease out the 
articulations and interplay of nation and state, religiosity and secularism, 
modernity and tradition, and legal professionalism and “popular justice.” 
The central chapters develop explanations about support for and oppo- 
sition to the HCP, whose advocates assumed an anticolonial stance and 
saw the development of this system as a means of carving and creating a 
national Jewish space within yet not part of a British colonial order. In 
contrast, secular nationalists affiliated with the Zionist establishment re- 
sisted such anticolonialism because of a refusal to see themselves as col- 
onized. Because the HCP was a communal rather than statist enterprise, 
its authority to enforce law depended on voluntary participation, will- 
ingness to submit conflicts to the tribunals and to abide by their decisions. 
The HCP’s authority was “popular” and its powers of enforcement were 
limited to shaming “shirkers.” Cultivating legal legitimation depended on 
participation and shame, which were forms of “New Hebrew” national 
praxis. 

For the religious establishment, the HCP’s efforts to create and ad- 
judicate Jewish law distinct from rabbinical authority were perceived as 
threatening and were therefore opposed. Although the Zionist and religious 
leaderships had sharply divergent interests and views, a mutual opposition 
to the HCP helped cement a secular-religious compact. This compact 
evolved into an Israeli state that is secular and a Jewish nation that is 
sectarian since boundaries of inclusion and exclusion (i.e., who is a Jew) 
are subject to supervision and control by (orthodox) religious authorities. 
Secular Zionists “gave” this power to those authorities to gain the support 
of non- or antinationalist Jews during the Mandate period in order to 
consolidate a Jewish nation-state. 

Shamir’s study provides a model social history of hegemony and 
counterhegemony of interest to readers with little specialized knowledge 
or interest in Israel. It highlights the constitutive—and con- 
flicted—capacity of law in the processes of “imagining” and building 
nations and states, and the legacy of European domination over other 
parts of the world. As a historical sociology of the Yishuv and its ante- 
cedent, this book is crucial reading for scholars who work on Israel be- 
cause it helps articulate the research agendas of a “second wave” of critical 
sociology and revisionist history. 


Battered Women in the Courtroom: The Power of Judicial Responses. By 
James Ptacek. Boston: Northeastern University Press, 1999. Pp. xii+240. 
$50.00 (cloth); $20.00 (paper). 


Neil Websdale 
Northern Arizona University 


James Ptacek draws from feminist scholarship on violence against women 
and more traditional interactionist sociology to interpret the judicial han- 
dling of cases of woman battering in two of the busiest district courts 
(Dorchester, Quincy) in Massachusetts. He employs content analysis of a 
random sample of 100 domestic violence case files that include the com- 
plaint forms, restraining orders, and women’s handwritten affidavits; field 
observations of the behavior of 18 judges; eight face-to-face interviews 
with judges; and telephone interviews with 40 women who filed for re- 
straining orders in 1992. The results provide useful insights into battered 
women's experiences with the judiciary. Ptacek also contributes to our 
understanding of the complex forces acting upon judges as they “do au- 
thority.” The language of women, albeit from telephone interviewing and 
court paperwork, is carefully presented. The author finds that judges are 
highly influential social actors in the disposition of domestic violence cases, 
contributing not only to the immediate resolution of competing needs in 
the courtroom, but also to the subsequent lives of the adversaries. 

The book opens with a high-profile case in Massachusetts in which a 
judge insensitively bungles a restraining order hearing, contributing, no 
doubt, to the subsequent murder of Pam Nigro Dunn by her estranged 
husband. Ptacek takes us through the research into domestic violence, 
pointing out its social patterning by class, gender, and race. He then offers 
a useful history of judicial responses to battering in Massachusetts. 

The author acknowledges that the more recent epistemological em- 
phasis on the voices of women has contributed greatly to our understand- 
ing of domestic violence. As Ptacek himself implies, he is on somewhat 
shaky ground when imputing the motives of men from a content analysis 
of the affidavits written by women. Interviewing those women face-to- 
face would have provided helpful additional information. Talking with 
their male partners, itself an approach riddled with difficulties, might 
have provided yet another window into men’s motives. In his example 
of how abusive men tell women restraining orders are “just pieces of 
paper,” (p. 169) Ptacek states, “He is saying the order will not protect her, 
that it will not prevent him from harming her, and that he feared no 
sanctions for violating it. Such a remark by a batterer is a threat” (pp. 
169-70). How does Ptacek know this? According to the classic power- 
control wheel, the abuser’s flippant attitude to the restraining order is 
one more way of threatening her. However, what of the hypothetical 
African-American abuser’s motives? He has long been degraded and ha- 
rassed by societal racism and the gulag mentality of the criminal justice 
system. When this black abuser tells his victim the restraining order is a 
“piece of paper,” is he telling her that he will victimize her in spite of the 
order? Or is he articulating a more diffuse and historically grounded 
resentment at a system that has terrorized and failed to protect his people? 
My point is that we do not know, and Ptacek’s resort to feminist expla- 
nations would be more powerful if he also pointed out the frequent failure 
of those approaches to explore race/ethnic and class relations. 

My epistemological fussiness also reared its head in Ptacek’s failure to 
match up his observations and subsequent interpretations of judicial de- 
meanor with what the 18 judges themselves intended to convey, or what 
other parties understood that demeanor to mean. We are treated to an 
almost linear typology of demeanors: a continuum from “good natured,” 
“bureaucratic,” “firm or formal,” “condescending” to “harsh,” without 
really knowing the socially situated meanings attached to that judicial 
behavior by various social actors, including judges themselves. Our un- 
derstanding of Ptacek’s typology and the behavior of the 18 judges he 
observed is not really extended much by his eight personal interviews 
with different judges who are among the most progressive in Massachu- 
setts on matters concerning domestic violence. He may have taken us 
further by interviewing the women who wrote the affidavits, the judges 
who presided over those cases, and the abusers themselves, thus tightening 
the correspondence between sources and identifying disparate patterns of 
socially situated meanings. 

My criticisms notwithstanding, this book assumes an important place 
in the literature on domestic violence. It is well written and offers insights 
not previously available. Judges are important players in the resolution 
of these ugly affairs, but criminal justice solutions will always be of limited 
worth in the arsenal against this grinding form of tyranny. Insofar as 
Ptacek’s book begins to situate judicial handling amidst a plethora of 
social and historical forces, it makes a worthwhile contribution and invites 
further research and scholarship. 


Codes and Contradictions: Race, Gender Identity and Schooling. By 
Jeanne Drysdale Weiler. Albany: State University of New York. Pp. 
xli+248. $23.95 (paper). 


William Van Buskirk 
La Salle University 


In this study academic achievement is intimately tied to identity forma- 
tion. The author distinguishes her work from the “black box” thinking of 
Marxist/structuralist reproduction theorists, the embedded advantages 
theories of the “new sociology of education,” and unidimensional feminists 
who privilege gender over class and race. Academic achievement is a 
function of identity formation understood as socially constructed. In this 
context, gender, race, and class are less nouns than interacting verbs that 
achieve salience from the concrete everyday interactions of students who 
try to make sense of the world as they pursue academic credentials. 

The study attempts to track the play of gender, class, and race on 
developing educational aspirations of white working-class, African-Amer- 
ican/Puerto Rican, and South American ninth-grade girls. The setting is 
an alternative high school in New York, a school for students who have 
failed in previous education. The school has a track record of success, 
graduating 85% of its students (compared to less than 50% in New York 
City high schools). 

As students speak for themselves, they show marked differences in 
terms of career goals, family aspirations, and the value of academic 
achievement. White working-class girls preoccupied with “traditional fem- 
inine” values exhibit vague career aspirations and see future well-being 
tied to finding a mate. African-American girls and girls of Puerto Rican descent, 
who have little experience with men as breadwinners, put clear priority 
on finding a place in the job market. These young women put romance 
on hold, fear early pregnancy, and have male friends their own age (as 
opposed to the previous group, whose romances tend to be with older 
boys). Finally, academically successful Dominican/South American girls 
saw themselves as “getting settled” in educational credentials before mar- 
riage, while those who did poorly emphasized a traditional feminine path 
of marriage and romance. All girls experienced conflicts between academic 
aspirations and traditional patterns of emotional attachment. In these 
descriptions, gender, race, and class memberships triangulate complex 
negotiations through which the girls understand their worlds, imagine 
their futures, and conceive incipient strategies for negotiating the conflicts 
embedded in their communities. 

These negotiations proceed in the setting of an alternative high school. 
It is briefly described as having a “cluster” structure in which 20-25 girls 
are grouped together with a teacher, a guidance counselor, and a “house 
mom.” Teachers bond with the girls, becoming friends rather than au- 
thority figures. The school is located on a college campus and students 
have college IDs. There are “internships” available for girls who do well 
in class and a career education course where girls learn about the world 
of work and receive mentoring. Traditional classes are also offered. 

The key elements of AHS consist of its track record of success, em- 
powered faculty, and positive relationships between faculty and students. 
Cooperative learning and active learning, approaches characteristic of 
upper middle class schools, are utilized extensively. Classroom control is 
implicit with a minimum of control over student behavior. 

The author sketches the impact of the school setting by positing an 
unintentionally communicated gender code through which girls received 
messages about appropriate aspirations and behavior. These symbolic 
messages are internalized largely through the girls’ interpretations of 
teacher behavior and school practices. Messages may be consistent or 
conflicted, and girls may internalize them or resist them. Classroom de- 
scriptions are extremely lively and demonstrate vividly her central point 
about the primacy of interpretation to the development of “gendered, 
classed and raced” identities. 

The approach used in this study is quite promising in understanding 
academic achievement and school reform in a much subtler and “student 
friendly” way. However, I found myself wishing it had cast a wider net 
to capture more practices as they affected the girls’ emerging identities. 
There was little or no mention of disciplinary codes, school mission, or 
traditional courses (especially Algebra and English). In addition, when 
the focus shifts to the school setting, the girls cease to speak for themselves. 
We see them through observation and inference of author and teacher. 
The potential richness of the gender code as a way of understanding 
identity formation and academic success is thus blurred. More indirect 
observation would also help. Descriptions of how teachers are empowered 
through administrative practices would be interesting. Indeed, the ad- 
ministrator/leader voice seems to be missing entirely. The school seems 
at times to be unled by anything more than interpersonal relations. A 
reflection from the principal on the results of the study might have added 
a lot in terms of tying organizational context to the incipient processes of 
identity formation. 


Medicalized Motherhood: Perspectives from the Lives of African-Ameri- 
can and Jewish Women. By Jacquelyn S. Litt. New Brunswick, N.J.: 
Rutgers University Press, 2000. Pp. xiv+189. $20.00 (paper). 


Julia Grant 
Michigan State University 


In the 1920s, advocates of scientific parenting often drew on the character 
of Topsy, the slave child in Harriet Beecher Stowe’s Uncle Tom’s Cabin 
who “just grew,” as the antithesis to the properly brought-up child. That 
the iconic Topsy was black was no accident, as Jacquelyn Litt’s extremely 
interesting and provocative Medicalized Motherhood: Perspectives from 
the Lives of African-American and Jewish Women underscores. Her book 
illuminates the race, class, and ethnic underpinnings of the tenets of sci- 
entific motherhood and the stark differences in the ways in which these 
tenets were received by women. 

Through an analysis of interviews with 38 women born between 1894 
and 1930, Litt explores African-American and Jewish immigrant women’s 
responses to expert child care advice during the first half of the 20th 
century, a pivotal period of medicalization. Her decision to contrast these 
groups proves enlightening, since both were newcomers to the world of 
scientific motherhood during the early decades of the century, as immi- 
grants to the New World and migrants to the urban North. Their re- 
spective experiences with the U.S. medical establishment were radically 
different and indicative of the degree of incorporation of each group into 
mainstream culture. 

One of Litt’s most significant findings is her discovery that Jewish- 
American women embraced scientific motherhood as a keystone of assim- 
ilation and upward mobility. Second-generation Jewish women distin- 
guished themselves from their mothers by their valuing of modern 
scientific remedies for children’s maladies and their reliance on neigh- 
borhood physicians. They were supported in these choices by a network 
of Jewish doctors from within their own communities and like-minded 
mothers from their ethnic background who equally valued the new mod- 
ern ways. Litt argues that women’s decisions to adopt new child care 
methods were not simply a matter of following “technical requirements 
for their children’s good health” but involved “promoting a gendered and 
racialized respectability” (p. 3). Litt sees modern motherhood as “status 
work” that involves distinguishing responsible “modern” mothers from 
their more backward counterparts. One thing that is missing from this 
discussion, however, is an analysis of the contributions of Jewish culture 
in fostering a respect for scientific authority. Italian immigrant women 
were arriving in the United States during the same time period but did 
not exhibit the same degree of medicalization as did their Jewish 
counterparts. 

A further problem arises from Litt’s reading of women’s narratives. At 
times her theoretical construction of women’s motherhood seems rather 
rigidly layered over the words of the women she interviewed. Jewish 
women often viewed the physician’s word as gospel and tended to choose 
doctors who were held in high esteem by their communities. This choice 
certainly provided a passport into the community of forward-thinking 
mothers, but also shows that Jewish women trusted in the power of med- 
icine to promote their children’s good health. Yet Litt seems to feel that 
women’s decisions to partake in modern medicine were more closely tied 
to social aspirations than the immediacy of their children’s well-being, 
an assumption that may not have been shared by her interviewees. 

Litt is right on the mark, however, in demonstrating the differences in 
access and attitudes toward the medical establishment between Jewish 
and African-American mothers. While Jewish women had family friends 
who were doctors, African-American women often had to confront a hos- 
tile white medical establishment. Their entry into the medical establish- 
ment was often provided by white employers or bourgeois reformers from 
within their own race. One of Litt’s interviewees, whose family included 
several generations of physicians, denounced the prevailing trope of 
“Topsy” to describe African-American children: “It just kills my brain for 
people to think that we have been like Topsy and we just growed up all 
of a sudden” (p. 122). For these women, participation in modern medicine 
was a matter of civil rights and a sign of respectability and good parenting. 

Working-class African-American women, though, often viewed the 
medical community with suspicion and distrust. Unlike Jewish women, 
African-American mothers continued to rely on traditional networks of 
advice and support in the rearing of children throughout the process of 
urbanization. The African-American women Litt interviewed described 
traditional home remedies with nostalgia. But modern medical expertise 
surfaced in unexpected places—for instance, in the narrative of a woman 
who got her mothering advice from her own mother, advice that focused 
on the importance of a sterile bottle. Her story reveals the potential power 
of public health campaigns when they tap into the maternal networks 
that are responsible for disseminating child care advice. 

Medicalized Motherhood will be of great interest to scholars and stu- 
dents in the history and sociology of motherhood and medicine, but it 
also should be read by those involved in parent education. Litt claims 
that medicalized motherhood serves to heighten divisions among women, 
but her book also provides clues about how best to lessen the distance 
between medical expertise and diverse communities of mothers. 

Testing Women, Testing the Fetus: The Social Impact of Amniocentesis in
America. By Rayna Rapp. New York, NY: Routledge, 1999. Pp. xiv+361. 
$30.00. 


Adele E. Clarke 
University of California, San Francisco 


Rave! Rave! Rave! This is a brilliant book on the new genetics by one
of the most theoretically sophisticated scholars of our time. Rapp’s work 
makes significant and enduring contributions to multiple areas: sociology 
of science and technology, medical sociology and anthropology, research
methods, women’s studies, feminist theory, clinical genetics, medicine, 
nursing, and so on. 

The book starts from a radical site: difference of many kinds among 
pregnant women in New York City who were offered amniocentesis to 
diagnose possible genetic abnormalities in their fetuses. Differences in- 
cluded, but were not limited to, race, ethnicity, culture, class, power, gen- 
der identity, citizenship, migration history, employment, education, local 
community, family history, and self-identity. Each and all and other factors 
shaped how those women addressed the entrée of genetic testing into their 
lives. 

The project on which the book is based has been 15 years in the making. 
During this period, Rapp “followed that technology” of amniocentesis to 
wherever she was led in the research process to create an incisive and 
beautifully bridged multisite ethnography that examines a pregnancy test- 
ing laboratory, support groups, families of children with Down syndrome, 
genetic counselors, geneticists, friendships, technologies, lab technicians, 
women who used amnio as well as those who refused the test, and some 
male partners. 

Rayna Rapp has been a major contributor to feminist anthropology 
since she edited Toward an Anthropology of Women in 1975. Honing fem- 
inist theoretical insights still further here, Rapp offers a woman-centered 
analysis of a technology that targets women at a highly vulnerable mo- 
ment—pregnancy. Because they must deal with such transformative tech- 
nologies with little if any guidance from social policy, Rapp appropriately 
calls these women “moral pioneers . . . forced to judge the quality of their 
own fetuses, making concrete and embodied decisions about the standards 
for entry into the human community” (p. 3). 

The book centers on three major areas. First is the study of pregnancy 
and reproduction vis-a-vis the elaboration of life-changing technologies. 
Rabinow terms such intersections sites of “biosociality,” where fluid and
innovative new social forms and processes must be negotiated. They engender
what I call new “technoscience identities”: risk identities, genetic
identities, and so on. In this sense, Rapp is studying, through accounts 
of lived experience, a new regime of regulation of the population. What 
she finds is yet another site of stratified reproduction along the usual social
faultlines. 

The second area of concern is the complex and often anguished intersection
of disability rights and abortion rights. Some disability rights activists
today believe that prenatal testing is the eugenic “final solution”
for disabled people through organized genocide. Abortion rights, so deeply 
contested and at this historic moment deeply vulnerable in the United 
States and elsewhere, are often justified on the basis of such prevention. 
Abortion is the only available “treatment” for many genetic fetal anomalies 
that, like other genetic problems, biomedicine is increasingly capable of 
diagnosing. Scientists associated with the Human Genome Project and 
with stem cell research have, however, studiously evaded addressing these 
issues, leaving feminists to bear the brunt of this moral stand and to do 
the work of defending abortion rights. Rapp argues very clearly for both 
abortion as an option that allows women and their families to decide 
whether they can raise a disabled child and dramatically improved inclusion
of disabled people in social life and in social and medical services
for them. 

The third major area of concern lies in the difficult issues of scientific 
literacy and access to technology. Unlike new reproductive technologies 
that treat infertility, genetic testing is widely available to pregnant women 
in the United States. In California, for example, physicians must offer it 
to all pregnant women as part of a state-sponsored program. Women vary 
dramatically in their knowledge of genetics, hereditary diseases, and the 
technology of amniocentesis itself. And, as with so many technologies, 
knowledge about its capabilities, problems, and consequences is also con- 
tested. Pregnant women are thus moral pioneers in a technically ambig- 
uous situation. 

Because it is so well written, Testing the Fetus will be among the most 
widely read books on the new genetics. Rayna Rapp is a storyteller ex- 
traordinaire who carries us along in conversations with her wonderfully 
candid and thoughtful informants. (For more on how this project illu- 
minated the way in which feminist methodology bleeds into everyday life, 
see Rapp’s “One New Reproductive Technology: Multiple Sites” in Re- 
visioning Women, Health, and Healing, edited by Adele Clarke and Vir- 
ginia Olesen [Routledge, 1999].) 

Rapp concludes that there is no single “right way” to provide genetic 
testing, no universal policy about whether and how to offer prenatal 
testing. The situation with genetic testing is parallel to the “cafeteria 
approach” currently used with contraception: offer as many means as 
possible and let women choose based on their own perceptions of their 
needs, goals, and life situations. Of course, like many other new sciences 
and technologies, this too shifts the burden of biomedical decision making 
to users/consumers/patients. Rapp reminds us that we do not know where 
science and technology begin and end as they increasingly become inte- 
grated in our lives and with what it means to be human. 

Transmen and FTMs: Identities, Bodies, Genders, and Sexualities. By
Jason Cromwell. Urbana: University of Illinois Press, 1999. Pp. xii+201. 
$42.50 (cloth); $19.95 (paper). 


Laurie Essig 
Yale University 


Jason Cromwell is an anthropologist who has conducted years of fieldwork 
among female-to-male transsexuals (FTMs) and is himself also a female- 
to-male transsexual. Cromwell’s position as a “participant” and an “ob- 
server” puts him in a unique position to deliver a ground-breaking work 
about FTMs. Unfortunately, Cromwell's book provides very little new 
information or insight into the subject. This is all the more frustrating
because I have the sense that Cromwell could have written a rich eth- 
nography of the particular female-to-male transsexual communities to 
which he had access. 

The book begins and ends with a few fascinating pages about Crom- 
well’s own personal journey into masculinity and some far too brief de- 
scriptions of other FTMs. Even in these pages, however, Cromwell does 
not tell us anything new. Cromwell’s point, that human bodies and desires 
are far messier than strict binaries like male/female and heterosexual/ 
homosexual, has been made previously by Sandy Stone and Kate Born- 
stein, themselves transsexuals (see, e.g., “The Empire Strikes Back: A 
Posttranssexual Manifesto” in Body Guards, edited by Julia Epstein and 
Kristina Straub [Routledge, 1991] and Gender Outlaw [Routledge, 1994]).
Still, it is refreshing to read some first-hand accounts of how men with 
vaginas and gay men who used to be women can blur seemingly im- 
mutable categories. In between this too-brief ethnography, Cromwell 
spends his time claiming a transsexual standpoint, recycling theory, and 
creating a “history” of FTMs from secondary sources. 

For instance, Cromwell says that transsexuals are “not like other people. 
Rather than allowing society to dictate who and what they are, they define 
themselves” (p. 43). It is unclear how transsexuals escape the culture in 
which they live or why Cromwell, a student of culture, would ignore how 
transsexualism, like all identities and practices, is forged within our cul- 
ture. Rather than assuming that everything said by transpeople is true, 
Cromwell should have considered how phrases like “I'm not a lesbian” 
and “I'm simply a man” are capable of being both radically transgressive 
and simultaneously supportive of existing power relations. 

Despite Cromwell’s claim of a transsexual standpoint, he rarely quotes 
transsexuals directly. Instead, he makes claims about what FTMs think 
and do without providing any evidence or even examples. Worse, Crom- 
well subjects the reader to his own cursory readings of theorists such as 
Judith Butler. For instance, Cromwell reads Butler’s notion of gender 
performance as meaning that gender is something an individual takes on 
or off at will, like a costume. Cromwell then moves from this misunderstanding
of gender performativity to claim that gender is not a performance
for transpeople or anyone else. This misunderstanding is bad enough,
but then Cromwell does not provide any evidence of the stability of FTM 
gender performance. 

The historical analysis is nearly as clumsy. Cromwell culls secondary 
sources for stories of FTMs in other cultures and other times. This account 
of the supposed universality of female-to-male transsexualism uses sources 
spanning hundreds of years and dozens of cultures without any analysis 
of how transsexualism's meaning and enactment would have been dif- 
ferent 400 years ago in a radically different culture than it is now for 
Cromwell and his community. My own research in Russia points to a 
very different notion of female-to-male transsexualism than the American 
one. The Russian transsexuals I interviewed believe, like many psychiatric 
and medical experts there, that any lesboerotic desire is pathological and 
should be “cured,” while transsexualism is part of human nature. Russian 
FTMs rarely want a different body, but a different gender, which allows 
them the legal right to marry women, to work in jobs reserved for men, 
and to be viewed as heterosexual along with their female lovers. Russian 
FTMs, however, rarely change their bodies or even their clothes or hair. 
Given this far more metaphorical understanding of transsexualism, it is 
not surprising that there are 10 times as many female-to-male transsexuals 
in Russia as there are male-to-female (the exact opposite ratio as in the 
United States). My point here is not that Cromwell should have considered 
other cultures, but that he should not have assumed a transhistorical and 
transcultural transsexualism. 

In other words, Cromwell could have used his contacts and resources 
to write an ethnography of himself and other FTMs. He could have filled 
his pages not with claims for an unchanging and unerring transsexualism, 
but with the lives of himself and other FTMs and the meaning of those 
stories for American culture. That is what good ethnography is about, 
the ability to participate and observe, to tell stories and to let stories be 
told and to put those stories within an analytic framework. I only hope 
that Cromwell’s next book will actually be about his community of female- 
to-male transsexuals and their bodies, sexualities, genders, and identities. 

Telling Women’s Lives: Subject/Narrator/Reader/Text. By Judy Long. New
York: New York University Press, 1999. Pp. ix+185. $55.00 (cloth); $19.50 
(paper). 


Women’s Untold Stories: Breaking Silence, Talking Back, Voicing Complexity.
Edited by Mary Romero and Abigail S. Stewart. New York: Routledge,
1999. Pp. xxiii+282. $75.00 (cloth); $19.99 (paper).


Judith Wittner 
Loyola University Chicago 


Stories told by women from their own perspectives have added women’s 
lived experience to sociology’s storehouse of data and helped to expose 
assumptions based on men’s standpoints. At the start of the feminist 
movement in sociology, studying women’s lives by listening to their stories 
seemed a good enough method of research and politically significant. Now, 
three decades later, discussions about feminist methodology have become 
central to the ongoing feminist project. At the heart of this discourse are 
shared understandings. Most feminists base their research on insiders’ 
particular knowledge and on their own and others’ multiple perspectives. 
They study everyday life and worlds once relegated to the “private” sphere. 
They treat women as history makers in ways different from the heroic 
modes of action attributed to men. The two books reviewed here are part 
of this discourse. Telling Women’s Lives looks critically at nonfeminist 
biographies, autobiographies, and life histories and draws on feminist 
approaches to elaborate an alternative paradigm that depends on openness 
and mutuality between researchers and their subjects. The contributors 
to Women’s Untold Stories present cases contrasting “master narratives” 
about women with stories told by women and their empathetic interloc- 
utors. Though based on insiders’ stories, these tales focus less on collab- 
orative and reflexive methods of accounting than on the resulting stories, 
moving the discussion in new directions. 

In Telling Women’s Lives, Judy Long criticizes androcentric narrative 
traditions. Most biographies, autobiographies and life histories, she writes, 
have been about men or about women in men’s worlds. They focus on 
dramatic action, not daily life, on visible work, and on independent, sol- 
itary subjects who suppress emotion and ignore their connections with 
others. Long shows how such traditions are fundamentally unsuited to a 
sociology of and for women. She argues that reflecting women’s experience 
adequately requires attention to daily life, emotional labor, and networks 
of relations. Research must be reconceived to work from, not against, 
these actualities, which have “deep roots in female culture” (p. 49). Part 
of the problem, in Long’s view, has been a commitment to a version of 
objectivity that separates a researcher from her subjects, obscures im- 
portant dimensions of the research relationship, and severs texts from the 
conditions of their production. Her project is to put forward a “praxis of 
empathy” (p. 119) based on the connectedness of “narrators” (Long’s term
for feminist researchers) and their subjects through their participation in 
women’s cultures, their shared activity as knowledge makers, and their
equivalence as interpreters of the knowledge they jointly produce. The 
connected, empathetic, and visible narrator works alongside, not above, 
her subject, includes her own voice in the text, and allows the subject to 
talk back. Knowledge is produced, “not merely accessed,” in this dialogue 
between narrator and subject (and later between narrator, subject, and 
reader). Sociologists should focus on these relations and this dialogue, 
rather than only, or even primarily, on the lived worlds of subjects. 

The contributors to Women’s Untold Stories contrast familiar “master 
narratives” that naturalize prevailing power relations with women’s own 
accounts of their lived experiences. The editors, Romero and Stewart, see 
this critical work as the first step of a feminist sociology oriented toward 
change. Beginning with women’s stories is a political act that reorients 
inquiry from its focus on the problems of the discipline toward the prob- 
lems and concerns of story-tellers. The task of sociologists is to help con- 
nect these individual accounts to the broader relations of class, race, and 
gender. As Maria A. Gutierrez Soldatenko writes in her case study of a 
sweatshop worker’s career in the Los Angeles garment industry, “instead 
of attempting to understand Berta’s life experience through the globality 
discourse we will examine global social relations through Berta’s indi- 
vidual life experiences” (p. 258). 

The first section, “Breaking Silence,” tells stories that have been 
“crowded out” (p. xvi) by master narratives, such as black women’s strug- 
gles with infertility (against the master narrative of black women’s fe- 
cundity) or Japanese-American women’s internment narratives. These 
stories are about how women actually live in the shadows of life- 
constructing ideologies. The second section is about how women “talk 
back” by challenging master narratives about gender identity, homeless- 
ness, drug-using mothers, maid/employer relationships, and Navajo “tra- 
ditionalism.” P. J. McGann shows how some women improvise resistance 
to master narratives. In her “tomboy’s” life story, the subject, Erika, 
“skirt(s) the gender normal divide” (p. 105) after serendipitously embarking 
on a career as a performance artist. There are also examples of how 
connecting personal stories to more global concerns illuminates the strategic
choices that are part of daily life. Romero’s story of Olivia, a Mexican
live-in maid's daughter, is a “microcosm of larger issues in race relations 
today” (p. 157). When her mother’s employers call Olivia “one of the 
family” (p. 142), yet continue to exploit her mother, Olivia understands 
how class and race struggles are part of personal life. This insight helps 
her to choose solidarity with her Mexican working-class community over 
the privileges of upward mobility. These are complex stories with am- 
biguous endings. Alicea and Friedman call attention to the “tension of 
resistance and reproduction” (p. 173) in the account by Millie, a drug-using
woman who rejected the popular view of junkie mothers as selfish
and immoral by embracing the master narrative of sacrificial motherhood. 

Articles in the third section, “Voicing Complexity,” explore women’s 
political, intellectual, and occupational lives in the public sphere. In these 
stories, women invent new possibilities and alternatives to lives shaped 
in and through master narratives. Abigail Stewart draws on Gregory 
Bateson’s notion of improvisation to explain how her subject developed 
her political projects, showing how formulaic understandings of the life 
trajectories of activists hide what we most need to know. Joan Ostrove’s 
stories of activist women counter the myth that activists become “sell- 
outs” later in life and reveal some ways that gender, race, and class shape 
activism. Along similar lines, Xiong and Tatum describe how a second- 
generation Hmong woman creates new ways to be Hmong out of her 
struggles against racism and against the patriarchy of her immigrant 
community. 

Several of the articles in this volume make the narrator/subject relation 
explicit and show its worth. For example, when Amy Schulz collaborated 
with coauthors Faye Knoki and Ursula Knoki-Wilson, she learned un- 
expectedly how Navajo women use traditional knowledge to negotiate 
relevant, contemporary, and resistant Navajo identities with and for their 
English-speaking children and grandchildren. P. J. McGann connected 
her own tomboy biography to those of the tomboys she interviewed, learn- 
ing to collectivize her personal story. But even when not explicitly stated, 
it is clear that each of the studies here grows from an approach that treats 
subjects as knowers and narrators as those who learn from them. 

In sum, Telling Women’s Lives brings together many strands of the 
feminist discourses of the past three decades. It usefully applies feminist 
criticism and methods to its subject matter and offers interesting readings 
of classic sociological life stories. However, Long’s claims that feminist 
work is marginalized seem dated and her focus on the narrator/subject 
relationship reprises a long-standing and already well-developed theme 
in feminist studies. The articles in Women’s Untold Stories move beyond 
concerns with reflexivity to direct attention to the social and political 
spaces in which women are building counter-institutions and narratives. 
Alicea and Friedman conclude that although the women whose stories 
they heard “face the complex and difficult task of carving out space for 
themselves within and against a very complicated web of master nar- 
ratives and discourses,” their stories “keep the door open for possibilities 
and yet unasked questions” (p. 173). That is a major contribution of this 
collection. 

Feminism and Islamic Fundamentalism: The Limits of Postmodern Analysis.
By Haideh Moghissi. London: Zed Books, 1999. Pp. x+166. $55.00
(cloth); $19.95 (paper). 


Azam Torab 
Cambridge University 


Haideh Moghissi’s book is a discourse on the highly charged debate over 
“Islamic feminism.” The author passionately rejects any notion of femi- 
nism within an Islamic framework. As it happens, this debate is partic- 
ularly heated among secular Iranian feminists like Moghissi writing in 
the West. Indeed, this book is more about the author’s polemical and 
provocative contention with other Iranian feminists than about a critical 
inquiry into a theoretical problem implied by the book title, namely the 
relationship of feminism and postmodernism. The problem is, of course, 
linking political activism with theoretical analysis. Certainly no new insights
are gained in this book into how best to deal with the political
and theoretical complexities of speaking “about” women while avoiding
speaking “for” other women.

These other feminist scholars, Moghissi claims, have adopted an un- 
critical, relativist, postmodernist stance toward fundamentalist regimes 
like the Islamic Republic of Iran regarding women’s issues under the 
guise of hearing “women’s own voices.” She views their studies as “troubling,”
“apologetic,” “journalistic,” “government propaganda,” which indulge
in neo-orientalist “exotic fantasies” about “Muslim women” (pp. viii,
7, 85), all of which reflects “the drift of professional opportunity” (p. 135). 
Such authors have “suddenly discovered the Islamic path to women’s 
emancipation as the only viable, home-grown and culturally appropriate 
alternative” (p. 40) and “present an enviably rosy picture of women’s lives 
in Islamic societies with little correspondence to reality” (p. 41) since they 
overlook “the role of Islamic legal institutions and practices in maintain- 
ing, through the ages, the specific patriarchal order which circumscribes 
women’s lives in Muslim societies” (p. 38). In sum, “if, in the past, Muslim 
women could not be seen as anything but victims of male aggression, now 
they are represented as independent-minded, gender-conscious citizens 
who participate in the social and political life of their societies” (p. 41). 
Oddly, Moghissi admits that studies of women in the Middle East are 
“moving in the right direction toward a more informed and sensitive 
understanding of the complexity of women’s lives, decoding the coloni- 
alist, self-promoting Eurocentric representation of women in Islamic so- 
cieties” (p. 38). Here she is implicitly acknowledging postmodernism’s 
questioning of how theoretical suppositions underpin data collection, anal- 
yses, and interpretation. 

Moghissi’s derisive dismissal of the contributions of most of this recent 
scholarship is unfair, misjudged, and unproductive. It is one thing to 
dismiss works on grounds of lack of serious scholarship, for which there 
is little evidence in Moghissi’s book, but it is another thing entirely to
dismiss other studies on ideological grounds under the guise of scholarship. 
Not only do many of those criticized have different concerns—sociological 
rather than ideological—but they are also from different disciplines with 
other methods. In contrast to Moghissi, they are at least willing to explore 
the possibilities of an “Islamic feminism” in the context of current social 
and political realities. 

Moghissi’s own discussion of “Islamic feminism” is contradictory. She 
claims that the notion of “Islamic feminism” is an oxymoron (p. 134) but 
provides no independent analysis for her argument that no feminism is 
possible within an Islamic framework. Her stance here as to what is 
politically acceptable is as dogmatic as the “fundamentalists” she scorns. 
Yet, feminisms are many, and Moghissi oddly concedes that feminists are 
now urged to respect difference, including “religious” forms of feminism. 
Indeed, toward the end of her book she grants that alternative interpre- 
tations of the Qur’an and other Islamic texts are inevitable with regard 
to women’s rights (p. 130). What conclusions can be drawn? Is it that 
Islam and feminism are not compatible? The answer, as Moghissi finally 
admits (p. 141), is more tangled than supposed. What Moghissi does not 
seem to realize is that it is precisely because in specific contexts some 
forms of difference may be more important than others that it is essential
that scholars examine those contexts in which certain differences become 
powerful and are perceived locally as appropriate at a particular time 
and in particular social and cultural formations. 

Moghissi’s book is a continuation of early feminist writings that focused 
on the patriarchal bias in Islamic texts, thereby overvaluing male per- 
spectives despite their feminist impetus. Theoretical concerns have long 
moved on from a focus on structural and material determinants to how 
individuals as active agents position themselves within relations of power. 
If it is true that the grid of “surveillance” (Foucault) of fundamentalist 
regimes has increased, it becomes even more urgent to examine sources 
of empowerment and ways in which people can and do resist, and the 
potentials for change within the Islamic framework. One would thereby 
engage in a more fruitful debate. 


Max Weber and Islam. Edited by Toby E. Huff and Wolfgang Schluchter. 
New Brunswick, N.J.: Transaction Publishers, 1999. Pp. viii+332. $34.95.


Said Amir Arjomand
State University of New York at Stony Brook 


This book is a collection of essays first published in German in 1987 and 
usefully updated with an extensive new introduction by Toby Huff. Its 
appearance well over a decade after its German publication nevertheless 
indicates that the supply of serious English-language scholarship evidently 
lags behind the demand in the area of the Weberian study of Islam. The
essays by Huff, Schluchter, and Crone focus on Weber’s sociology and 
the place of Islam in it. The other essays are substantive contributions to
the study of Islamic history and societies from a Weberian perspective. 

Huff rightly defends Weber’s sociology of law against Patricia Crone’s 
virulent and wholesale rejection (chap. 9) and points out that she mis- 
understands his characterization of Islamic law (the best informed of his 
writings on Islam) and that it can be applied with profit to a wide range 
of phenomena, as in Schluchter’s explanation of different patterns of
development of states and cities in the Islamic world and the West. Unfortunately,
however, inspired by Benjamin Barber’s Jihad vs. McWorld
(1995) and such anti-Islam latter-day crusaders as Daniel Pipes, Huff also 
decides to defend Weber’s commonplace notion of Islam as a “warrior” 
religion spread by the sword, a view some of the other contributors ex- 
plicitly reject. The main argument rests on Huff’s indiscriminate appli- 
cation of the phrase “slaves on horses/horseback” (Eccl. 10:7) as a blanket 
label for two fundamentally different systems of military organization: 
that of the conquering Arab tribes (7th and 8th centuries) and the system 
that had its origins in the employment of Turkish slaves by the Caliphs 
of Baghdad in the 9th century and later developed into distinctive forms 
in Central Asia and Northern India (10th–14th centuries), Mamluk Egypt
and Syria (mid-13th–16th centuries), and the Ottoman Empire (15th–19th
centuries). While the earlier system of military organization—for which 
“slaves on horses” is a figurative and totally misleading description—may 
have had intrinsic connections with the development of Islam as a religion, 
the later Turkish systems were largely extraneous to Islam and had scant 
religious underpinnings or justification. To take this later system as proof 
of the “warrior” nature of Islam seems as unreasonable as to derive nuclear 
warfare from Christianity. 

Quite apart from Huff’s bad counsel on “slaves on horseback,” the 
discussion raises an issue concerning the formative period of the emer- 
gence of the “world religions,” which Weber never solved systematically. 
The question of when (and how) Islam was elaborated into an ethicolegal
“system of life regulation” and a universalist religion is ignored by Huff
and set aside in Schluchter’s vague references to “early Islam.” Levtzion 
(chap. 3) demonstrates that conversion was gradual: Islam remained the 
religion of a conquering minority for at least a century and a half. Fur- 
thermore, he points out that the warriors showed little interest in the 
development of Islam as a universalist religion, which was the work of 
the later jurists and Sufis. Sociopolitical universalism (equality of all Mus- 
lims as against Arab superiority) was a consequence of the ‘Abbasid 
revolution in the middle of the 8th century. The systematic development 
of Islamic law, as Huff points out, came even later. In chapter 2, however, 
the question of the formative historicity of Islam is addressed by eas 
who outlines the institutionalization of Islam. 

The most ambitious and longest essay (over 80 pages) is by Wolfgang 
Schluchter, the convener of the series of conferences at Heidelberg on Max
Weber’s studies on sociology of religion that ended with this one on Islam. 
It is a piece of solid scholarship that examines Weber’s view of Islam in 
the broader context of his writings, sources, and uncompleted projects. 
Schluchter’s focus—which he offers as Weber’s—is entirely on the de- 
velopment of the unique pattern of Western rationality. As a result, 
Schluchter points to Weber’s interest in the similarity and difference be- 
tween Islam and ascetic Protestantism and the characteristics of Islamic 
and Western legal and political organization. His range of comparisons 
is wide and their coverage, thorough. Of particular interest is Schluchter’s 
discussion of law, the cities, and political systems (patrimonialism, Sultanism,
feudalism, and the law-centered development of Western patrimonialism
into the Ständestaat). Yet in the end, all Schluchter’s painstaking
efforts appear to have gone into demonstrating what we have
known all along—namely, that Islam is not ascetic Protestantism. I think 
the problem, be it Schluchter’s or Weber’s, arises from the inability of
any historical narrative, however exhaustive, to explain a unique devel- 
opment (a “historical individual”) such as capitalism or rational forms of 
legal and political organization. Such a narrative is important for under- 
standing the unique historical development in the West but may not be 
helpful for understanding the developmental pattern of the Islamic civ- 
ilization. To give a concrete example, the quest for certitudo salutis and 
predestination loom large among Weber’s defining conditions of Protes- 
tantism. Weber and Schluchter put much emphasis on their absence in 
Islam (certitudo salutis is a unique feature of Protestantism, not found 
in Islam or other religions; predestination is in fact found in Islam but 
disposed of as “predetermination” [p. 77]). 

One could just as well begin the analysis with a historical narrative of 
the rise of Islam (or any religion), that would highlight a number of 
defining features that invite comparisons and this is, in fact, what R. 
Peters does with the rise of Wahhabi fundamentalism in the 18th and 
early 19th century Arabia (chap. 6). He finds Wahhabi fundamentalism 
to be remarkably similar to ascetic Protestantism in its austerity. As cap- 
italism did not develop in Arabia, Peters concludes that ascetic Protes- 
tantism is irrelevant to the rise of capitalism. By the same token, however, 
Peters’s argument is as logically inconclusive as the Weber/Schluchter 
perspective described above. 

Weber’s focus on the development of capitalism, as Lapidus points out 
(p. 139), was “a revelation about Europe,” but misleading for letting “the 
European issue define the questions for the study of non-European so- 
cieties.” The comment may be extended to Schluchter. But most of the 
other essays have the merit of drawing on Weber’s general categories and 
ideal-types to analyze Islamic societies in terms of their own distinctive 
institutions and ethos. Cook finds a “striking misfit” between Weber’s 
conception of sect, derived from Christianity, and the intensely political 
character of religious schism in Medieval Islam (p. 276). By contrast, 
Eaton (chap. 4) and Hardy (chap. 5) see special value in Weber’s concept 
of patrimonialism for the understanding of Muslim India. The former 
also applies Weber’s idea of the routinization of charisma to the Islami- 
cization of Bengal, while the latter pays close attention to the ethos of 
Mughal patrimonialism (instead of chasing the chimera of “inner asceti- 
cism”). Francis Robinson ingeniously develops the idea that, by taking 
Weber’s understanding of developmental history of world religions and 
his typology of religious rationalism, we can make sense of the fact that 
“the retreat of the European, and some failures of the state machinery 
left behind have often been accompanied by a reassertion of Islamic world 
view and a reinstatement of the sharia in public life” (p. 241). In the 
concluding essay, Eisenstadt offers a sketch of the inner dynamics of 
Islamic civilization in the form of a relatively simple model of constant 
tension between an Islamic primordial utopia—the ideal of the Golden 
Age of pristine Islam—and the historical reality of patrimonial Sultanism, 
coexisting with an autonomous public sphere protected by Islamic law and 
dominated by the religious elite, the ulema. He draws on Ibn Khaldun's 
depiction of the cycle of the rise and fall of puritanical Muslim dynasties 
to construct a model of oscillation between military regimes with limited 
pluralism and intolerant, “protofundamentalist” and “Jacobin fundamen- 
talist” ones (p. 292). Eisenstadt’s model is appealing for its simplicity and 
for discarding the Eurocentric view of social evolution and dynamics. It 
leaves out, however, the nonreligious political ethos of Muslim societies 
highlighted by Lapidus and Hardy and the trends in institutionalization 
and development of nonreligious culture such as those treated by 
Robinson. 


Family, Religion, and Social Change in Diverse Societies. Edited by 
Sharon K. Houseknecht and Jerry G. Pankhurst. New York: Oxford Uni- 
versity Press, 1999. Pp. xiv+395. $27.50 (paper). 


Malcolm Hamilton 
University of Reading 


The moment one picks up this book and considers its title and contents 
one cannot help being struck by the realization that there has been a 
remarkable neglect of the linkage between family and religion in contem- 
porary sociology. Can this really be because, as the editors in their intro- 
duction suggest, both family and religion are seen as declining institutions 
belonging to the private sphere with little impact beyond it? Whatever 
the answer to this question, it is odd that this linkage has not received 
systematic treatment on a comparative basis. This unusual collection be- 
gins to fill that gap in providing a set of case studies drawn from many 
different cultures and societies. 
The emphasis upon diversity in its title is entirely appropriate. Fourteen 
contributions range across both developed and developing countries and 
most of the main religious traditions. Grouped into five sections, inevitably 
somewhat arbitrarily given the multiplicity of overlapping themes, cases 
from the United States, Egypt, Sweden, Taiwan, Belarus, Indonesia, South 
Africa, France, Ukraine, Britain, Japan, Cameroon, Brazil, and Mexico 
are included. Not all are focused, however, upon mainstream society. 
Mormon families in France are the concern of Jarvis’s study, Danzger’s 
deals with the Jewish community of Kiev, while Larson’s is concerned 
with immigrant communities from South Asia in Britain. The four cases 
in the first section exhibit the theme of conflict and accommodation. The 
next three are more focused upon social change and transformation. In- 
novation is the concern of section 3, which deals with minority traditions. 
Economic factors are emphasized in section 4, and gender in section 5. 

This diversity, the editors stress, allows and is intended to facilitate the 
comparative study of the religion/family interconnection. Unfortunately, 
the great strength of this collection, its diversity, runs counter to this 
ambition, and in this respect the volume is less impressive. Too often the 
term “comparative” is applied to collections of material as if mere jux- 
taposition of diverse cases in itself allows systematic comparison and 
analysis on the basis of which generalizations may be made. But generally 
in such collections the studies have very different foci, data are gathered 
in very different ways and for different purposes, and the variables in- 
cluded are too many and too complexly interrelated to meet the purposes 
of comparative analysis in the proper sense of the term. This collection 
is no exception. Not only are three of the contributions specifically focused 
on minority traditions, but many are concerned with highly specific as- 
pects of religion or family life. Several are only tangentially concerned, 
in fact, with the linkage between religion and family. For example, Cam- 
mack, Young, and Heaton’s study of Indonesia is concerned very specif- 
ically with reform of the divorce laws, Godsell’s with entrepreneurship 
in South Africa, and Nunes’s study of Brazil on the changing relationship 
between Catholicism and the position of women in the society. 

Even where the studies are centrally concerned with the relationship 
between religion and family, there is in some of them an assumption that 
it is a relatively straightforward matter to trace the influence of one upon 
the other, which neglects the complexities that arise when one takes into 
account the wider cultural context. To take just one example, House- 
knecht’s impressively thorough and informative account of family and 
religion in the context of social change in Egypt concludes that Islamic 
doctrine remains widely reflected in family life. Men provide financial 
support for wives, wives obey husbands, and so on. But how far are such 
values and behavior a reflection of broader cultural values that are, to a 
considerable degree, independent of, although entirely congruent with, 
religion and religious values? The way in which family, religion, and 
culture are sometimes inextricably intertwined is brought out very strik- 
ingly in Yang, Thornton, and Fricke’s study of Taiwan and in Smith’s 
study of Japan. Here religion, or at least that part of it concerned with 
the veneration of ancestors, is so much a part of family that one can 
hardly speak of one without speaking of the other; to say that one influ- 
ences the other is either meaningless, tautological, or redundant. 

If the aim of reaching generalizations through systematic comparison 
remains out of reach for a volume of this kind, a number of important 
if loose themes do emerge. These the editors bring out expertly in their 
introduction. The impact of social change is clearly something that affects 
both religion and family and their interconnections in every part of the 
globe, if to very differing degrees. A crucial aspect of this is secularization, 
a process that also has a profoundly different impact in different societies 
and within different religious traditions. The relationship between the 
public sphere and the private, to which both family and religion are 
usually seen to belong, also reflects a differentially changing role for re- 
ligion and religious organizations across different societies and cultures. 
This collection provides an extremely informative and widely useful con- 
tribution to our knowledge in all of these areas and will be of great value 
for anyone with an interest in religion, family studies, and social change. 


A Particular Place: Urban Restructuring and Religious Ecology in a 
Southern Exurb. By Nancy L. Eiesland. New Brunswick, N.J.: Rutgers 
University Press, 2000. Pp. xvi+255. $52.00 (cloth); $21.00 (paper). 


Dwight B. Billings 
University of Kentucky 


In this insightful and well-written ethnographic study, Nancy Eiesland 
examines the mutual effects of urban and religious restructuring on re- 
ligious behavior and organizations in a small community that has found 
itself drawn increasingly within the orbit of metropolitan Atlanta. Only 
20 or so years ago, according to the author, residents of the small town 
of Dacula, Georgia, viewed their rural community as physically and cul- 
turally distinct from Atlanta. But today, located in one of the fastest 
growing counties in the United States, Dacula has been completely in- 
corporated into Atlanta’s deconcentrated metropolitan zone. Urban re- 
structuring and population influx have profoundly transformed the town’s 
physical, social, cultural, and economic ways of life, while religious re- 
structuring has meant the declining significance of denominational loyalty 
and the rise of new forms of religious organization and participation. 
Eiesland's study illuminates the interaction of these two national trends 
on local-level religious life. Advocating the advantages of an approach 
she terms “religious organizational ecology” rather than utilizing the more 
familiar “metaphor of supply-side religious markets and basic tenets of 
rational choice models of religious change” (p. 13), Eiesland defines the 
goal of her ethnography: “to account for the ways in which religious [and 
urban] restructuring [have] shifted relations with the population of relig- 
ious organizations at the local level” (p. 14). 

After preliminary chapters that review the literature on national trends 
in religion and in exurban growth and that describe the local ethnographic 
context, Eiesland examines the changing interorganizational relations 
among the 24 religious organizations that existed in Dacula in 1995. She 
concentrates on their shifting status order, emergent organizational forms, 
and their efforts to define distinct cultural niches for themselves as they 
variously cooperate and compete to survive. She examines phenomena 
such as the rise of megachurches that capitalize on the rising tide of 
newcomers in the area, the emergence of nondenominational (often Pen- 
tecostal) fellowships and house-churches, the decline (and occasional re- 
surgence) of mainstream denominational churches that sometimes try un- 
successfully to ignore the rapid changes taking place around them, and 
the demise of rural family chapels. The author's account of long-term 
ecological dynamics is shored up by evidence from demographic and 
membership trends, interviews, ethnography, and oral history. 

In order to deepen her analysis of how the interorganizational religious 
environment as a whole influences the internal dynamics of individual 
congregations as well as the religious behavior of their members, Eiesland 
provides an extensive case study of changes that have taken place over 
the past two decades within one United Methodist congregation. Once 
the most prominent and influential church in Dacula, this small congre- 
gation has weathered the effects of many changes, including the loss of 
members to a megachurch that offered far more extensive programming 
and services to the town’s newcomers and consequently usurped its prom- 
inence. Once content to ignore the fast-paced demographic changes taking 
place around it, the church began actively to recruit newcomers only to 
undergo deep schism when many of its new members and its pastor 
embraced Pentecostalism. Finally, it defined a distinct niche for itself by 
promoting its legacy as an old-time, Southern, small-town, and family- 
centered congregation. 

Just as a transformed urban environment influences the internal dy- 
namics of congregations, it also shapes the potential choices made by 
members. In an especially interesting chapter, Eiesland examines differ- 
ences in congregational and denominational loyalties among Methodist 
old-timers and newcomers and looks at how differently they knit together 
their work, consumption, recreational, and religious activities across ur- 
ban space. She finds that old-timers tend to be more denomination and 
church centered in their friendship choices and more likely to support 
local merchants; newcomers choose more widely among the diverse of- 
ferings of the urban and religious environment. One typical newcomer 
family, for instance, joined the local Methodist church but took grief 
workshops offered by the megachurch, enrolled their children in a day- 
care center sponsored by another denomination, and shopped, worked, 
and pursued friendships across the entire Atlanta metropolitan expanse. 
In another chapter the author explores the different (and sometimes com- 
plementary) responses of old-timers and newcomers to three community/ 
religious conflicts—the maintenance of a church cemetery, the proposed 
building of a new airport, and controversy over the Georgia state flag 
and its incorporation of Confederate insignia. 

Eiesland reports that in 1990 almost half of all Americans lived in 
suburbs and exurbs like the one she describes. By documenting how urban 
and religious restructuring is lived out in such a context, her well-realized 
study makes a significant contribution to the small but growing ethno- 
graphic literature on religion in particular places that complements and 
refines the picture of such forces in national surveys. Further, by offering 
a concrete and sympathetic view of how such changes are experienced 
by real people and organizations, her book provides a compelling point 
of access to these issues for students enrolled in courses on the sociology 
of religion, urban sociology, and the sociology of culture. 


Globalising Cities: A New Spatial Order? Edited by Peter Marcuse and 
Ronald van Kempen. Oxford: Blackwell, 2000. Pp. xviii+318. $24.95 
(paper). 


Susan S. Fainstein 
Rutgers University 


Globalising Cities consists of a collection of 12 essays addressing the issue 
of whether “there is a new spatial order within cities, as the result of the 
process of globalization” (p. xvii). The first and last chapters, by the ed- 
itors, raise and answer the question directly; eight of the other chapters 
are case studies of specific cities, ranging from New York to Calcutta, 
that examine their sociospatial structures and seek to tease out the effects 
of globalization on them. Two chapters, one by Robert Beauregard and 
Anne Haila and the other by William Goldsmith, are theoretical. Beau- 
regard and Haila challenge the thesis of discontinuity in urban devel- 
opment, while Goldsmith analyzes the importance of racial division, as 
opposed to globalization, in determining the contemporary city. 
The editors do not keep us in suspense concerning the answer to their 
question; they declare in their preface that cities do not display a new 
social order. Rather, “while the general tendencies produce predictable 
socio-spatial patterns, very visibly affect some locations, and are increas- 
ing in importance, they are not so consistent or so different from prior 
patterns as to deserve being called ‘a new spatial order’” (p. 4). Never- 
theless, the editors remain somewhat ambiguous about this conclusion, 
evidently not being quite sure when incremental change adds up to a 
qualitative leap and conceding that how one comes down on the issue is 
a matter of judgment. 
Overall the various contributions present complex, useful pictures of 
globalizing processes as they play out “on the ground.” Each case study 
examines the economic transformation brought about by global compe- 
tition and flexible production, traces the roots of class and ethnic segre- 
gation, and considers the contributions of specific histories and political 
forces. Most of the authors identify a move toward increasing inequality 
and spatial isolation of different population groups. The model of the 
post-Fordist city, however, applies better to the cities of Europe and North 
America than to Rio, Calcutta, and Singapore. 

Thus, Sanjoy Chakravorty points out that flexible production, while 
novel in the developed world, has always characterized Calcutta, which 
never passed through a Fordist phase. He also notes that domestic live- 
in servants are ubiquitous in well-to-do residential areas; thus, the seg- 
regation of rich and poor that prevails in wealthy Western cities does not 
apply there. His comment is particularly significant, as its implications 
undermine the well-meaning assumption in much of this book that prox- 
imity produces empathy. 

Singapore also presents an important deviation from the paradigmatic 
model of the post-Fordist divided city. Despite recent tendencies toward 
some ethnic segregation, Singapore, with the world’s largest public hous- 
ing program, demonstrates the extent to which state action can overcome 
general trends. Because the authoritarian government of the city-state 
feared the political consequences of ethnic mobilization, it guarded against 
the formation of ethnic enclaves when allocating housing units. The fact 
that it could do so in an entrepôt country totally enmeshed in the global 
economy indicates that global forces do not inevitably produce certain 
social outcomes. Leo van Grunsven hence concludes that the example of 
Singapore “demonstrates that the types of partitioning of the economic 
structure of society and of the metropolitan space which is seen as char- 
acteristic of the post-Fordist phase of capitalism do not necessarily hold 
outside the realm of the traditional core areas of the world economy” (p. 
125). 

In the concluding chapter, the editors delineate the typical processes 
that occur within globalizing cities. They describe the changing uses of 
physical space and highlight the adaptive reuse of waterfronts and ware- 
houses, gentrification, and the construction of office centers, but also the 
abandonment of brownfield sites. They use the term “soft locations” to 
depict areas particularly subject to mutation as a consequence of glob- 
alization and economic restructuring. In elaborating a model previously 
set forth by Peter Marcuse (the “quartered city”), they utilize the term 
“layered city.” They necessarily, however, insert a number of qualifiers in 
attempting to make their model generally applicable and, in fact, warn 
against the dangers of overgeneralization. On the whole, they make a 
strong case concerning growing spatial segregation. Less convincing is 
their argument that the different quarters of the city are becoming to- 
talized, such that individuals living within them need not leave them. 

We are at a point in history where the roles of the city and the nation- 
state are changing, at least in part as a consequence of greater global 
integration. The issue is not whether the global or the local predominates, 
but the character of their interaction and the extent to which it is possible 
to exert political control over urban processes. Globalizing Cities is an 
important contribution to the scholarly literature on what is therefore 
currently the most salient topic in urban sociology: the interaction between 
the global and the local. 


Social Change, the Social Organization of 
Families, and Fertility Limitation 


William G. Axinn and Scott T. Yabiku 
University of Michigan 


The social organization of the family is a key link between macro- 
level social change and individual-level childbearing behavior. The 
family mode of organization framework and life course perspective 
are used to develop hypotheses about these links. To test those 
hypotheses, the analysis uses a combination of life history and 
neighborhood history measures designed explicitly for this purpose. 
Results from fully dynamic multilevel hazard models demonstrate 
both childhood and adult community contexts shape childbearing 
in independent ways. The results implicate a variety of specific 
mechanisms linking social change to fertility behavior: cost-benefit 
analysis, ideational diffusion, and long-term personality develop- 
ment. The results also show contextual characteristics at multiple 
points in the life course may each exert independent effects on in- 
dividual outcomes. 


The relationship between macrolevel social changes and microlevel in- 
dividual behaviors has been a persistent theme in sociological theory 
(Alexander 1988; Coleman 1990; Durkheim [1893] 1984). This theme is 
not only a theoretical issue, but also has important empirical implications. 
In fact, recent studies from a broad range of substantive areas, including 
demography, gender relations, child development, criminology, and ed- 
ucation emphasize the empirical significance of social context as a deter- 
minant of individual behavior (Brooks-Gunn et al. 1993; Entwisle and 
Mason 1985; Huber 1991; Raudenbush 1988; Rountree, Land, and Miethe 
1994). 

Despite substantial research in these areas, it is often difficult to find 
strong empirical evidence of relationships between social context and in- 
dividual behaviors. One reason is that most multilevel studies use mea- 
sures of social context that were created for some other purpose, such as 
census tract or regional data (Brewster 1994; Brooks-Gunn et al. 1993; 
Hirschman and Guest 1990). This often leads researchers to operationalize 
social context at a higher level of aggregation than may be relevant in 
individuals’ daily lives. Researchers may also use contextual measures 
that are not strongly related to theoretically relevant dimensions of con- 
text. Another reason is that researchers must generally rely on static mea- 
sures of macrolevel characteristics, even when they are investigating dy- 
namic processes (Brewster 1994; Entwisle, Casterline, and Sayed 1989; 
Hirschman and Guest 1990; Sastry 1996). This often prevents researchers 
from examining the impact of macrolevel contextual changes that occur 
over time. Even more troubling, it sometimes means that empirical tests 
of models do not match the temporal order embedded in hypotheses. As 
a result, social science has provided relatively little empirical evidence of 
the strong macro/micro relationships predicted by theories. 

In this study, we use innovative measurement to overcome these ob- 
stacles and to show the strong impact of community-level social change 
on individual childbearing behavior. In contrast to many studies, we use 
data that were explicitly designed for this purpose and measures based 
explicitly on the theoretical framework described below. We operationalize 
context at an extremely local level that is likely to have substantial mean- 
ing in people’s lives: their immediate neighborhood. We focus on a setting 
in rural Nepal because this population has experienced a dramatic re- 
organization of social activities within the lifetimes of its residents, and 
dramatic variance in levels of reorganization characterize the neighbor- 
hoods in this setting. Furthermore, the measurement strategy uses new 
techniques for gathering continuous histories of contextual change; rather 
than aggregating individual characteristics, the neighborhood history cal- 
endar technique focuses on characteristics of the communities (Axinn, 
Barber, and Ghimire 1997). In the empirical analysis, we use dynamic 
models of fertility limitation to incorporate both change in community 
context and change in individual life histories. 

We investigate perhaps the most theoretically and empirically studied 
subject in social demography: the processes of transition from high fertility 
and no use of birth control to low fertility and the widespread use of birth 
control (Notestein 1953; Davis 1955; Caldwell 1982; Freedman 1979; 
McNicoll 1980; Lesthaeghe and Surkyn 1988; Easterlin and Crimmins 
1985; Smith 1989). Despite an intense interest, demographers have not 
agreed on an accepted theoretical framework for understanding this im- 
portant transition (Mason 1997). This undoubtedly comes as little surprise 
to many sociologists, who may in general give social demography higher 
marks for generating empirical evidence than for developing theoretical 
reasoning. What may be more of a surprise is that social demography has 
also generally failed to provide strong empirical tests of the central ideas 
in these models (Tilly 1986; Burch 1983; Ryder 1986; Smith 1989). 

Our framework for studying this fundamental social transformation 
uses the social organization of the family as the key intervening link 
between macrolevel social change and the transition toward the wide- 
spread use of birth control. We use our theoretical framework to identify 
the relevant dimensions of community context rather than rely on pre- 
existing data sources. The framework itself is an important advance in 
theoretical reasoning of fertility transition because it allows multiple 
causal connections between macrolevel social changes and individual- 
level behavior. We have strong reason to believe that, depending on the 
contextual circumstances, a variety of social processes may influence fer- 
tility behavior, including changes in the mode of economic production, 
institutional organization of social life, and patterns of diffusion of ideas 
(Lesthaeghe and Wilson 1986; McNicoll 1980; Cleland and Wilson 1987; 
Caldwell 1982; Montgomery and Casterline 1993; Smith 1989). Further- 
more, empirical evidence indicates that all populations do not follow a 
single, unilinear course of demographic change (Freedman 1979). Our 
framework simultaneously incorporates multiple causal mechanisms into 
a consistent body of empirical predictions, an important advantage for 
achieving the goal of consistent prediction across settings, which may 
follow widely varying courses of demographic change. 

Because we focus on timing and sequencing changes at both community 
and individual levels, both our theoretical framework and our empirical 
analyses build on the life course perspective (Elder 1977, 1983). Theo- 
retically, this perspective suggests that childhood and adult community 
context may have independent effects because events early in the life 
course may continue to influence later behavior. The empirical implication 
for our analysis is that we first model the impact of childhood community 
characteristics followed by modeling the consequences of subsequent ex- 
periences and later community characteristics. This life course perspective, 
combined with a modes of social organization approach, offers a powerful 
way to examine the effects of macrolevel social change on individual 
outcomes. 


THEORETICAL FRAMEWORK 
Social Change and Family Organization 


The family mode of social organization framework considers a wide array 
of social changes and their potential influence for individuals in families 
(Thornton and Fricke 1987; Thornton and Lin 1994). The framework 
builds upon previous research that focused exclusively on the family mode 
of production (Caldwell 1982; Lesthaeghe and Wilson 1986) and extends 
to family modes of social organization across a variety of domains: con- 
sumption, residence, recreation, protection, socialization, procreation, and 
production. Historically, most of these activities of daily living were or- 
ganized within the family (Ogburn and Nimkoff 1955; Thornton and 
Fricke 1987). Yet as social changes created new nonfamily institutions to 
organize these activities, they increasingly took place outside the family. 
No society is expected to be completely organized inside or outside of 
families, but the nonfamily and family modes of social organization are 
two ideal types that aid our understanding of social change and the family. 

Our choice of this framework is rooted in the premise that improved 
transportation and communication, monetization of the economy, and 
population growth increase the division of labor in society. Durkheim 
argued that these factors increase the numbers of people who interact 
with one another, or the “moral density” of society (1984, p. 257). Durk- 
heim mentioned three specific mechanisms that predicate the division of 
labor: population concentration, the formation and development of cities, 
and improved communication and transportation (1984, pp. 257-58). The 
framework recognizes that changes in transportation, communication, and 
monetization can reorganize not only production, but a wide variety of 
other social activities as well, which greatly impacts the social organization 
of families. As activities of daily living increasingly take place outside the 
home and away from the family, the structure of social interactions 
changes and alters social relationships with both family members and 
others outside the family. This reorganization of family life is the key link 
between macrolevel social changes and microlevel outcomes in family- 
building behaviors. 


Family Organization and Childbearing Behavior 


The family organization framework can be used to integrate and expand 
upon existing explanations of fertility behavior. Theoretical work linking 
social change to childbearing behavior has mainly focused on two sets of 
explanations: (1) microeconomic explanations that emphasize the impact 
of changes on the costs and benefits of childrearing and (2) ideational 
explanations that emphasize the impact of changes in the diffusion of 
beliefs and preferences related to childbearing. The family organization 
framework incorporates both these perspectives, plus we add a largely 
unexplored mechanism: personality development. 

Costs and benefits of childrearing.—Microeconomic theories of family 
formation processes, sometimes referred to as “demand” theories, focus 
on the costs and benefits of childrearing (Becker 1991; Easterlin and 
Crimmins 1985; Notestein 1953; Lesthaeghe and Surkyn 1988; Willis 
1973). Recent theoretical work by sociologists also emphasizes this same 
perspective. For example, Coleman explicitly links the reorganization of 
family life to individuals’ childbearing and childrearing behaviors via the 
costs and benefits of childrearing (Coleman 1990). As nonfamily organi- 
zations and institutions (what Coleman calls corporate actors) grow, they 
remove from the family responsibilities for activities it once performed: 
production (p. 580), education (p. 581), food preparation (p. 587), and care 
of the elderly (p. 584). The key is that when these activities happen within 
the household, they create positive externalities to childrearing for house- 
hold members. For example, while the elderly are cared for in the family 
home, they also can oversee young children. Similarly, when parents’ 
productive activities occur near the home, children can assist with the 
simpler tasks. These positive externalities keep the costs of childrearing 
low. When these social activities shift to specialized corporate actors, 
however, the positive externalities begin to disappear, which increases 
costs and decreases benefits of childrearing. These increased costs and 
decreased benefits motivate individuals to limit their fertility (p. 585). 

Earlier theories of fertility change contain many ideas similar to Cole- 
man’s. For example, Notestein argued that the reorganization of social 
activities outside the family reduces motivations to have children. He 
wrote that fertility transitions began in settings that “stripped the family 
of many functions in production, consumption, recreation and education” 
(Notestein 1953, p. 16). Caldwell’s intergenerational wealth flows theory 
of fertility decline is also similar to Coleman’s claim that the reorgani- 
zation of care for the elderly outside the family reduces the motivation 
to have children. When social change reduces the value of children, in- 
tergenerational flows of wealth reverse and flow from parents to children. 
This reversal, argues Caldwell, induces fertility declines (Caldwell 1982). 
Thus, these theories use the social organization of the family to link ma- 
crolevel social changes to individual-level childbearing behavior. Or, as 
Coleman says, “the new corporate actors are specialized and narrow pur- 
pose entities, constitutive of a structure marked by a high division of 
labor, and have no place for childrearing” (Coleman 1990, p. 579). 

Note that in this theory of fertility decline, the individuals who make 
decisions about childbearing are always aware of their self-interest and 
make decisions to maximize self-interest. Circumstances around individuals 
change, and this alters whether having many children is in individuals’ 
self-interest. This is an important contrast to fertility theories that 
argue that the circumstances facing individuals stay essentially the same, 
but as individuals become more aware of their own self-interest, they 
become more rational and begin to limit their fertility (Kasarda, Billy, 
and West 1986). 

Diffusion of new ideas.—In contrast to perspectives that emphasize the 
costs and benefits of childrearing, diffusion perspectives stress social in- 
teractions as the key to fertility declines. New patterns of social interaction 
change the patterns of communication among individuals and can facil- 
itate the spread of new ideas about childbearing. This perspective was 
fueled by the empirical observation that patterns of fertility decline over- 
lapped with ethnic, language, and religious groupings, suggesting that 
patterns of communication may have shaped childbearing behavior (An- 
derson 1986). Additional evidence showed that factors believed closely 
related to the costs of childrearing did not explain fertility decline (Cleland 
and Wilson 1987; Bongaarts and Watkins 1996). 

The spread of many types of new ideas has been linked to fertility 
decline. These include information about contraception and the means of 
contraception (Knodel 1987), higher consumption aspirations (Freedman 
1979), smaller family size preferences (Caldwell 1982; Lightbourne 1984), 
and secular and individualistic attitudes and preferences (Lesthaeghe and 
Wilson 1986; Lesthaeghe and Surkyn 1988; Bumpass 1990). Each of these 
social changes, however, requires a fundamental change in social inter- 
action: individuals must be in contact with ideas and information that 
they had not previously encountered. Multiple potential mechanisms of 
change have been suggested, including social networks that guide the 
informal spread of information or gossip networks (Watkins 1991; Watkins 
and Danzi 1995), the spread of Western ideas through colonial education 
systems (Caldwell 1982), increased migration, travel and tourism (Bon- 
gaarts and Watkins 1996; Freedman 1979), and contact with mass media 
(Bongaarts and Watkins 1996). These social changes alter the people with 
whom individuals interact and provide the mechanisms for new ideas to 
spread through populations. 

The family mode of social organization framework is consistent with 
both microeconomic and diffusion theories of fertility decline. For ex- 
ample, when social activities are reorganized, it may alter the costs and 
benefits of childrearing, stimulating individuals to change their fertility 
behavior. As asserted by most microeconomic theories of fertility, indi- 
viduals respond to changes in social and economic circumstances by al- 
tering their fertility behavior (Bulatao and Lee 1983; Easterlin and Crim- 
mins 1985). On the other hand, reorganized activities also alter the group 
of people with whom any single individual interacts, thereby facilitating 
diffusion of alternative ideas. Such mechanisms of social change are consistent 
with fertility theories that emphasize the diffusion of innovative 
ideas (Montgomery and Casterline 1993; Cleland and Wilson 1987). Thus 
the family mode of social organization framework incorporates both 
microeconomic change and the diffusion of ideas as mechanisms of fertility 
change. This characteristic of the framework is particularly useful, as 
both sets of changes are likely to exert important influences on child- 
bearing behavior (Lesthaeghe and Surkyn 1988). 

Long-term personality development.—In addition to its integration of 
ideational diffusion and microeconomic explanations of childbearing be- 
havior, the framework also suggests an overlooked mechanism linking 
social change to childbearing behavior: personality. Recent theoretical 
advances demonstrate how the social organization of families in early 
childhood affects children’s long-term personality development (Yabiku, 
Axinn, and Thornton 1999). Social activities organized around the family 
integrate family members, which affects the environment in which chil- 
dren are raised. Empirical evidence confirms the long-term impact of 
family integration in childhood on adult self-esteem, but theory predicts 
similar long-term impacts on attitudes, values, and preferences related to 
family formation (Yabiku et al. 1999). Specifically, when individuals have 
high levels of family integration in childhood, they are likely to prefer 
earlier family formation and larger family sizes. The complementary pre- 
diction is that when social change removes activities from the family, 
family integration decreases, stimulating preferences for smaller families 
and later family formation. This mechanism of social change is ideational 
in nature, but it differs from mechanisms that rely on the diffusion of 
new ideas among adults. By contrast, this mechanism implicates the social 
organization of families in childhood and how it shapes the long-term 
personality characteristics of children. 

Because it emphasizes the timing and sequencing of both macrolevel 
historical changes and microlevel individual experiences, the life course 
perspective is extremely valuable for illuminating the empirical differences 
among our hypotheses (Elder 1977, 1983). Timing is important because 
different mechanisms work through different periods in an individual’s 
lifetime. The personality mechanism described above suggests that social 
context during childhood affects social organization in the child’s family, 
which subsequently influences childbearing behavior in adulthood. Mech- 
anisms affecting the costs and benefits of childrearing, however, operate 
during adulthood as a person decides on childbearing. Mechanisms linked 
to the diffusion of new antinatalist ideas may operate through either 
childhood or adult processes: new social interactions in adulthood may 
spread new ideas that produce fertility decline (e.g., gossip networks 
described by Watkins 1991), but social interactions in childhood may also 
convey new information such that attitudes and preferences are altered 
for the long term (e.g., exposure to school books depicting families with 
only two children as described by Caldwell 1982). 

An interesting question is raised in regard to the interplay between 
personality mechanisms developed in childhood and adult contemporary 
context. The research literature on attitudes and behavior recognizes that 
a key reason attitudes and preferences do not necessarily translate into 
consistent behavior is real constraints associated with circumstances at 
the time of behavior (Ajzen 1988). For example, in our setting, a woman 
who lived near a school in her childhood may place a high value on 
sending her own children to school, but if she does not have access to 
nearby schools in adulthood, when she has children, she may not act 
consistently with those values. Thus, it may be reasonable to expect that 
the effect of current adult context is more important than childhood con- 
text. On the other hand, the attitudes and preferences formed in childhood 
may remain important factors in adult decision making. Work by Elder 
(1974) suggests the long-term influences of childhood experience even in 
the midst of very different adult contexts. Thus, we have reason to expect 
there may be independent effects of childhood and adult contexts on adult 
decision making. 


APPLYING THE FAMILY ORGANIZATION FRAMEWORK 


Application of the family organization framework requires knowledge of 
the starting state of family versus nonfamily social organization and at- 
titudes toward family formation (Thornton and Fricke 1987; Thornton 
and Lin 1994). From these initial characteristics, the context-specific con- 
sequences of social change can be predicted. This characteristic of the 
framework is advantageous compared to existing frameworks for the 
study of fertility decline. For example, most demographic transition the- 
ories imply that parents will always desire fewer children when social 
activities become organized outside the family (Caldwell 1982; Coleman 
1990; Notestein 1953). This unilinear outcome, however, is not necessarily 
a universal consequence of changes in family organization. The conse- 
quence of reorganizing social activities outside the family is likely to de- 
pend on both the social context and the specific social activity being 
reorganized. For example, in low fertility settings with many activities 
organized outside the family, such as the United States, the reorganization 
of childcare outside the family is likely to increase fertility (Presser 1986; 
Rindfuss 1991). Before predicting the consequences of specific social 
changes using the family organization framework, we briefly describe the 
setting. 

The setting for this study is the Western Chitwan Valley located in 
south-central Nepal. Because poverty and topographical barriers have delayed 
the spread of new nonfamily organizations and institutions in Nepal (Blai- 
kie, Cameron, and Seddon 1982), the vast majority of social activities 
were organized within the family up until the very recent past (Fricke 
1986). Chitwan is a wide flat valley nestled in the Himalayan foothills at 
approximately 450 feet above sea level, 100 miles southwest of Kath- 
mandu, the country’s capital. Until the early 1950s, Chitwan was covered 
by virgin forests, infested with malaria-carrying mosquitos, and inhabited 
by dangerous fauna ranging from poisonous snakes to Bengal tigers. Be- 
ginning in the mid-1950s, with assistance from the U.S. government, the 
Nepalese government began a program to clear the forests, eradicate 
malaria, and distribute land to settlers from the higher Himalayas. Our 
study examines social change in a 93-square-mile area of Western Chitwan 
that was cleared and settled. 

New roadways through the valley were major sources of social change. 
Although rich soils, flat terrain, and new opportunities drew many farmers 
into the area, the valley remained a remote, isolated frontier until the late 
1970s. The first all-weather road into Chitwan was completed in 1979. 
This road linked Chitwan’s largest town, Narayanghat, to cities in eastern 
Nepal and India. Two other important roads followed: one west, linking 
that town to the western portion of Nepal, and another north, linking 
Chitwan to Kathmandu. Because of Narayanghat’s central location, by 
the mid-1980s, this once isolated town became the transportation hub of 
the country. This change produced a rapid proliferation of government 
services, businesses, and wage labor jobs in Narayanghat that spread 
throughout Chitwan (Pokharel and Shivakoti 1986). The population of 
this valley continued to grow, with both in-migration and natural increase 
significantly contributing to this growth (Central Bureau of Statistics 1987; 
Tuladhar 1989). 

Within the lifetimes of its residents, the social organization of Chitwan 
has changed greatly. As of the late 1950s, there were virtually no em- 
ployment opportunities, marketplaces, schools, health posts, or transpor- 
tation services. Now bus service through the valley provides access to the 
wage labor opportunities and commerce of Narayanghat. Commercial 
enterprises, such as grain mills and new retail outlets, have scattered 
throughout Chitwan. A wide range of government services, including 
schools, agricultural cooperatives, police posts, and health posts have also 
spread. For the hundreds of small farming communities in the Western 
Chitwan Valley, these changes significantly transformed the local context. 

Until the very recent past, fertility in Nepal has remained moderately 
high. As of the early 1980s women still averaged more than six children 
(Central Bureau of Statistics 1991). Likewise, through the early 1980s 
fewer than 10% of women used any form of contraception (Tuladhar 
1989). The entirely rural population of Chitwan also experienced high 
fertility and low rates of contraception through the beginning of the 1980s 
(Banister and Thapa 1981; Central Bureau of Statistics 1987; Tuladhar 
1989). There is little direct historical evidence regarding family size pref- 
erences for the Nepalese population or the population that now inhabits 
Chitwan, but what little evidence there is suggests that the groups who 
inhabit Chitwan now generally preferred large families in the past and 
did not desire to limit their family sizes (Fricke 1986; Acharya and Bennett 
1981; Gurung 1980; Hitchcock 1966; McFarlane 1976; Messerschmidt 
1976). 


Empirical Predictions 


In this setting, we expect community-level changes in access to nonfamily 
organizations and institutions, such as markets, schools, health care, and 
transportation to result in the reorganization of productive, consumptive, 
residential, recreational, protective, and socialization activities outside the 
family. As these activities of daily living are reorganized outside the family, 
we expect the costs and benefits of childrearing, the diffusion of new 
ideas, and long-term effects on personality development will each change 
to promote fertility-limiting behavior. 

Nonfamily work.—The availability of nonfamily work begins with the 
spread of employment opportunities at the community level. If a com- 
munity contains nonfamily employers, then residents are more likely to 
secure nonfamily employment. We hypothesize that when nonfamily work 
is available and individuals participate in this work, fertility limitation 
will increase. In rural Nepal, children contribute to family production 
when they can be employed within the family, but additional children are 
less help to families who work in the cash economy and sell their labor. 
Also, nonfamily employment exposes workers to new ideas that may le- 
gitimize fertility limitation or create new goals, such as higher levels of 
consumption, which compete with the pursuit of larger families (Cleland 
and Wilson 1987; Freedman 1979). Furthermore, parents’ participation 
in nonfamily work reduces family integration in childhood, which may 
have long-term implications for the formation of children’s preferences. 

Nonfamily consumption.—Similar to production, the spread of markets 
at the community level increases nonfamily consumption opportunities, 
redistributing consumption outside the family. With greater accessibility 
of marketplaces, the regular routine of buying goods outside the home 
becomes more a part of social life, as do the associated social practices: 
impersonal transactions with strangers, utility comparisons, and calculated 
decision making within a monetized environment.’ 

Consumption outside the family is likely to affect fertility decisions. 
Most important, individuals may directly compare the costs of childrear- 
ing to the prices of goods and services newly available through the market. 
If individuals want to acquire these new goods and services, the relative 
costs of childbearing are likely to rise and thus promote fertility-limiting 
behaviors (Easterlin 1987; Freedman 1979). Note that the desire to acquire 
these goods and services may transcend the actual ability to acquire the 
goods and services themselves (Freedman 1979). When community change 
makes marketplaces more accessible, the redistribution of consumption 
outside the family is likely to increase. 

Nonfamily schooling.—The spread of nonfamily schools to communities 
removes many socialization activities from the family. When children 
attend schools, it increases the costs of childrearing because it creates a 
new demand for money to buy uniforms, pens and paper, and to pay 
tuition and fees (Axinn 1993; Caldwell 1982). Nonfamily education also 
reduces children’s economic contributions to the family, thereby reducing 
the benefits of childbearing (Caldwell 1982; Coleman 1990). Along with 
this direct effect on the costs and benefits of childbearing, education also 
is likely to increase children’s independence from their parents, making 
it more difficult for parents to maintain their authority over children 
(Caldwell, Reddy, and Caldwell 1988). This indirect mechanism is also 
likely to increase the costs and decrease the benefits of childbearing (Cald- 
well 1982; Coleman 1990). 

Schools in Nepal may also be sources of ideational change, as they are 
modeled on the British educational system and can provide new infor- 
mation to parents about the availability of contraceptive methods or al- 
ternatives to childbearing (Axinn and Barber, in press; Cochrane 1979; 
Kasarda et al. 1986). Because of the links between South Asian schools 
and the British educational system, educational materials in the region 
often include books describing love marriage, nuclear families, and small 
family sizes (Caldwell 1982; Caldwell et al. 1988). The same materials 
also describe new consumption alternatives, which may raise consumption 
aspirations. As these ideas spread, they raise the likelihood that those who 
have been to school outside the family will limit their fertility. In fact, 


? Of course, calculated decision making is not new to the residents of Chitwan. Chitwan 
is largely an agricultural subsistence economy. These rural Nepalese farmers must be 
highly shrewd simply to survive—what crops to plant, how much of each, when to 
harvest. What is new, however, is that the residents of Chitwan have the opportunity 
to use these decision-making skills in an ever-expanding market, where the medium 
of exchange, legal currency, allows decision making to expand over longer time spans 
than before. 

Caldwell (1982) argues that schooling can affect ideas so strongly that 
individuals who have not been to school may change the way they think 
about childhood just because many of their neighbors send their children 
to school. 

Nonfamily transportation.—Even if nonfamily employers, consumption, 
or schools are not geographically nearby, improvements in community 
access to transportation may stimulate the reorganization of family 
life. Employers affect the community if a transportation infrastructure 
links the community to the employers: changes in transportation affect 
labor market accessibility. For example, in Chitwan, what might have 
been a three-hour walk to Narayanghat may become a short bus ride, 
and commuting to nonfamily wage labor can now be part of the daily 
routine for residents of formerly remote communities. Similar to produc- 
tion, nonfamily consumption and schooling opportunities may become 
more accessible through improved transportation. 

Nonfamily residence.—Residential experiences outside the family are 
likely to discourage historically held family values, such as unlimited 
fertility in Nepal, and promote fertility limitation. When individuals reside 
with their parents, the parents are likely to encourage them to have values 
characteristic of the population in the past (Axinn and Barber 1997; Waite, 
Goldscheider, and Witsberger 1986). When individuals have nonfamily 
living arrangements, such as in group quarters with peers or alone, how- 
ever, they are more likely to encounter new family formation values and 
consumption preferences (Axinn and Barber 1997; Chamratrithirong, 
Morgan, and Rindfuss 1988; Waite et al. 1986). Nonfamily living also 
reduces the positive externalities to childrearing, increasing child care 
costs and reducing the demand for children (Coleman 1990). Community 
access to nonfamily work opportunities, markets, schools, and transpor- 
tation are all likely to increase access to nonfamily living. 

Nonfamily leisure.—Social changes at the community level are also 
likely to increase leisure activities outside the home and away from the 
family. Examples of these changes are new cinema halls, local markets 
that sell televisions and radios, and new bars and tea stalls introduced in 
marketplaces. Thus, proximity to cinema halls, marketplaces, or bus services 
promotes leisure activity outside the family. 

Leisure activities located outside the family allow new ideas to spread, 
which may influence fertility decisions and the costs of childrearing. For 
example, nonfamily leisure activities such as cinema or radio may inform 
parents about contraceptive methods, thereby lowering fertility regulation 
costs (Hermalin 1983). New activities also inform individuals of con- 
sumption alternatives that raise consumption aspirations and decrease the 
demand for children (Freedman 1979; Easterlin 1987). Because much of 
the mass media content in Nepal originates in high income, low fertility 
settings, this content directly spreads ideas and images that promote 
smaller families (Caldwell 1982). Furthermore, when parents participate 
in leisure activities at these places, they are outside the home and generally 
away from the family. Because it is difficult for parents to combine child 
care with these new nonfamily leisure activities, they raise the relative 
costs of childrearing, which may increase fertility limitation (Coleman 
1990). 

Nonfamily health care.—As government services have spread through 
Chitwan, many different health care activities have become organized 
outside the family. Chitwan now has a variety of medical facilities that 
did not exist less than a generation ago: a hospital, many local health 
posts and clinics, plus numerous pharmacies. New nonfamily health care 
services may motivate individuals to limit their fertility for two reasons: 
these services may reduce infant and child mortality, thereby increasing 
the supply of children (Caldwell 1986; Mason 1997), or these services may 
provide access to contraceptive methods, thereby reducing the costs of 
fertility limitation (Bulatao and Lee 1983; Easterlin and Crimmins 1985). 
Much evidence demonstrates that when contraceptive methods are avail- 
able, individuals are more likely to limit their fertility, especially among 
those who are already motivated to control their childbearing (Freedman 
1997; Knodel 1987). 


RESEARCH DESIGN, MEASUREMENT, AND ANALYTIC APPROACH 


Within the Western Chitwan Valley in south-central Nepal, we examine 
social change and fertility in 171 neighborhoods using measures from the 
Chitwan Valley Family Study (CVFS). The CVFS defined a neighborhood 
as a geographic cluster of 5-15 households. Given the rural setting, these 
neighborhoods contain a group of people who know each other and in- 
teract face to face every day. This extremely local definition of context is 
advantageous because high levels of family organization of social activities 
generally correspond to extremely localized patterns of social interaction 
and social organization (Ogburn 1933; Thornton and Fricke 1987). We 
selected neighborhoods through an equal probability, systematic sample 
of neighborhoods in Western Chitwan. The sampling strategy was de- 
signed to eliminate national and regional sources of variation by focusing 
on a single small area, Chitwan Valley. Yet the strategy also maximizes 
neighborhood-level variations by sampling neighborhoods from a setting 
with much local variation (as advocated by Smith [1989]). The CVFS 
measured neighborhood context with the neighborhood history calendar 
method. This technique combines archival, ethnographic, and structured 
interview methods to gather detailed, continuous measures of neighborhood 
change over time (see Axinn, Barber, and Ghimire 1997). The CVFS 
interviewed every resident between the ages of 15 and 59, and their 
spouses, in those 171 neighborhoods,’ with an overall response rate of 
97%. This process generated 5,271 completed interviews. 

In the analysis, we use information from the 1,395 women included in 
the CVFS who were between ages 25 and 54 at the time of interview, 
were married, and had at least one child. We restrict our sample to older, 
ever-married women with at least one child because in Nepal never- 
married women, childless women, and extremely young women are very 
unlikely to use contraceptive methods to stop childbearing (Acharya and 
Bennett 1981; Axinn 1992; Tuladhar 1989). We also use information about 
these women's husbands from the spouse interviews. 
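
The sample restriction described above can be read as a simple filter over interview records. The Python sketch below is illustrative only: the record layout and the field names (`sex`, `age`, `ever_married`, `children_ever_born`) are assumptions made for this example and are not CVFS variable names.

```python
from dataclasses import dataclass


@dataclass
class Respondent:
    # Hypothetical record layout; these are not actual CVFS variable names.
    sex: str                 # "F" or "M"
    age: int                 # age in years at the time of interview
    ever_married: bool
    children_ever_born: int


def in_analytic_sample(r: Respondent) -> bool:
    """Return True for women aged 25-54 who have married and have at least one child."""
    return (
        r.sex == "F"
        and 25 <= r.age <= 54
        and r.ever_married
        and r.children_ever_born >= 1
    )


respondents = [
    Respondent("F", 30, True, 2),    # meets every criterion
    Respondent("F", 19, True, 1),    # too young
    Respondent("F", 40, True, 0),    # childless
    Respondent("M", 35, True, 3),    # men enter only through the spouse interviews
]
analytic_sample = [r for r in respondents if in_analytic_sample(r)]
```

Applied to the toy records, only the first respondent is retained; in the study itself this restriction yields the 1,395 women analyzed.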


Measures of Community Context 


Because the contextual data in the CVFS are particularly rich, we can 
construct many types of contextual measures. After carefully considering 
the alternatives, we present two types of contextual measures of individ- 
uals’ experiences: measures of childhood community context (ages 0-12) 
and measures of current community context after first childbirth. The 
measures of childhood community context were collected in the individual 
interviews, pertain only to the individual interviewed, and are time-invariant 
measures. The current measures of the individuals’ communities, however, 
come from the neighborhood history calendars and are time-varying 
measures. The contemporary measures are true contextual 
measures: a single measure is assigned to all individuals in a specific 
neighborhood at a specific point in time. These two sets of measures allow 
us to demonstrate the enduring effects of community context in childhood 
along with the independent effects of contemporary context. 

For measures of childhood community context, we used characteristics 
of the communities in which the respondents lived through age 12. Re- 
spondents were asked, “Was there a school within a one-hour walk from 
your home at any time before you were 12 years old?” If they answered 
yes to this question, then our measure of whether they had a school in 
their neighborhood before age 12 is coded “1,” and if they answered no, 
the measure is coded “0.” Similar questions were asked about health posts, 


? The main sample was defined by eating and sleeping in the selected neighborhood 
for three months or more of the past six months. The CVFS also interviewed the 
spouses of all married persons who met these criteria but had spouses who did not meet 
these criteria. Thus the data include interviews with spouses who were temporarily 
away, over age 59, or under age 15. This procedure provides full measurement of 
spousal characteristics for all those married persons in the main sample. 

employers, markets, and bus stops.* In addition to dichotomous measures 
of each of these services within a one-hour walk, we also created an index 
of the number of nonfamily organizations. This index is the sum of the 
dichotomous measures—a count of services within a one-hour walk. Pre- 
vious research on individual-level nonfamily experiences and fertility be- 
havior has used similar indices (Axinn 1992; Thornton et al. 1986). 
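
As a rough illustration of this coding scheme, the sketch below turns yes/no answers into dichotomous measures and sums them into the index. The service keys and the treatment of a missing answer as 0 are assumptions made for the example; the text does not say how missing responses were handled.

```python
# Hypothetical keys for the five services covered by the childhood questions.
SERVICES = ("school", "health_post", "employer", "market", "bus_stop")


def code_dichotomous(answers: dict) -> dict:
    """Code each service 1 if the respondent answered "yes", and 0 otherwise.

    Assumption: a missing answer is coded 0.
    """
    return {s: 1 if answers.get(s) == "yes" else 0 for s in SERVICES}


def service_index(coded: dict) -> int:
    """Count the services within a one-hour walk (sum of the dichotomous measures)."""
    return sum(coded[s] for s in SERVICES)


answers = {"school": "yes", "health_post": "no", "employer": "no",
           "market": "yes", "bus_stop": "no"}
coded = code_dichotomous(answers)
index = service_index(coded)  # 2: a school and a market
```

Because the index is a plain sum, it ranges from 0 (no services nearby) to 5 (all five nearby) and weights each service equally.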
Descriptive statistics for these and other individual interview-based 
measures are presented in table 1. Note that 45% of respondents lived 
within a one-hour walk of an employer some time through age 12, 72% 
within one hour of a market, 38% within one hour of a bus stop, 79% 
within one hour of a school, and 45% within one hour of a health service. 
Current community context is constructed from the neighborhood his- 
tories. Because employment opportunities, marketplaces, schools, health 
posts, and bus services have become much more common in recent years, 
many residents of the valley now live within a one-hour walk of these 
nonfamily organizations and services. To identify those respondents who 
live particularly close to these services, we use a five-minute walk as a 
distance threshold (varying these distance thresholds produces no sub- 
stantive changes in the interpretation of our results; lower thresholds, 
however, produce slightly stronger effects). If a woman lives in a neigh- 
borhood where the nearest wage labor employer is five minutes away or 
less, the employment opportunity measure is coded “1.” It is coded “0” 
otherwise. The other measures of community context are coded similarly. 
As with the measures of childhood community context, we created an 
index of the total number of nonfamily services within a five-minute 
walk of the neighborhood. In our models, the measures also are lagged 
one year, so that a woman’s experiences living near an employer in the 
previous year affect her likelihood of contraceptive use in the current 
year. This ensures that the temporal order among measures in our model 
is consistent with the temporal order embedded in our hypotheses. See 


* Although these questions represent a lengthy period of recall for some respondents, 
in this setting the introduction of new nonfamily services was a dramatic and mem- 
orable event. Our pretesting of these retrospective questions indicated respondents 
faced little difficulty reporting these matters with the low level of precision required 
by our questions. 

⁵ An employment opportunity is the nearest employer who employs 10 or more
individuals for pay. Bus service is the nearest location where a resident could board a
public motorized vehicle and ride for a fee (including tractors and jeeps). A marketplace
is the nearest location of two or more contiguous shops where goods and services
are sold for money, including tea shops. A school is the nearest location of nonfamily
instruction and socialization aimed at children and youth. This includes religious
schools and those schools which do not yet have a physical building. A health service
outlet is any facility that provides medical services or supplies, including hospitals,
health posts, family planning clinics, and pharmacies.
TABLE 1
INDIVIDUAL-LEVEL DESCRIPTIVE STATISTICS

[Table body not recoverable from the source scan. Surviving column headings: Mean, SD.
Surviving row labels: Childhood community context (employment; any school). Timing
controls (born 1962-71, age 25-34 at survey; born 1952-61, age 35-44 at survey; born
1942-51, age 45-54 at survey). Family background (father's education, ever went to
school; father's employment, ever had paid employment; mother's children ever born;
parental contraceptive use, parents ever used). Ethnicity (high-caste Hindu). Migration.]

TABLE 2
COMMUNITY-LEVEL DESCRIPTIVE STATISTICS

                                      Mean     SD    Min   Max

Contemporary community context:*
  Employment opportunity ..........    .28            0     1
  Market ..........................    .42            0     1
  Bus stop ........................    .50            0     1
  School ..........................    .49            0     1
  Health service ..................    .28            0     1
  N of services† ..................   1.97   1.60     0     5

NOTE.—Context varies by year in the analyses. The statistics presented here are for the
171 neighborhoods in 1995.

* Nonfamily services within five-minute walk.

† Sum of above five.


table 2 for the univariate distributions of these measures; the distributions 
of these time-varying covariates pertain to the most recent year. 
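
The coding rules for contemporary community context can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the dictionary layout, and the sample walking times are hypothetical and are not drawn from the study data.

```python
# Hypothetical sketch: names and sample values are assumptions, not study data.
SERVICES = ("employer", "market", "bus_stop", "school", "health_service")

def code_context(minutes, threshold=5):
    """Code each service 1 if it is `threshold` minutes away or less, else 0."""
    return {s: int(minutes[s] <= threshold) for s in SERVICES}

def service_index(indicators):
    """Index: total number of nonfamily services within the threshold."""
    return sum(indicators[s] for s in SERVICES)

def lagged_context(history, year, threshold=5):
    """Context applied to `year` is the neighborhood's coding in the previous year."""
    return code_context(history[year - 1], threshold)

# Made-up walking times (minutes) for one neighborhood in two consecutive years.
history = {
    1994: {"employer": 20, "market": 5, "bus_stop": 3, "school": 4, "health_service": 30},
    1995: {"employer": 4, "market": 5, "bus_stop": 3, "school": 4, "health_service": 30},
}
context_for_1995 = lagged_context(history, 1995)  # built from the 1994 row
index_for_1995 = service_index(context_for_1995)
```

In this made-up case the employer that arrives in 1995 does not enter the 1995 person-year, because the lag draws on the 1994 coding.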


THE TEMPORAL SEQUENCE OF COMMUNITY CHANGE 


Before turning to our analysis of fertility limitation, we briefly describe 
the pace and order in which nonfamily organization and services spread 
through Chitwan Valley. Figure 1 presents the history of change in the 
community infrastructure of Chitwan Valley: historical time increases 
along the x-axis, and travel time in minutes increases on the y-axis. The 
lines represent, across 43 years, the average (mean for all 171 neighbor- 
hoods) time it takes to reach the nearest employment opportunity, mar- 
ketplace, school, health service outlet, or bus stop. The declining slope of 
these lines indicates that the average time to walk to each of these services 
has declined dramatically over the recent history of Chitwan Valley. The 
difference among the lines indicates which of these changes spread 
through the valley first, second, and third. 

It is clear that schools spread through the valley before other services. 
The average walking time to the nearest school declined earlier and re- 
mained lower than the average walking time to other services. In fact, 
these neighborhood history data are flexible enough to recalculate mea- 
sures of the temporal order of change in many different metrics, including 
the number of services available in any year, the number of years of 
exposure to each service within a particular distance, or the most common 
temporal sequence of the appearance of these services within a particular 
distance (Axinn et al. 1997). Each method demonstrates the same basic 
FIG. 1.—Change over time in mean minutes by foot to the nearest nonfamily service or
organization for 171 neighborhoods.


information as figure 1: schools spread through Chitwan Valley before a 
variety of other contextual changes.⁶
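
As a rough illustration of how such neighborhood histories can be summarized in different metrics, the sketch below computes the mean walking time plotted in figure 1, the number of services available in a year, and the years of exposure to a service. The data layout, function names, and values are assumptions made for the example.

```python
# Hypothetical sketch: the data layout and values are assumptions for illustration.
def mean_minutes(neighborhoods, service, year):
    """Mean walking time to `service` in `year` across all neighborhoods."""
    times = [history[year][service] for history in neighborhoods]
    return sum(times) / len(times)

def services_available(history, year, threshold):
    """Number of services within `threshold` minutes in a given year."""
    return sum(1 for minutes in history[year].values() if minutes <= threshold)

def years_of_exposure(history, service, threshold):
    """Number of observed years in which `service` was within `threshold` minutes."""
    return sum(1 for year in history if history[year][service] <= threshold)

# Two made-up neighborhoods observed in three years.
neighborhoods = [
    {1960: {"school": 90, "market": 120},
     1975: {"school": 30, "market": 80},
     1990: {"school": 10, "market": 20}},
    {1960: {"school": 70, "market": 160},
     1975: {"school": 10, "market": 100},
     1990: {"school": 10, "market": 40}},
]
school_trend = [mean_minutes(neighborhoods, "school", y) for y in (1960, 1975, 1990)]
market_trend = [mean_minutes(neighborhoods, "market", y) for y in (1960, 1975, 1990)]
```

With these invented numbers the school line falls earlier and stays below the market line, which is the pattern the text describes for the valley.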


MEASURES OF INDIVIDUAL EXPERIENCES 


The women in the CVFS were asked about a wide variety of behaviors, 
including their labor force experiences, education, use of health services, 
leisure time pursuits, marital and childbearing experiences, and use of 
contraceptive methods to limit childbearing. All interviews were conducted
in Nepalese, the most common of the languages in Nepal, and
thus the question wordings presented below are translations. A modified 
version of the life history calendar technique was used to improve the 


⁶ A common difficulty in examining these types of community services is that changes
in these services tend to occur together (Casterline 1985). We also calculated the
bivariate correlations among all of our measures of community context. All of these
correlations are positive, but none is higher than 0.51, and the largest correlation is
for minutes to the nearest bus stop and health service (not shown in tables). Most of
the other correlations are substantially less. Thus it should be possible to estimate the
independent effects of each of these dimensions of community context on contraceptive
use to limit childbearing.
accuracy of recall of the timing of key events and experiences (Axinn,
Pearce, and Ghimire 1999). 

Fertility limitation.—As discussed in our opening remarks, the main
objective of this analysis is to understand the impact of social change on 
the transition from no use of birth control to the widespread use of birth 
control to limit family size. Although a variety of contraceptive methods 
have been available in Nepal for the past 20 years, Nepal is similar to 
other South Asian countries in that the population strongly prefers per- 
manent contraceptive methods. Furthermore, even methods that could be 
used to space childbearing, such as Norplant, Depo-Provera, or the IUD, 
are generally used to stop childbearing in Nepal (Axinn 1992). In fact, 
our data show that among married women age 25-54 with at least one 
child, 98.77% who had ever used any of these contraceptive methods said 
that they wanted no more children. We also find that women in the 25-34 
age group (women who are still in their prime childbearing years) averaged 
only 0.08 more children per woman after using these methods. Although 
not all of these methods are strictly permanent (a woman could start and 
stop using Norplant, Depo-Provera, or the IUD), use of these methods 
clearly marks an intention to end childbearing in this setting. Given this 
evidence, we treat the initiation of any use of sterilization, Norplant, IUDs, 
or Depo-Provera as the initial effort to stop childbearing, and we focus 
on the timing of this transition. We code a dichotomous variable “1” the 
first time a respondent uses any of these methods. As we describe below, 
this variable is used to estimate the hazard of contraceptive use to terminate
childbearing.⁷

There has been dramatic change over time in the proportion of women 
who had ever used one of these contraceptive methods in this setting. 
Among the cohort born in 1942-51 (ages 45-54 in 1996), less than 5% 
had used contraception to terminate their childbearing by age 25. Yet 
among the cohort born in 1962-71 (ages 25-34), more than 35% had used 
contraception to terminate their childbearing by age 25. Of course, many 


⁷ To explore the sensitivity of our results to changes in the definition of this measure
of fertility limitation, we also estimated models of alternative definitions of fertility
limitation. One series of models defined contraception to limit fertility as the use of
sterilization only, and the other series defined contraception to limit fertility as the use
of any contraceptive method (including condoms, foam, pills, abstinence aimed at
avoiding births, and other methods volunteered by respondents in addition to those
included in our initial definition). The estimated effects of key measures of community
change were virtually the same in those series of models as they are in the models
described in the text, with the estimates in the text falling in between the other two
series of models (models focusing on sterilization only revealed the strongest effects of
community context). Thus, the results presented in tables provide a good summary of
the results from all analyses, and differences in the definition of fertility limitation do
not alter the substantive conclusions we reach in the text.
in the 1942-51 birth cohort eventually went on to use birth control (49%
by age 45), but the large differences across cohorts demonstrate a tremendous
change in the pace of adopting contraception. This dramatic
change in fertility-limiting behavior in Chitwan coincides with the equally 
dramatic increase in nonfamily organization in the valley. In the multi- 
variate multilevel models estimated below, we explore the direct connec- 
tions between these two social changes. 

Measures of nonfamily experiences.—We use measures from the life
history calendars to capture experiences with nonfamily work (wage em- 
ployment, salaried employment, or owning a business outside the home), 
nonfamily school (standard education or adult education), or living away 
from home (alone, in dormitories, with unrelated individuals, and other 
nonfamily possibilities). Measures from the individual questionnaire also 
capture experiences with nonfamily health services or seeing movies in a 
cinema hall.⁸

Because we separate the impact of childhood community characteristics 
from the impact of contemporary community characteristics, we construct 
two different sets of measures of individual-level nonfamily experiences. 
The first set measures nonfamily experiences that occur between the
respondent’s own birth and the birth of her first child. These measures do
not vary with time; they pertain to the entire period of life up to the point 
where the woman in question became exposed to the risk of using per- 
manent contraception (up to the point where we start the hazard model, 
see below). Each dichotomous variable is coded “1” if the woman has had 
the experience before the time of her first childbirth, and “0” if she has 
not. In addition, we also construct an index measure summing the total 
number of different nonfamily experiences each woman has had before 
the time of her first birth (see table 1 for descriptive statistics). Figure 2
shows the relationship between events in a respondent’s life and the tem- 
poral ordering of our measurement. 

The second set of measures captures women’s nonfamily experiences 
between the time of their first childbirth and the present. These measures 
are time varying, as the date of “the present” changes for every year 


⁸ A small fraction of respondents who had seen movies (2%) or gone to health posts
(4.8%) did not know the age at which they first experienced these events. Also, a small
fraction of respondents did not know if their fathers had gone to school (1.9%) or
worked outside the home (0.3%), if their parents used contraceptives (0.5%), or how
many children their mothers had (less than 0.1%). In these cases, we imputed the
values with a simple linear regression model of the observed cases with age, education,
ethnicity, and gender as predictors. Although multiple imputation procedures can
provide a more informed treatment of missing data, when missing cases are few, as they
are with our data, this simple single imputation method is generally reasonable
(Schafer 1997).
FIG. 2.—Relationship between respondent's life history and temporal ordering of
measurement.


included in our dynamic event history models. Because we control for 
previous experiences, these measures capture the additional impact of 
contemporary experiences beyond the impact of early life experiences. 
Each dichotomous variable is coded “1” if the woman has had the ex- 
perience after her first childbirth but before the period being examined, 
and “0” if she has not. We construct one time-varying dichotomous mea- 
sure for each nonfamily experience: work, residence, school, health care, 
and seeing a movie. We also constructed an index measure summarizing 
the total number of nonfamily experiences each woman had during the 
interval (see table 1 for descriptive statistics). 
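
The two sets of individual-level measures can be sketched as follows. This is a hypothetical illustration: the record layout, the function names, and in particular the treatment of the year of first birth as the start of the "after first childbirth" window are assumptions, not the study's documented coding.

```python
# Hypothetical sketch: field names, boundary handling, and sample values are assumptions.
EXPERIENCES = ("work", "residence", "school", "health_care", "movie")

def early_flags(years_by_experience, first_birth_year):
    """Time-invariant: 1 if the experience occurred before the year of first birth."""
    return {e: int(any(y < first_birth_year for y in years_by_experience.get(e, ())))
            for e in EXPERIENCES}

def later_flags(years_by_experience, first_birth_year, current_year):
    """Time-varying: 1 if the experience occurred from the first birth up to,
    but not including, the year being examined."""
    return {e: int(any(first_birth_year <= y < current_year
                       for y in years_by_experience.get(e, ())))
            for e in EXPERIENCES}

def experience_index(flags):
    """Index: total number of different nonfamily experiences flagged."""
    return sum(flags.values())

# Made-up calendar: years in which one woman reported each experience.
woman = {"school": [1975, 1976], "movie": [1979, 1988], "work": [1990]}
early = early_flags(woman, first_birth_year=1982)
later_1989 = later_flags(woman, 1982, 1989)
later_1991 = later_flags(woman, 1982, 1991)
```

The early flags are computed once per woman, while the later flags are recomputed for every person-year, so the work experience in 1990 appears in the 1991 record but not in the 1989 record.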

Controls.—To properly specify our models, we control for characteristics
of the respondents that may influence both the independent variables
and the likelihood of permanent contraceptive use. We use this strategy
to help ensure that the observed associations between the independent
variables and permanent contraceptive use are not spurious.

We control for the respondent’s birth cohort because the use of con- 
traceptive methods has increased over time, as has the prevalence of 
nonfamily organizations. We use three birth cohorts: 1962-71 (ages 25-34
at the survey), 1952-61 (ages 35-44 at the survey), and 1942-51 (ages 
45-54 at the survey). For each respondent, one of these variables is coded 
“1,” and the other two are coded “0.” The youngest birth cohort is the 
reference group in the analysis, and thus the effects of the other two 
measures can be interpreted as the effect of being born in an older birth
cohort compared to being born in the 1962-71 cohort on the hazard of 
permanent contraceptive use.⁹ Finally, we control for years since the first


⁹ To test for the sensitivity of our results to changes in the age groups included in our
analyses, we also estimated models that included women in the cohort aged 15-24. 
Childbearing in this age group is rare relative to other age groups, so few of these 
women were exposed to the risk of permanent contraceptive use. However, including 
birth (a countervariable) and years since the first birth squared (the square
of the countervariable). Because we start the period of risk at the time 
of first birth, these start at zero. These measures control the baseline 
hazard of contraceptive use, which increases with time after the first birth 
and then decreases at much older ages. 
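
A minimal sketch of these timing controls, assuming hypothetical variable names, might look like this. The youngest cohort is left without a dummy because it serves as the reference group.

```python
# Hypothetical sketch: variable names are assumptions.
def cohort_dummies(birth_year):
    """Dummies for the two older cohorts; 1962-71 is the omitted reference group."""
    return {
        "born_1952_61": int(1952 <= birth_year <= 1961),
        "born_1942_51": int(1942 <= birth_year <= 1951),
    }

def duration_terms(year, first_birth_year):
    """Years since first birth (zero in the year of first birth) and its square."""
    t = year - first_birth_year
    return t, t * t
```

With a positive coefficient on the counter and a negative coefficient on its square, the fitted log-odds rise after the first birth, peak, and then decline, which is the baseline shape described in the text.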

In our models of contemporary community characteristics, we also con- 
trol for multiple measures of women’s childbearing experiences. These 
measures are all time varying: the total number of children ever born 
(live births), total number of sons, and the number of live births who 
died. We also control for whether the respondent lived with her husband 
in the prior year. Because their risk of pregnancy is low, women who 
have not been living with their husbands are unlikely to use a permanent 
contraceptive method. 

We also control for ethnicity. Ethnicity in Nepal is complex, multifac- 
eted, and interrelated with religion. A full description of the ethnic groups 
residing in this setting is beyond the scope of this article (for detailed 
descriptions of these groups, see Acharya and Bennet 1981; Bista 1972; 
Fricke 1986; Gurung 1980), but we control for five classifications of eth- 
nicity. This is important because each group has different propensities to 
use permanent contraceptives and different experiences with nonfamily 
opportunities.¹⁰

The first two ethnic groups are high-caste Hindus and low-caste Hin- 
dus. High-caste Hindus are an elite group in Nepalese society who have 
historically had the most power and access to the most opportunity, in- 
cluding nonfamily opportunities (Acharya and Bennet 1981). Low-caste 
Hindus have enjoyed fewer opportunities but identify themselves as 
Hindu. Both groups’ ancestors originate from India. The third group, 
Newars, are a Tibetan-origin group who practice a mixture of Buddhism 
and Hinduism (Acharya and Bennet 1981). They are also an elite group 
in Nepal, and average education among Newars rivals that of high-caste 
Hindus. The fourth group, hill Tibeto-Burmese, are also of Tibetan origin, 
but they tend to practice Buddhism. This group includes people such as 
the Tamang, the Gurung, and the Magar (Fricke 1986). The last group, 
the terai Tibeto-Burmese, are the original inhabitants of the Chitwan 
Valley. They include the Tharu, the Derai, and the Kumal. They are 
indigenous jungle dwellers who adopted sedentary agriculture in the 1950s 
when the valley was cleared and converted to farmland. These people 


those who were exposed to the risk in our analysis had little influence on our estimates
of the impact of community change.

¹⁰ We also explored the possibility of interactions between ethnic identity and the impact
of social changes. However, this investigation failed to uncover any statistically sig- 
nificant or consistent interactions between our measures of community context (in both 
childhood and adulthood) and our measures of ethnic identity. 
have been much less able to take advantage of the social changes occurring
around them than other groups. They are, on average, much less educated, 
have higher rates of infant mortality, and own less land than the other 
ethnic groups. 

We control for migration with several approaches. Although most multi- 
level modeling does little to incorporate migration, it is an important threat 
to the validity of conclusions: migrants may not be exposed to the same 
community (or contextual) characteristics as nonmigrants. To address this 
issue, we estimate three different types of models. The first ignores mi- 
gration and estimates models with no attention to this issue as in most 
previous research. We discuss but do not present these models. The second 
approach to migration adds two controls: (1) a time-invariant measure of 
any migration experience between birth and the birth of the respondent’s 
first child and (2) a time-varying measure of whether the respondent is 
outside the neighborhood during the contemporary period being analyzed. 
We display these results in the tables described below. The third approach 
to migration keeps the control for migration experience before the birth 
of the first child, but it omits from the analysis all person-years in which 
the respondent resided outside the neighborhood being studied. As an 
extension of this approach, we also estimate models eliminating all persons 
who lived outside the neighborhood. We discuss these results but do not 
present them.¹¹
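
The third approach and its extension amount to two different filters on the person-year file. The sketch below assumes a hypothetical record layout with an `outside_neighborhood` flag, and it reads "persons who lived outside the neighborhood" as women with at least one observed person-year outside it; both are assumptions made for illustration.

```python
# Hypothetical sketch: record layout and field names are assumptions.
def drop_years_outside(person_years):
    """Third approach: omit person-years spent outside the study neighborhood."""
    return [r for r in person_years if not r["outside_neighborhood"]]

def drop_ever_outside(person_years):
    """Extension: omit every woman with any person-year outside the neighborhood."""
    movers = {r["id"] for r in person_years if r["outside_neighborhood"]}
    return [r for r in person_years if r["id"] not in movers]

# Made-up person-year records for two women.
rows = [
    {"id": 1, "year": 1990, "outside_neighborhood": False},
    {"id": 1, "year": 1991, "outside_neighborhood": True},
    {"id": 2, "year": 1990, "outside_neighborhood": False},
    {"id": 2, "year": 1991, "outside_neighborhood": False},
]
kept_years = drop_years_outside(rows)
kept_stayers = drop_ever_outside(rows)
```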

Finally, we control for family background with several measures: fa- 
ther’s education (ever went to school), father’s employment (ever had 
nonfamily employment before respondent’s age 12), mother’s children 
ever born, and parents ever used a contraceptive. These measures may 
affect both the individual women’s use of contraceptives to stop child- 
bearing and parental choice of community residence. Parents with edu- 
cation or work experience may move to neighborhoods with greater access 
to schools, health posts, and other institutions. Thus, controls for these 
dimensions of family background help to reduce the chances that observed 
effects of childhood context are spurious. 


Analytic Strategy 

A life course perspective organizes our models. We first model the impact 
of characteristics that are established at birth and then treat these char- 
acteristics as controls in all subsequent analyses (not shown in tables). 
Then we add childhood community characteristics to the models. Next 


¹¹ Note that our individual interview-based measures of childhood community context
are not threatened by migration because all respondents, migrants and nonmigrants
alike, provide individual reports of the characteristics of their childhood community.
we add early adult experiences with nonfamily social activities. Then we
add current community characteristics to the model to estimate the unique 
contribution of these circumstances. Finally, we add measures of current 
nonfamily experiences. 

We use event history methods to model the risk of adopting a permanent 
contraceptive method. Because our data are precise to the year, we use 
discrete-time methods to estimate these models. Person-years of exposure 
are the unit of analysis. We consider women at risk of using a permanent 
contraceptive method after they are married and have at least one child. 
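
Building the person-year file can be sketched as follows. The function name and record layout are hypothetical, and the sketch assumes that risk begins in the year of first birth and that first use, if any, does not precede it.

```python
# Hypothetical sketch: names and layout are assumptions, not the study's code.
def person_years(first_birth_year, first_use_year, survey_year):
    """One record per year at risk, starting at first birth and ending in the
    year of first use (event = 1) or, if never used, the survey year (censored)."""
    last_year = first_use_year if first_use_year is not None else survey_year
    return [
        {
            "year": year,
            "years_since_first_birth": year - first_birth_year,
            "event": int(year == first_use_year),
        }
        for year in range(first_birth_year, last_year + 1)
    ]

user = person_years(1985, 1988, 1996)       # first use three years after first birth
censored = person_years(1990, None, 1996)   # never used by the survey year
```

A woman contributes no further records after her first use, so each woman has at most one record with the event coded 1.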

To estimate these discrete-time models, we use logistic regression in the 
form: 


    \ln\left(\frac{p}{1 - p}\right) = a + \sum_k B_k X_k,

where p is the yearly probability of using a permanent contraceptive
method, p/(1 - p) is the odds of the contraceptive use occurring, a is a
constant term, B_k represents the effects parameters of the explanatory
variables, and X_k represents the explanatory variables in the model. This
approach to discrete time estimation of the hazard model is described in 
detail elsewhere (Peterson 1986, 1991; Allison 1982, 1984). 
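
To make the link between the logit and the yearly probability concrete, the sketch below inverts the equation for made-up parameter values. The constant of -4.0 and the coefficient of 0.5 are purely illustrative assumptions and are not estimates from the study.

```python
import math

def yearly_probability(a, coefs, xs):
    """Invert the logit: p = 1 / (1 + exp(-(a + sum_k B_k X_k)))."""
    logit = a + sum(b * x for b, x in zip(coefs, xs))
    return 1.0 / (1.0 + math.exp(-logit))

def odds(p):
    """Odds corresponding to probability p."""
    return p / (1.0 - p)

# Made-up values: a constant of -4.0 and one dichotomous covariate with B = 0.5.
p0 = yearly_probability(-4.0, [0.5], [0])
p1 = yearly_probability(-4.0, [0.5], [1])
ratio = odds(p1) / odds(p0)  # equals exp(0.5) up to rounding error
```

The ratio of the two odds depends only on the coefficient, which is why exponentiated coefficients can be read as multiplicative effects on the odds.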

Finally, because our models of contemporary community characteristics 
are hierarchically clustered, with several individuals living in the same 
neighborhood, we also estimated these models with a technique that takes 
this structure of the data into account (MLwiN). Recent research dem- 
onstrates it is possible to estimate these multilevel, discrete-time hazard 
models with time-varying covariates at both the individual and contextual 
level using software such as MLwiN (Barber et al. 2000). The results we 
present in the tables have been calculated using MLwiN and thus are 
properly specified for the multilevel nature of the data. 
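
One common way to write a model of this kind is as a random-intercept extension of the logistic equation given earlier. The notation below is an assumption for illustration, since the text does not spell out the multilevel specification: p_{tij} is the yearly probability for woman i in neighborhood j in year t, and u_j is a neighborhood-specific deviation from the constant term, assumed here to be normally distributed.

```latex
\ln\left(\frac{p_{tij}}{1 - p_{tij}}\right) = a + \sum_k B_k X_{ktij} + u_j,
\qquad u_j \sim N(0, \sigma_u^2)
```

Under this reading, \sigma_u^2 is the variance of the intercept across neighborhoods.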


RESULTS 
The Impact of Childhood Community Context 


In table 3, we present estimates of the effects of early childhood com- 
munity context on the rate of permanent contraceptive use. We trans- 
formed the raw logistic coefficients by exponentiating them; the coeffi- 
cients we display are multiplicative effects on the odds of using 
contraceptives in any one-year interval. A coefficient of 1.00 represents 
no effect, a coefficient greater than 1.00 represents a positive effect on the 
odds, and a coefficient less than 1.00 represents a negative effect on the 
odds. Because the frequency of events in any one time interval is quite 
small, the odds of contraception are very similar to the rate of contraception,
and we discuss our results in terms of rates.

Each context measure indicates whether the respondent had that facility 
within an hour’s walk at any time through age 12. All of these early life 
measures of nonfamily organizations and services have a positive effect 
on the rate of fertility limitation: employment opportunities, markets, 
buses, schools, and health posts. For example, the odds ratio of 1.18 for 
employers means that women who had a wage labor employer within an 
hour’s walk of their homes during childhood used permanent contraception
to terminate their childbearing at rates 18% higher than women without 
nearby employers. 
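
The arithmetic behind this reading can be sketched briefly. The helper names are hypothetical, and the yearly probability of 0.02 used to compare odds with probabilities is a made-up value chosen only to represent a rare event.

```python
import math

def odds_ratio(raw_coefficient):
    """Exponentiate a raw logistic coefficient to get a multiplicative effect."""
    return math.exp(raw_coefficient)

def percent_change(ratio):
    """Percentage difference in the odds implied by an odds ratio."""
    return (ratio - 1.0) * 100.0

def odds(p):
    """Odds corresponding to probability p."""
    return p / (1.0 - p)

employer_pct = percent_change(1.18)  # the 1.18 reported for employers
rare_gap = odds(0.02) - 0.02         # odds minus probability for a rare event
```

When the yearly probability is small, the gap between the odds and the probability is negligible, which is the basis for discussing odds ratios as differences in rates.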

In the last column, we present our composite measure, which is a simple 
sum of the previous five dichotomous measures, and as expected, it too 
has a positive effect. In fact, because the coefficient is multiplicative, the 
magnitude of this effect is quite large. For example, women with all five 
nonfamily services within a one-hour walk had rates of contraception 
61% higher (1.10 raised to the power of 5) than women without any of 
these services nearby.¹² We investigated various functional forms for the
effect of this composite measure. However, the impact of additional nearby 
nonfamily services increases monotonically, and the linear form appears 
to be the most parsimonious summary of these effects. 
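
Because the index enters the model linearly on the log-odds scale, each additional service multiplies the odds by the same factor. A short check of the figure quoted above, using a hypothetical helper name:

```python
def cumulative_ratio(per_unit_ratio, units):
    """Odds ratio for a difference of `units` on a linearly entered index."""
    return per_unit_ratio ** units

all_five = cumulative_ratio(1.10, 5)   # five services versus none
pct_higher = (all_five - 1.0) * 100.0  # about 61 percent
```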

The effects of the controls are as expected. Net of the other variables, 
older cohorts used contraception at lower rates than younger cohorts. 
In the first column, the odds ratio of 0.20 for the oldest cohort (born
1942-51, currently age 45-54) means that the youngest cohort (born 1962-71,
currently age 25-34) had fertility limitation rates five times higher (1/.20 
= 5.00)—a large difference that has occurred over just a generation. The 
only family background control that was significant was father’s em- 
ployment, which had a positive significant effect on the rate of contra- 
ception. Other family background factors were not statistically significant. 
Ethnicity did affect rates of contraceptive use. For example, low-caste
Hindus, Hill Tibeto-Burmese, and Terai Tibeto-Burmese had rates of 
contraception use lower than high-caste Hindus. We also tested for in- 
teractions between ethnic identity and our measures of social changes (not 
shown in tables). These tests revealed no evidence to suggest that these 
ethnic groups differentially respond to social change or community context 


¹² In order to ensure these effects were not produced by access to health services, which
in our study include all sources of contraceptive methods, we also estimated a version
of this final model that included a summary measure of access to all four nonfamily
services other than health services and a separate control for access to health services.
In this version of the model, the index measure of number of nonfamily services
within an hour’s walk continues to be statistically significant and of comparable
magnitude.
in terms of their fertility limiting behaviors (net of family background
controls). 

Finally, at the bottom of table 3, note our estimate of the variance of 
the intercept across neighborhoods. This random effect suggests that there 
is variation between neighborhoods in the intercept that is not explained 
by the characteristics included in our model. The size of the variance 
component, however, is small relative to the magnitude of the intercept 
term: in our models, the untransformed intercept varies from ln(.020) to
ln(.026), or -3.91 to -3.65, which is large relative to the estimate of the
variance component (from 0.32 to 0.34 in our models). 
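
The logarithms quoted here are easy to verify. The sketch below also takes the square root of the smaller variance estimate, which is an added step not in the text, to express it as a standard deviation on the same log-odds scale as the intercept.

```python
import math

low_intercept = math.log(0.020)    # about -3.91
high_intercept = math.log(0.026)   # about -3.65
neighborhood_sd = math.sqrt(0.32)  # standard deviation implied by a variance of 0.32
```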

As the results show, childhood community context significantly affected 
a process that occurred many years later: the rate of contraceptive use. 
This suggests that the community context in which individuals are raised 
has an important long-term influence on individual behavior. Hypotheses 
focused on the impact of community context on the costs and benefits of 
childrearing imply strong effects of current adult community character- 
istics. The findings displayed in table 3 are more consistent with hypoth- 
eses centering on the impact of social context in childhood on the long- 
term personality development of individuals (Yabiku et al. 1999). 
Nonfamily organizations and services in childhood alter the social or- 
ganization of daily life so that many activities occur outside the family. 
This type of childhood family experience produces long-term preferences 
for smaller families and stimulates preferences for nonfamily pursuits that 
compete with childbearing. The long-term consequences are higher rates 
of fertility limitation in adulthood. 


Early Life Nonfamily Experiences 


In table 4, we examine the effect of early life nonfamily experiences. These 
dichotomous measures indicate whether the respondent ever participated 
in any of these activities before her first child was born: lived away from 
family, worked outside the home, ever went to school, went to a health 
post, saw a movie. Visiting nonfamily health services and seeing movies 
have positive effects on the rate of fertility limitation: women who saw 
a movie had 29% higher rates of contraception than women who did not, 
and women who went to health posts had 25% higher rates of contra- 
ception than women who did not. 

The strong positive effect of visiting a health post on contraceptive use 
may not surprise readers, as these health posts are the main sources of 
contraception in developing countries. However, recall that these early 
life experiences are measured before women had their first birth and 
therefore are extremely unlikely to be the source of the contraceptive 
methods used much later. Instead, we interpret these effects as the con- 
sequences of personal use of nonfamily health services early in life. Thus
the strong positive effects of visits to nonfamily health care facilities are 
just as impressive as the strong effects of nonfamily leisure activities. 

Nonfamily work, living, and school before the first birth, however, do 
not have statistically significant effects on fertility limitation. This may 
be because in our setting these experiences are not common among women, 
especially nonfamily living. Nonfamily living was experienced by only 
3% of the women before their first birth. Another possibility is that hus- 
bands’ nonfamily experiences are more important predictors of marital 
childbearing behavior in this setting. Previous research in Nepal suggests 
that husbands’ nonfamily experiences are much more likely to influence 
fertility limitation than women’s own experiences (Axinn 1992). We ex- 
plore this possibility in greater depth below. Another possibility is that 
we have failed to measure the key individual-level experiences that trans- 
mit the effects of community context. This is particularly likely if the 
connection between childhood community context and birth control in 
adulthood is caused by other experiences (such as travel or consumption) 
or ideational factors (such as the spread of new values about childbearing 
and contraception). Unfortunately, the full array of possible intervening 
mechanisms is beyond the scope of the present analysis. 

Our composite indicator of childhood community context has a signif- 
icant, positive effect even when measures of early nonfamily experiences 
are included in the same model. This suggests childhood community context
effects are largely independent of these early nonfamily experiences.¹³


Contemporary Community Context 


Table 5 displays the relationship between contemporary community context
and fertility limitation. These current measures are a series of dichotomous
variables indicating whether the respondent lives in a neighborhood
with the nonfamily organization or service located within a five-minute
walk. These contemporary measures are time varying; they change
as the neighborhood changes.

An additional control in this model is whether the respondent currently 


¹³ Although not presented here, we also tested each of the childhood community
characteristics measures while controlling for our composite index measure of early
nonfamily experiences. The results are generally the same. Although some early childhood
context measures are no longer statistically significant (employment, bus stops, health
services), they are far from being reduced to zero. Furthermore, the other dimensions
of community context (markets, schools) continue to have significant effects, even
controlling for early nonfamily experiences. This implies that the effects of childhood
community context are not explained by these particular measures of early nonfamily
experience.
lives in the neighborhood during the period analyzed and if she is currently
living with her spouse. We also control for women’s previous childbearing 
experiences, including the number of children ever born, the number of 
sons ever born, and the number of children born who died. 

All the contemporary measures have the expected positive association 
with contraceptive use. When these nonfamily institutions are near re- 
spondents, they are more likely to limit their fertility. For example, the 
1.41 odds ratio for markets means that when a market is within a five- 
minute walk, the rate of contraception to stop childbearing increases by 
41%. In the last column, we present a composite measure that indicates 
the number of institutions within a five-minute walk of the respondent. 
As expected, it is positive—the more nonfamily organizations and services 
that are within five minutes of the respondent, the higher her rate of 
contraceptive use. This association is quite large; the coefficient of 1.12 
means that women who live in a neighborhood with all five services have 
a 76% higher contraception rate than women who have none of the five 
services within a five-minute walk (1.12 raised to the fifth).™ 
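
The arithmetic behind these percentages can be checked with a short sketch. It assumes, as the text implies, that the reported multiplicative effects compound across services. The function names are illustrative and are not part of the original analysis.

```python
def percent_change(ratio: float) -> float:
    """Convert a multiplicative effect (e.g., an odds ratio) into a percent change."""
    return (ratio - 1.0) * 100.0


def compounded_ratio(per_unit_ratio: float, units: int) -> float:
    """Compound a per-unit multiplicative effect over several units."""
    return per_unit_ratio ** units


# Values reported in the text: 1.41 for markets, 1.12 per nearby service.
market_effect = percent_change(1.41)
five_service_effect = percent_change(compounded_ratio(1.12, 5))

print(f"market within a five-minute walk: +{market_effect:.0f}%")
print(f"all five services versus none: +{five_service_effect:.0f}%")
```

Because the effects are multiplicative, five services raise the rate by more than five times the 12% associated with a single service.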

The effects of childhood community context and early nonfamily ex- 
periences remain significant in table 5, even with the addition of current 
community characteristics. The composite early nonfamily experiences 
measure is significant in all the models in table 5. This is important 
because it suggests that the effect of these early experiences is not ex- 
plained by the contemporary community context.” 

The result that both childhood community context and contemporary 
community context exert independent influences on women’s childbearing 
behavior is even more fascinating. Characteristics of childhood and adult 
community context are positively correlated, but in this setting of dramatic 
social change, there are also substantial differences. In addition to the 
long-term consequences of childhood context, the opportunities and con- 
straints associated with contemporary context also shape the adult de- 
cision making of these women. Contemporary context, however, does not 
explain the effects of childhood context. Instead these later characteristics 
add to the overall impact of community surroundings on individual be- 
havior. Thus we conclude that both early childhood and contemporary 


“ Once again, in order to ensure these effects were not produced by access to health
services, which in our study include all sources of contraceptive methods, we also
estimated a version of this final model that included a summary measure of access to
all four nonfamily services other than health services and a separate control for access
to health services. In this version of the model, the index measure of the number of
nonfamily services within a five-minute walk continues to be statistically significant
and of comparable magnitude.

* The effect of early nonfamily experiences grows slightly as a result of adding controls 
for childbearing experiences (comparison not presented in the tables). 
adult community context simultaneously affect behavior. This conclusion
points toward contextual models of behavior that incorporate a much 
more comprehensive view of contextual change over the life course than 
most social science models have thus far considered. 

We also include previous childbearing experiences in these models, and 
they influence fertility limitation as expected. The more children a woman 
has, the more likely she is to limit her fertility: each child born increases 
the rate of contraceptive use by about 51%. For each family size, the 
number of boys in the family also increases fertility limitation. Each boy 
increases the contraception rate by about 64%. Having children who died 
reduces the rate of contraception.'* We also included a measure of whether 
the respondent was living with her husband in each year of exposure. If 
her husband was away for some reason (such as working in a distant 
village or even a different country), then a woman had lower odds of 
contraceptive use. Women had rates of contraception 66%–68% higher
when living with their spouses. 


Contemporary Nonfamily Experiences 


We also examined how contemporary nonfamily experiences affect fertility 
limitation. Unfortunately, very few women who had not participated in these
activities before their first birth participated in them for the first time 
after their first birth.'” As one might expect, given the small number of 
women who engaged in more nonfamily activity after the birth of their 
first child, the effects of these experiences on rates of contraception were 
both small and statistically insignificant. As such, they do not alter any 
of the other effects in the model (presented in table 5), and we do not 
present those estimates here. 

We also tested models incorporating measures of the women's hus- 
bands’ nonfamily experiences: nonfamily work, living, and school. We 
hypothesized that these activities would decrease the demand for children 
and therefore increase fertility limitation. Two of the activities are in the 
expected direction: husbands’ nonfamily work and school before the birth 
of the first child increase the couple’s rate of fertility limitation. School 
has a strong significant effect, while the effect of nonfamily work is pos- 


4 Including these controls provides a conservative test of the impact of contemporary
community context, because some of the total effect of context on contraception
operates via these childbearing experiences. However, the differences in our estimates
when these are omitted are minor (analysis not shown in the tables). The effects of
community characteristics become slightly larger.

1 In fact, 96% of the women did not experience any nonfamily living after their birth 
until the time of contraceptive use or censoring, and 91% of the women did not have 
any schooling from birth until contraceptive use or censoring. 
itive but not statistically significant (this analysis is not shown in the
tables). 

Even with these controls for husbands’ experiences, however, the effects 
of community context remain the same in magnitude and statistically 
significant. Composite index measures of childhood community charac- 
teristics and contemporary community characteristics are both significant, 
in the expected direction, and have the same magnitude as the estimates 
presented in table 5. Neither childhood nor contemporary contextual ef- 
fects are explained by our measures of these adult experiences. 

Clearly, we have not captured the individual-level experiences that 
transmit these contextual effects in our analysis. Both childhood and con- 
temporary community characteristics affect fertility limitation, but to find 
the mechanisms that explain these effects, we must look beyond the limited 
set of nonfamily experiences analyzed here. This result may have several 
conceptual implications. Other nonfamily experiences, beyond those we 
have measured, may be important. It may be the combined effect of 
multiple people’s experiences, including family members or neighbors, 
that link these contextual changes to fertility limitation. The measures of 
experience used here may be too imprecise to capture the key dimensions 
of nonfamily experience. For example, agricultural wage work and other 
forms of nonfamily work may be different experiences, yet they are treated 
the same in the analysis. Or, it may be that factors other than experience, 
such as plans, attitudes, expectations, and aspirations, connect these contextual
changes to individual fertility limitation behavior.


The Impact of Migration on These Estimates 


The movement of individuals from place to place makes it inherently 
difficult to estimate the effects of community context on individual be- 
havior in any study. In the estimates presented above, we used information 
from detailed life histories of migration to construct measures of these 
experiences and control for them in multivariate models. This approach 
essentially imputes the characteristics of a migrant’s current neighborhood 
with the past characteristics of the neighborhood where she resided at 
the time of the survey, and it adds a control for the person-years in which 
this has been done. However, we also explored a variety of other ap- 
proaches for incorporating migration into our models. 

First, we estimated the models ignoring women’s migration experiences, 
as in most previous efforts to estimate multilevel models of social behavior. 
This was accomplished by removing our controls for migration experi- 


” Note that operationalizing nonfamily work experience as years of nonfamily work 
does not alter these results. 
ences from the multivariate models. Our estimates of community influ-
ences on contraceptive behavior remained nearly the same. Second, we 
reestimated our models eliminating all person-years for which the woman 
in question lived outside the neighborhood that was measured. Again, 
the results remained nearly the same, and the substantive conclusions 
based on those results remained exactly the same. Third, we reestimated 
our models eliminating all persons who had ever lived outside the sampled 
neighborhood during the period being analyzed. In this version, our es- 
timated effects grow, so that community context has even stronger effects 
on behavior than those displayed in table 5. Thus, the estimates generated 
by each of these approaches to the migration problem lead to the same 
substantive conclusion, and we are confident migration has not under- 
mined the reliability of our conclusions. 
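
The three alternative treatments of migration described above amount to different restrictions on the person-year file. The sketch below illustrates them on a toy data set. The record layout, the field name `outside`, and the function names are hypothetical and are not taken from the original analysis.

```python
from typing import Dict, List

PersonYear = Dict[str, object]

# Hypothetical person-year records; "outside" marks years lived away from
# the measured neighborhood (the years whose context is imputed).
records: List[PersonYear] = [
    {"woman": "A", "year": 1, "outside": False},
    {"woman": "A", "year": 2, "outside": False},
    {"woman": "B", "year": 1, "outside": True},
    {"woman": "B", "year": 2, "outside": False},
]


def ignore_migration(rows: List[PersonYear]) -> List[PersonYear]:
    """Approach 1: keep every person-year but drop the migration control."""
    return [{k: v for k, v in r.items() if k != "outside"} for r in rows]


def drop_outside_years(rows: List[PersonYear]) -> List[PersonYear]:
    """Approach 2: remove person-years lived outside the measured neighborhood."""
    return [r for r in rows if not r["outside"]]


def drop_ever_migrants(rows: List[PersonYear]) -> List[PersonYear]:
    """Approach 3: remove every woman who ever lived outside that neighborhood."""
    movers = {r["woman"] for r in rows if r["outside"]}
    return [r for r in rows if r["woman"] not in movers]


for approach in (ignore_migration, drop_outside_years, drop_ever_migrants):
    print(approach.__name__, len(approach(records)))
```

Each restriction is progressively stricter, so the same model can be reestimated on each resulting file and the community-context estimates compared.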


DISCUSSION 


We have found strong empirical evidence that new nonfamily services 
and organizations near communities increase fertility limitation of women 
residing in those communities. As revealed in the effects of childhood and 
contemporary context, nonfamily organizations affect these women across 
the life course. This outcome is what the family mode of social organi- 
zation framework predicts for settings similar to the one studied here. In 
this rural Nepalese setting, families in the past organized most of the 
fundamental social activities of life, and women practiced unlimited child- 
bearing. As nonfamily services and organizations spread, many activities 
of daily social life relocated outside the family, which reduced parents’ 
motivations to bear and rear children. As a result, women who live closer 
to nonfamily services and organizations have higher rates of contraceptive 
use to limit fertility. The results, however, raise several important 
questions. 

Why have we found strong empirical evidence for contextual effects 
on childbearing behavior when many previous studies have found rela- 
tively sparse evidence of such effects? Two features of our data and anal- 
ysis stand out. First, we operationalize the concept of context at an ex- 
tremely local level that is likely to have a great deal of meaning in women’s 
day-to-day lives: the immediate neighborhood. This strategy is different
from some previous multilevel studies in which context is operationalized 
at the level of province (Hirschman and Guest 1990), metropolitan sta- 
tistical area (South and Crowder 1997), town (Massey and Espinosa 1997), 
or census tract (Brooks-Gunn et al. 1993). Higher levels of aggregation 
reduce the likelihood that measured characteristics of context are salient 
in individuals’ daily lives. Note that in the Nepalese context most of daily 
social life takes place within the immediate neighborhood. In other con-
texts, such as the United States, daily social life may take place across 
many different organizations, institutions, or locations (e.g., schools, workplaces,
places of worship, clubs, or associations). In settings such as these,
the approach we advocate prioritizes measurement of contextual features 
that have the most relevance in individuals’ daily lives, even if those 
contextual features are outside the immediate neighborhood. 

Second, in our analysis, we use dynamic contextual modeling to in- 
corporate both change in community context and change in individual 
lives. Time-varying measures at the individual level are standard practice, 
but because the data demands are high, dynamic measures at the con- 
textual level are extremely rare (Massey and Espinosa [1997], e.g., is an 
exception). Dynamic measures at both individual and contextual levels, 
however, are critical because they preserve a temporal order in the analysis 
that is consistent with the hypotheses. 

Why do we find that both childhood and contemporary community 
contexts have independent effects on childbearing behavior? Scholars 
have long explored the relative importance of early versus current en- 
vironments on behavior (Mannheim 1952; Elder 1974; Alwin and Thorn- 
ton 1984), and it seems that both have strong effects in our setting. A 
number of scholars have argued that in spite of possible long-term con- 
sequences of childhood environments, contemporary circumstances are a 
more important determinant of behavioral choices (Kohn 1983; Ryder 
1965). Theoretical perspectives on fertility that focus on the costs and 
benefits of childrearing are closely tied to this view (Becker 1991; Coleman 
1990; Easterlin and Crimmins 1985; Notestein 1953). In the microecon- 
omic view of childbearing behavior, it is cost and benefit tradeoffs at the 
time of childbearing decisions that structure parents’ decisions to ter- 
minate childbearing. The idea that these tradeoffs are key implies that 
characteristics of the community in adulthood have important effects on 
fertility, not characteristics of the community long ago, when the decision 
maker was a child. Some of the results we find are evidence in support 
of this idea—we find strong effects of contemporary community context 
on fertility limitation. 

Of course, childhood circumstances may simply predict later adult cir- 
cumstances. The fact that the effects of childhood community context do 
not diminish once we add measures of adult context is evidence against 
this hypothesis. Given the enduring and independent effects of childhood 
neighborhood context on adult childbearing decisions, our evidence is 
more consistent with nonmicroeconomic explanations of fertility. A num- 
ber of sociologists have advanced various types of diffusion theories, gen- 
erally stressing social interactions as key social changes that produce fer- 
tility declines, as alternatives to perspectives that emphasize the costs and 
benefits of childrearing (Anderson 1986; Cleland and Wilson 1987; Bon-
gaarts and Watkins 1996). Because the diffusion of new ideas may work 
through both childhood and adult processes, it is difficult to use differences 
in timing across the life course to adjudicate between ideational diffusion 
and microeconomic processes. However, our evidence of independent ef- 
fects of childhood contextual circumstances clearly points toward a set of 
ideational mechanisms that are separate from adult cost-benefit compar- 
isons, that is, the long-term development of attitudes and preferences. 

This mechanism has been overlooked in nearly all previous studies of 
macrolevel social change and individual childbearing behavior. The fam- 
ily mode of social organization framework we use here draws our attention 
to this mechanism because recent research has demonstrated long-term 
effects of the social organization of families in childhood on individual 
personality (Yabiku et al. 1999). Social activities organized around the 
family tend to integrate family members together, which affects the en- 
vironment in which children are raised. Theory predicts individuals who 
have high levels of family integration in childhood are likely to prefer
earlier family formation and larger family sizes or, conversely, social 
changes that remove activities from the family reduce family integration 
and stimulate preferences for smaller families and later family formation 
(Yabiku et al. 1999). This mechanism of social change is ideational in its 
nature, but it differs from mechanisms that rely on the diffusion of new 
ideas among adults. By contrast, this mechanism implicates the social 
organization of families in childhood and how it shapes the long-term 
personality characteristics of children. 

This mechanism is consistent with more recent theoretical work in
economics that predicts that early context forms individual preferences 
and tastes that endure across life (Becker 1996), and later context rep- 
resents the constraints in which individuals reveal their preferences. This 
work argues that even as they encounter new circumstances and expe- 
riences, individuals may cling to past preferences simply because it is too 
costly to derive new ones (Becker 1996, p. 127). The enduring effects of 
childhood circumstances are central to Easterlin’s theories of childhood 
socialization, cohort size, economic opportunity, and fertility (Easterlin 
1987). Thus ideational mechanisms of change linked to individual per- 
sonality are neither antithetical to microeconomic perspectives on child- 
bearing behavior, nor unheard of among the economists most closely as- 
sociated with the application of microeconomic theory to childbearing 
behavior. In fact, if researchers are to explain how social change affects 
childbearing behavior, we will almost certainly need to integrate both 
microeconomic and ideational perspectives. Although it has become com- 
mon to pose these two sets of fertility theories as competing alternatives, 
these social processes probably work closely together, reinforcing each
other over time (Lesthaeghe and Surkyn 1988). 

The modes of social organization framework is a fundamental theo- 
retical tool for studies of childbearing behavior because it integrates cost- 
benefit comparison and ideational mechanisms of social change within a 
comprehensive model of social change, family behaviors, and outcomes. 
The modes of social organization framework can generate empirical pre- 
dictions across a wide variety of substantive domains. The framework 
allows models of childbearing to incorporate both microeconomic theories 
(Bulatao and Lee 1983; Caldwell 1982; Easterlin and Crimmins 1985) and 
ideational theories (Anderson 1986; Cleland and Wilson 1987; Montgom- 
ery and Casterline 1993). Equally important, the framework identifies 
other key mechanisms of social change that have not been applied in 
studies of childbearing behavior, such as the impact of family social or- 
ganization on social integration and childhood personality development 
(Yabiku et al. 1999). It is an ideal tool for studies of the link between 
macrolevel contextual change and individual behavior because it explicitly 
links macrolevel social change to individual childbearing behavior via 
the social organization of the family (Thornton and Fricke 1987; Thornton 
and Lin 1994). This has emerged as a fundamental element of a broad 
range of theories linking social change to fertility (Caldwell 1982; Coleman 
1990; Freedman 1979; Lesthaeghe and Wilson 1986; Notestein 1953). In 
sum, it is a flexible framework that can address a broad set of research 
problems for both sociologists and other social scientists and represents 
an important step forward in theoretical reasoning regarding the links 
between fertility and social change. 
This article uses a multistate hazard model to test the network
hypothesis of social capital theory. The effects of family network 
ties on individual migration are estimated while controlling for mea- 
sured and unmeasured conditions that influence migration risks for 
all family members. Results suggest that social network effects are 
robust to the introduction of controls for human capital, common 
household characteristics, and unobserved conditions. Estimates 
also confirm the ancillary hypothesis, which states that diffuse social 
capital distributed among community and household members 
strongly influences the likelihood of out-migration, thus validating 
social capital theory in general and the network hypothesis in 
particular. 
hypothesis of social capital theory offers a particular dilemma. Its leading
prediction is that people who are socially related to current or former 
migrants have access to social capital that significantly increases the like- 
lihood that they, themselves, will migrate. This hypothesis is not new. 
Indeed, it has a respectable historical tradition and continues to be invoked 
to explain the magnitude of migration flows as well as the concentration 
of certain types of migrants in particular locations. The logical and his- 
torical foundations of the hypothesis and a summary of a number of newer 
formulations and applications throughout the world are thoroughly cov- 
ered elsewhere (see Massey et al. [1998], for a review). 

Despite the fact that the hypothesis has been sustained in a surprisingly 
large number of studies and in diverse social and geographic settings, no 
test has yet established its veracity compared with other theories that 
predict the same outcomes. In this article, we employ an infrequently used 
model and statistical tool to conduct a systematic test of social capital 
theory, one that confirms the latter’s validity while simultaneously casting 
doubt on competing explanations. 


SOCIAL CAPITAL THEORY 


The economist Glenn Loury (1977) introduced the concept of social capital 
to designate a set of intangible resources in families and communities that 
help to promote the social development of young people, but it was the 
sociologist Pierre Bourdieu (1986) who pointed out its broader relevance 
to human society. According to Bourdieu and Wacquant (1992, p. 119), 
“Social capital is the sum of the resources, actual or virtual, that accrue 
to an individual or a group by virtue of possessing a durable network of 
more or less institutionalized relationships of mutual acquaintance and 
recognition.” 

The key characteristic of social capital is its convertibility—it may be 
translated into other forms of capital, notably financial (Harker, Mahar, 
and Wilkes 1990). People gain access to social capital through membership 
in interpersonal networks and social institutions and then convert it into 
other forms of capital to improve or maintain their position in society 
(Bourdieu 1986; Coleman 1988). Although Portes and Sensenbrenner 
(1993) point out that social capital may have negative as well as positive 
consequences, theorists have generally emphasized the positive role it 
plays in the acquisition and accumulation of other forms of capital (see 
Coleman 1990), an emphasis that has been particularly strong in migration 
research. 

Migrant networks are sets of interpersonal ties that connect migrants, 
former migrants, and nonmigrants to one another through relations of 
kinship, friendship, and shared community origin. Network connections
increase the likelihood of international migration because they lower the 
costs and risks of movement and increase the expected net returns to 
migration. Having a tie to someone who has migrated yields social capital 
that people can draw upon to gain access to an important kind of financial 
capital, that is, high foreign wages, which offer the possibility of accu- 
mulating savings abroad and sending remittances home. 

As early as the 1920s, sociologists recognized the importance of net- 
works in promoting international movement (see Thomas and Znaniecki 
1918-20; Gamio 1930). Although Taylor (1986, 1987) characterized net- 
work ties as a source of “migration capital,” Massey et al. (1987, p. 170) 
appear to have been the first to label migrant networks specifically as a 
source of social capital. Following Coleman’s (1990, p. 304) dictum that 
“social capital . . . is created when the relations among persons change 
in ways that facilitate action,” they identified migration itself as the cat- 
alyst for change. Everyday ties of friendship and kinship provide few 
advantages, in and of themselves, to people seeking to migrate abroad. 
Once someone in a person’s network migrates, however, the ties are trans- 
formed into a resource to gain access to foreign employment and the 
money that it brings. Each act of migration creates social capital among 
people to whom the new migrant is related, thereby raising their own 
odds of out-migration (Massey et al. 1987; Massey, Goldring, and Durand 
1994).*

Thus, although there are a number of alternative renditions of the same 
idea, the key hypothesis is that social network connections create conditions
that facilitate the migration of others (decreasing costs, augmenting
potential streams of future income, reducing risks, transmitting information).
As a result, individuals who are related to migrants will, ceteris
paribus, be more likely to migrate themselves. In what follows, we often
refer to the observable correlation of migration risks across members
of a social group as the “apparent” network effect since the correlation 
may also be observed in the absence of any social capital embedded in 
relations within a network, as described below. 


COMPETING EXPLANATIONS 


Despite the cogency of this argument, there are several plausible alter- 
native explanations for the fact that people related to migrants are more 


* Due to space constraints, we can only discuss a few testable propositions about 
migration risks derived from recent research on social networks as social capital. The 
literature is broad and rich and covers experiences in very diverse geographic settings.
For comprehensive reviews, see Massey et al. (1998) and Hugo (1981).
likely to migrate themselves. Whereas some of these explanations predict
a close association between the migratory behavior of individuals con- 
nected by close family or household ties, others account for the common- 
ality of migratory behavior within a broader set of people linked more 
loosely by kinship, friendship, or community origin. In either case, how- 
ever, the association of migration risks among individuals who belong to 
a social group is expected not—as the social capital hypothesis would 
have it—because the behavior of one influences the behavior of the others 
via the formation of social capital, but due to the influence of conditions 
that are shared by individuals in the group. In what follows, we review 
the most important competing explanations for apparent network effects. 


Human Capital 


To the extent that people living in the same social group share charac- 
teristics that influence the costs and benefits of international migration, 
the conventional human capital model predicts that migration decisions 
will be correlated among friends, relatives, and even community members. 
The key argument is that migratory behaviors are correlated because
group members share common characteristics and constraints that influence the
expected net return to migration, and, hence, the likelihood of its occurrence.
According to this line of reasoning, if one could somehow remove the 
influences of shared human capital characteristics, then the association 
between migratory behaviors of related individuals would be reduced or 
eliminated. 


Joint Decision Making 

Unlike the foregoing explanation, the model of family income maximi- 
zation assumes that household members jointly formulate a strategy to 
maximize household (rather than individual) income. The family collec- 
tively chooses members to move in a particular order so as to earn the 
highest total household income, yielding an apparent “chain migration 
effect,” whereby the migration of one household member seems to raise 
the likelihood that others will follow. In reality, however, the observed 
network effect does not stem from the effect of household members on 
one another, but from the correlation of behaviors within the household 
as a result of joint decision making to develop a common strategy that 
governs individual actions. 

A prominent version of this model, proposed by Borjas and Bronars 
(1991), assumes that household members jointly formulate an optimal 
allocation of family workers to potential productive activities, including 
migration. Depending on whether the income at potential destination 
areas is distributed more or less equally than in the origin area, the first
link in the migration chain is selected with great care. No matter which 
member goes first, however, the family knows that migration costs for 
subsequent members will be less than for the first migrant. A jointly 
maximizing family incorporates this knowledge into its decision before 
anyone migrates, picking an optimal chaining pattern that amortizes the 
costs of migration over all family members. 

Thus, the observed correlation of migratory behavior among individ- 
uals within the same household may only reflect the fact that they are 
responding jointly to common conditions that impinge on the household 
exogenously, yielding another version of the “common cause” hypothesis 
already mentioned. No value is assigned to network relationships them- 
selves. Rather, members of a household share an elevated risk of migration 
because they formulate a common strategy in response to a single set of 
economic exigencies, not because social ties facilitate migration. Note, 
however, that in contrast to human capital theory, this theory does not 
necessarily predict a positive correlation between the migration risks of 
different household members, as the coordination of behavior to maximize 
income could require some members to stay at home while others are 
selected to migrate. This may occur, for example, when a household owns 
a productive enterprise that calls for overseeing by trusted family mem- 
bers. In this case, some family members must remain while others will 
be free to migrate. 


Risk Diversification


In contrast to the neoclassical economic model developed by Borjas and
Bronars, the new economics of labor migration model proposed by Stark 
and others postulates that households operate not only to maximize in- 
come, but also to minimize risk (David 1974; Stark 1991). According to 
this conceptualization, migration offers a means of diversifying income 
to manage households’ risk exposure. In the same way that investors 
diversify their holdings to limit their exposure to loss, households diversify 
the allocation of workers to different productive activities in different 
places. The strategy requires only that earnings at points of origin and 
destination be uncorrelated, or better yet, inversely correlated. Given a 
negative association between business cycles in sending and receiving 
societies, a household will not be greatly harmed by recession at home, 
since one or more family members will be earning high wages abroad 
and can remit a portion of their earnings back to the household. 
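
The diversification logic can be illustrated with the standard formula for the variance of a sum of two income streams. This is a minimal sketch with hypothetical numbers. The function name and the equal standard deviations are assumptions made for illustration and are not drawn from the models cited above.

```python
import math


def total_income_sd(sd_home: float, sd_abroad: float, correlation: float) -> float:
    """Standard deviation of home plus abroad earnings for a given correlation.

    Uses Var(X + Y) = Var(X) + Var(Y) + 2 * rho * SD(X) * SD(Y).
    """
    variance = sd_home**2 + sd_abroad**2 + 2.0 * correlation * sd_home * sd_abroad
    # Guard against tiny negative values from floating-point rounding.
    return math.sqrt(max(variance, 0.0))


# Hypothetical earnings volatility of 10 units at origin and at destination.
for rho in (1.0, 0.0, -1.0):
    print(f"correlation {rho:+.1f}: total SD = {total_income_sd(10.0, 10.0, rho):.2f}")
```

With perfectly correlated earnings the household bears the full combined volatility. Uncorrelated earnings reduce it, and inversely correlated earnings reduce it further, which is why the strategy works best when business cycles at origin and destination move in opposite directions.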

Social networks render migration practical as a means of risk diversification
(Taylor 1986). When migrant networks are well developed, they
put a destination job within easy reach of most community members, 
making emigration a reliable and relatively risk-free resource (Massey et
al. 1987). As a result, migration is more likely under conditions of strong rather
than weak network ties. As in the Borjas and Bronars model, diversifi- 
cation may necessitate different timings of movement for different indi- 
viduals, possibly yielding a negative correlation between migration de- 
cisions within households. 


Selection 


One final explanation for network effects rests on the fact that people 
become enmeshed in social networks through nonrandom selection pro- 
cesses. Social and economic variables that determine a person's network 
membership simultaneously influence the propensity to migrate, thus cre- 
ating a spurious association between the two outcomes. According to this 
line of reasoning, the migration of one household member does not influ- 
ence others’ migration risks. Rather, the observed association is due to a 
common underlying process of selection. Such a mechanism is particularly 
plausible where there is a substantial amount of room for personal choice 
to operate, as in networks based on friendship or, to a lesser extent, on 
shared community of origin. It is much less likely that this mechanism 
will be of any significance when social networks are based on kin ties.²
As in the joint decision-making model, the selection hypothesis does not 
assign any intrinsic value to social relations themselves but underscores 
the importance of common underlying processes that simultaneously in- 
fluence decisions made by different family members. 


THE BURDEN OF PROOF 


Whereas social capital theory hypothesizes that movement by one person 
directly influences the odds of movement by others within the social 
network, we have specified four equally plausible mechanisms leading to 
the same prediction: that people within common social groupings are 
subject to common human capital influences; that moves may be coordinated
as a result of a joint household decision to maximize income; that
moves may be linked as a result of a joint household strategy to diversify
risk; and that moves are interrelated because factors that select individuals
into common social networks also select on the propensity to migrate.


² Admittedly, even within nuclear families where membership is not a matter of choice,
some selection forces can operate. In fact, it is known that health status conditions
are partially shared by members of the same family or those living in the same household
and, in turn, that health status affects the risks of migration. In this case, selection
of migrants on health status creates a correlation between their migration experience.
It should be noted, however, that this situation is indistinguishable from one where
family or household members have similar human capital, with health status being
one of the defining elements of human capital. It then follows that if we are able to
reduce the influence of unmeasured factors shared by members of the household, we
will simultaneously reduce selection effects due to health conditions.

How can we tell these alternative explanations apart? Inferences about 
network effects are typically based on qualitative or quantitative studies, 
which show that having a tie to a current or former migrant raises a 
person’s odds of out-migration, controlling for the influence of various 
individual, household, and community characteristics. In quantitative 
studies, for example, a dichotomous indicator of migration is regressed 
on a set of measured covariates plus one or more network indicators that 
are defined a priori—whether certain family members are current or past 
migrants, the number of friends or acquaintances who have ever migrated, 
the fraction of a community’s inhabitants with prior migrant experience, 
and so on. If the network indicator displays a positive association with 
the odds of out-migration, either in the cross section (Espinosa and Massey
1998) or longitudinally (Massey and Espinosa 1997), then ceteris paribus
one infers a network effect (i.e., that the social tie has operated directly
to promote the subject’s migration). This commonly used strategy, how- 
ever, has three distinct shortcomings. 


Spuriousness and Selection 


The conventional strategy does not rule out common effects. In order to 
infer the existence of a direct effect, as claimed by the social capital theory, 
it is essential to remove the influence of conditions that are common to 
individuals in a network. Since two individuals linked by kinship or 
friendship will typically share common characteristics that influence mi- 
gration, these must be controlled before any causal influence can be as- 
signed to the network tie per se. Although many characteristics are easily 
measured and can thus be included in statistical models as controls, in- 
evitably some common factors are not so easily measured (health status, 
attitudes, motivations, beliefs) and are not so easily subject to statistical 
control. In the presence of unmeasured heterogeneity, the usual method 
of inferring network effects is not sufficient to eliminate competing 
explanations. 

By the same token, rarely if ever are potential selection effects addressed 
at all. Although this is much less of a problem when social networks under 
observation are defined by household or kin ties, selection may have some
influence within networks that emerge in other social domains. 


Completeness


Without exception, conventional efforts to test the social capital/network 
model have relied on a single test to assess the direction and magnitude 
of the association between migratory behaviors of individuals within a 
social network. Yet the validity of social capital theory cannot rest on a 
single test. This is because the theory implies and predicts other empirical
regularities that should be assessed as well. Insofar as these predicted 
regularities are not observed, or if observed are inconsistent with com- 
peting theories, they can be used to falsify social capital theory or eliminate 
rival explanations.


Multiplicity of Social Networks 


Conventional strategies typically only probe the significance of one realm 
of social relations at a time. This practice is usually associated with short- 
comings in the information available to researchers and can lead to in- 
conclusive results, particularly when no tests of alternative predictions 
are simultaneously carried out. For example, even if all shared conditions 
among related individuals can be measured and statistically controlled, 
both the joint maximization and risk diversification perspectives still pre- 
dict a correlation between migratory behaviors within households. If these 
theoretical accounts are valid, some or all of the association of migration 
risks among members of a household-based network that remains after
controlling for shared conditions may simply reflect the fact that household 
members act collectively to derive a joint strategy of migration that they 
subsequently implement. 

Thus, even if it were possible to strip the observed relationship of the 
effects of shared conditions, methodologies typically used to infer network 
effects cannot eliminate the counter-hypotheses of risk diversification and 
joint decision making. A strong means of adjudicating between these 
counter-hypotheses and social capital theory is to demonstrate that, net 
of shared conditions, there is an association between migration risks of 
individuals who share the same social networks but not the same family
or households (see above). This is a demanding test because it requires 
information on network connections across several social domains. 

In the absence of conditions to implement a strong test, one can derive 
and verify the validity of corollaries from social capital theory that are 
not predicted by the joint-decision and risk diversification models. Thus, 
to the extent that we eliminate the second weakness—testing corollaries 
is a means of achieving completeness—we may also be able to eliminate 
or attenuate the relevance of the third shortcoming. It should be noted, 
however, that this is a weaker means of discriminating between theories 
than the one involving joint evidence from different social domains. In
what follows, we develop a model that enables us to bypass the spuri- 
ousness and selection shortcoming. Because we only have information 
pertaining to a single social domain (the family), we cannot eliminate the 
multiplicity of social networks shortcoming. Instead, we are able to test 
three corollaries from social capital theory, none of which is implied by
competing models. This is a solution to the completeness shortcoming and
provides a weak solution to the multiplicity of social networks problem.


MODELS AND ESTIMATIONS OF SOCIAL NETWORKS EFFECTS 


We adapt recently developed hazards techniques to derive models capable 
of eliminating rival explanations (Clayton 1978; Hougaard 1986; Clayton 
and Cuzick 1985; Yashin and Iachine 1997). These models permit us to
retrieve fixed and time-dependent effects on the joint migratory risks of 
two members of a social dyad while simultaneously controlling for the 
effect of unmeasured common conditions. They establish a relation be- 
tween the timing of movement by each party in the social relationship to 
four basic factors: (1) measured conditions and characteristics of each
individual; (2) common measured conditions and characteristics; (3) unmeasured
common conditions and characteristics; and (4) the effect of the
migration of one member of the pair on the timing of migration by the 
other. Our basic methodological problem is how to distinguish between 
these various effects, determine their direction, and estimate their 
magnitude. 

Although this problem is certainly not new in social science, its solution 
is not obvious and requires the application of special models and pro- 
cedures. Neither the theories discussed above nor the models we introduce 
below necessitate that we focus only on the timing of first migration, but 
doing so offers the advantage of not requiring us to model an interrelated 
sequence of events or to fine-tune data on the timing of first, second, third, 
and higher order moves. To be sure, a thorough test of competing expla- 
nations ultimately should examine such sequences and their interrelations. 
Our objective is more modest: we only evaluate whether or not the initial 
migration of one family member influences the timing of movement by 
another. 


A Naive Model 


Let us begin with the simplest case. Suppose that $Y_{ij}(t)$ is a dichotomous
variable representing the first migratory experience of individual i in social
group j at time t. It attains a value of “1” if the first migration occurred
by time t and “0” otherwise, where t represents the time elapsed since a
suitably defined point of origin for the first migration process. People 
within social group j are related to one another and could be expected to 
influence one another’s behavior. If one person in j migrates, then we 
hypothesize that the risks of first migration increase for other members 
of the social group because of the theoretically expected mutual influences 
derived from social capital theory. The social bonds that define member- 
ship in j constitute connections within the migrant network. 

For example, j may indicate membership in a household wherein people 
are related by kinship. Migratory behavior of various members of the 
household, husbands and wives, fathers and sons, brothers and sisters, 
and so on are thus related to each other. Given this conceptualization of 
network migration, we can specify the following simple model: 


$$Y_{ij}(t) = \alpha X_{ij} + \beta Z_{j} + \gamma M_{ij} + \varepsilon_{ij}, \qquad (1)$$


where $X_{ij}$ is a vector of characteristics for individual i in household j,
$Z_{j}$ stands for shared characteristics of the household that may affect migration
risks of all its members, $M_{ij}$ is a vector of indicators of migration
experience of other household members, $\alpha$, $\beta$, and $\gamma$ are vectors of effects,
and $\varepsilon_{ij}$ is an error term. The $\alpha$-effects are associated with individual
characteristics, the $\beta$-effects are associated with conditions shared by
members of the household, and the $\gamma$-effects measure the influence of
migration of other members of the household. That is, $\beta$-effects and $\gamma$-effects
are estimates of the contributions of shared conditions and family
networks, respectively.

The model presented in equation (1), however, presents a number of 
serious estimation problems. The most important is the likely existence 
of unmeasured characteristics correlated with $M_{ij}$. Relevant unmeasured
factors are common conditions that should have been included in the
subvector $Z_{j}$, but which have not been measured for some reason (e.g.,
cost, practicality, convenience). The consequence of such omissions is in- 
consistent estimates of y, the effects of migration experience among mem- 
bers in the kin network. More generally, the presence of unmeasured 
common causes makes infeasible the identification of social network 
effects. 
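
The consequence of omitting a shared condition can be made concrete with a short simulation. This is only a sketch under assumptions of our own choosing (a single unmeasured household factor W drawn from a standard normal distribution, threshold-crossing outcomes, and no measured covariates); it is not the estimation procedure used in this article. With one binary regressor and no controls, the least-squares slope equals the difference in mean outcomes between the two groups, so the naive estimate of $\gamma$ is computed that way.

```python
import random


def simulate_naive_gamma(n_households=20000, true_gamma=0.0, seed=1):
    """Naive estimate of gamma when an unmeasured household factor W drives
    both a relative's migration (M) and the subject's own migration (Y)."""
    rng = random.Random(seed)
    y_when_m1, y_when_m0 = [], []
    for _ in range(n_households):
        w = rng.gauss(0.0, 1.0)  # unmeasured shared condition (hypothetical)
        m = 1 if w + rng.gauss(0.0, 1.0) > 0 else 0  # relative migrated
        y = 1 if true_gamma * m + w + rng.gauss(0.0, 1.0) > 0 else 0
        (y_when_m1 if m else y_when_m0).append(y)
    # With a single binary regressor, the OLS slope is a difference in means.
    return sum(y_when_m1) / len(y_when_m1) - sum(y_when_m0) / len(y_when_m0)


# The true network effect is set to zero, yet the naive estimate is positive.
print(round(simulate_naive_gamma(), 3))
```

In this setup the estimate is well above zero even though no network effect exists by construction, which is the spurious association described above.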


A Bivariate Hazards Model 


Bivariate hazards models were developed to study two survival pro- 
cesses affected by common conditions as well as mutual influences. Sup- 
pose two individuals in a household have migration risks (or hazards) 
defined by $\mu_{1j}(t_1 \mid X_{1j}, Z_{j}, W)$ and $\mu_{2j}(t_2 \mid X_{2j}, Z_{j}, W)$. In this notation, $X_{ij}$ refers
to a vector of individual characteristics, either fixed or time-dependent,
that are associated with each member of the pair in household j; $Z_{j}$
includes common characteristics of the household or community, again 
either fixed or time-dependent; and W contains unmeasured fixed char- 
acteristics of the household. These risks are expressed by the following 
two equations: 


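$$\mu_{1j}(t_1) = \alpha_1 X_{1j} + \beta_1 Z_{j} + \theta W + \varepsilon_1, \qquad (2)$$

$$\mu_{2j}(t_2) = \alpha_2 X_{2j} + \beta_2 Z_{j} + \theta W + \varepsilon_2. \qquad (3)$$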


Although this model allows the estimation of effects ($\alpha$'s and $\beta$'s) net
of the influence of common measured and unmeasured characteristics, it 
suffers from two key limitations. First, using procedures developed by 
Clayton (1978) and Clayton and Cuzick (1985), the model is theoretically 
estimable only in the absence of reciprocal influences (Mare and Palloni 
1988). If such reciprocal influences are strong—and this is precisely the 
social capital hypothesis—then the estimates of the effects of covariates 
will be inconsistent. Second, in order to find a tractable solution, Clayton 
postulates a parametric form to describe the effect of unmeasured conditions,
W. The problem is that the estimates of the $\alpha$'s and $\beta$'s are very
sensitive to the actual specification of this distribution. 


A Flexible Multistate Model 


To estimate the effects of reciprocal behavioral influences in the process 
of first migration while simultaneously controlling for the influence of 
shared conditions, we propose a multiple hazards model. This model 
improves on the naive approach by virtue of its power to control for 
measured and unmeasured conditions. It improves upon the bivariate 
parametric approach in that it enables us to retrieve estimates of reciprocal 
influences that are not sensitive to parametric specification of unmeasured 
conditions. 

Again, we consider the case of paired household members and for 
simplicity work with the example of two siblings. As illustrated in figure 
1, at any time t, we can identify four distinct states with respect to the 
timing of first migration by two siblings: neither sibling has migrated, 
the oldest sibling has migrated but not the younger one, the youngest 
sibling has migrated but not the oldest, and both siblings have migrated. 
Hazards associated with flows into and out of the four states can be 
represented either parametrically or nonparametrically, and each hazard 
is defined as a function of both individual and shared characteristics: 
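
$$\mu_{1j}(t_1) = \alpha_1 X_{ij} + \beta_1 Z_{j} + W + \varepsilon_1, \qquad (4)$$
$$\mu_{2j}(t_2) = \alpha_2 X_{ij} + \beta_2 Z_{j} + W + \varepsilon_2, \qquad (5)$$
$$\mu_{3j}(t_3) = \alpha_3 X_{ij} + \beta_3 Z_{j} + W + \varepsilon_3, \qquad (6)$$
$$\mu_{4j}(t_4) = \alpha_4 X_{ij} + \beta_4 Z_{j} + W + \varepsilon_4, \qquad (7)$$
$$\mu_{5j}(t_5) = \alpha_5 X_{ij} + \beta_5 Z_{j} + W + \varepsilon_5, \qquad (8)$$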


where the hazards correspond to transitions among the four states just described, $X_{ij}$
refers to the measured characteristics of person i in household j, $Z_{j}$ indicates
measured common characteristics of household j, and W indicates
unmeasured household characteristics. As before, the $\alpha$'s and $\beta$'s are vectors
of parameters.

This representation yields an adequate basis for estimation provided 
we can resolve one minor difficulty. This occurs when the hazard for one 
member of the pair is effectively zero even though the hazard for the 
other is not—for example, if an older sibling is exposed to a nonzero risk 
of migrating but a younger sibling is not exposed at all (because he or 
she is too young). This circumstance violates the proportionality of hazards 
assumption and leads to inconsistent estimates. To resolve the problem, 
we start the clock of the process only after the youngest member of the 
sibling pair reaches the minimum age for migration, which we assume to 
be 15 (see below). 
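
The state space and the clock rule can be sketched in a few lines. The code below is illustrative only: it assumes annual data, takes the year in which the youngest sibling turns 15 as `clock_start`, and uses hypothetical first-trip years. The function and variable names are ours and do not come from the data files.

```python
# States: 1 = neither sibling has migrated, 2 = only the older sibling has,
# 3 = only the younger sibling has, 4 = both have migrated.
ALLOWED = {1: (2, 3, 4), 2: (4,), 3: (4,), 4: ()}


def pair_state(year, older_trip, younger_trip):
    """State occupied by a sibling pair in a given year.

    A first-trip year of None means the sibling never migrated."""
    older = older_trip is not None and older_trip <= year
    younger = younger_trip is not None and younger_trip <= year
    if older and younger:
        return 4
    if older:
        return 2
    if younger:
        return 3
    return 1


def pair_path(clock_start, survey_year, older_trip, younger_trip):
    """Sequence of distinct states from the start of the clock to the survey."""
    path = [pair_state(clock_start, older_trip, younger_trip)]
    for year in range(clock_start + 1, survey_year + 1):
        state = pair_state(year, older_trip, younger_trip)
        if state != path[-1]:
            if state not in ALLOWED[path[-1]]:
                raise ValueError("transition not allowed")
            path.append(state)
    return path


print(pair_path(1980, 1990, 1984, 1987))  # [1, 2, 4]
print(pair_path(1980, 1990, 1975, None))  # [2]
print(pair_path(1980, 1990, 1985, 1985))  # [1, 4]
```

The second call shows a pair that is already in state 2 when the clock starts because the older sibling left before the younger one became exposed to risk.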

This model specification has several appealing features. First, because 
the units of observation are pairs of actors rather than individuals (e.g., 
older and younger siblings), the representation of hazards tolerates the 
inclusion of unobserved characteristics associated with the pair, thus fa- 
cilitating estimation while controlling for shared unmeasured conditions. 
Second, the unmeasured component can be represented either as para- 
metric or nonparametric rather than being constrained to a narrow range 
of parametric forms (as in the bivariate hazard model). Third, and most 
important, the effects of the timing of the event for one member of the 
pair on the timing of the event for the other can be evaluated in rather 
simple ways. For example, if the initial migration of the oldest sibling 
has an important influence on the timing of first migration of a younger 
sibling, then the difference between the younger sibling's hazards of first
migration before and after the older sibling's move should be discernible
from the parameters for the corresponding baseline hazards (when all
relevant characteristics and their effects are the same across siblings and
transitions). Alternatively, we can use global likelihood ratio statistics to
test the equality of parameters for the two baseline hazards.


Fig. 1. A multistate representation of siblings' migration


Using the Multistate Model to Discriminate between Competing 
Theories 


The data available to us pertain to households. As a consequence, we
apply the multistate model described above to estimate social networks 
effects in only one social domain, the coresidential family. In particular, 
we investigate the effects of one sibling’s migration on other siblings’ 
migration risks. The multistate model enables us to estimate these effects 
while controlling for measured characteristics reflecting human capital (of 
siblings and households) and for unmeasured conditions shared by sib- 
lings. Thus, the estimated effects are net of the influence of spurious 
association and of selection effects triggered by some or all of the un- 
measured shared conditions (see n. 2). The direction of these effects, their 
magnitude, and their significance provide evidence for social networks 
effects and enable us to test social capital theory. 

Because of the nature of the data, however, we cannot deploy a strong 
test to adjudicate between social capital theory, on the one hand, and 
joint decision-making and risk diversification theories, on the other. This 
strong test requires us to show that social networks effects prevail in some 
social domains other than the family. We can, however, use a weak test 
and check the validity of corollaries derived from social capital theory 
that are not consistent with competing theories. This is a minimum
requirement to distinguish social capital theories from the other theories.

The first corollary can be formulated as follows: to the extent that social 
capital can be deployed to resolve problems faced by migrants, apparent 
network effects should be greater during periods when migration becomes 
more costly and difficult. In particular, if social capital indeed becomes 
more valuable during periods of tighter border enforcement, we should 
observe larger effects of social networks. In the case of Mexico-U.S. migration,
for example, we would gain greater confidence in our hypothesis
if the size of network effects increased after the passage of the Immigration 
Reform and Control Act of 1986, which launched a substantial build-up 
of enforcement resources along the Mexico-U.S. border, criminalized the 
hiring of undocumented migrants, and authorized the U.S. Department 
of Labor to expand internal inspections (Singer and Massey 1998; Phillips 
and Massey 1999).* 

The second corollary regards the influence of social capital located 
outside the household on individual migration risks. Individuals who live 
in communities where migration is more prevalent are more likely to 
participate in nonfamily networks involving migrants and, consequently, 
to tap sources of social capital located outside the household. It follows 
that the persistence of effects of community migration prevalence on in- 
dividual migration risks (after controlling for measured and unmeasured 
individual and household conditions) is prima facie evidence of the im- 
portance of social capital over and above factors implied by the joint 
decision-making and risk diversification theories. Indeed, neither of these 
perspectives predicts an effect for social capital located outside the house- 
hold (once household conditions are controlled). 

The third corollary involves the migration experience of the household 
head. As mentioned before, the correlation of migration risks across sib- 
lings, net of the effects of measured and unmeasured conditions, is also 
expected under the joint household and risk diversification theories be- 
cause under these theories risks of migration are subject to household 
coordination. However, if we are able to control for migration behavior 
of other members of the household, we would expect that the apparent 
social network effects disappear: if the only factor accounting for corre- 
lation between migration risks of any two siblings is the existence of 
household coordination, it should vanish once we control for the migration 
behavior of all household members. This is a tall order, so we only pursue
a shortcut and control for migration behavior of the father. Under social
capital theory, father’s previous migration also creates social capital and
should exert an effect on the migration risks of both siblings, but it should
not do so by attenuating the effects of one sibling’s migration on the
migration risks of the other.


* Full documentation of these data, the questionnaires, and the samples, along with
the files themselves, is available from the MMP Web site: http://www.pop.upenn.edu/mexmig/

In what follows, we show that our estimates support the existence of 
social networks effects. We do not take this as evidence that human capital 
theory is irrelevant, but as an indication that social capital is also a stra- 
tegic condition promoting the process of migration. Similarly, because we 
are not able to implement a strong test to discriminate between social 
capital and the other competing theories, we can only claim that the 
observed social network effects are weakly distinguishable from effects 
that would be observed if the processes of joint decision making or risk 
diversification took place in the absence of social capital effects.


DATA, MEASURES, AND METHODS 


Our data come from the Mexican Migration Project (MMP), whose database
at the time of the analysis included samples of 39 communities located 
in the states of Jalisco, Michoacán, Guanajuato, Nayarit, and Zacatecas. 
Together, these states constitute a region (Western Mexico) that historically 
has sent a majority of migrants to the United States (Durand, Massey, 
and Zenteno 2001). The data set also includes one additional community 
from the state of Guerrero, a newer migrant-sending location in the central 
region to the south of Mexico City (other communities from this region 
are in the process of being added to the file). 


Characteristics of the Data 


Respondents were interviewed in 1982-83 and in successive years from 
1987 to 1995 using an ethnosurvey questionnaire that collected infor- 
mation about the social, economic, and demographic characteristics of 
the head, the spouse, the head’s children, and other household members 
(see Massey and Zenteno 2000). Information was compiled for all children 
of the household head regardless of age or where they lived (determining 
independently whether each child still lived in the respondent household). 
Among the data gathered from each son or daughter was the date of his 
or her first trip to the United States. Each household head also provided 
a complete life history that included separate histories of marriage, fertility, 
labor, home ownership, land ownership, and business ownership. 
Within each community, the typical sample consisted of 200 households, 
although in smaller settlements, fewer households were chosen, and in 
some cases, larger samples were compiled. Sampling frames were constructed
by conducting a house-to-house census at each site. Usually an
entire town or city was canvassed, but in large urban areas, this was not 
possible and specific working-class neighborhoods were demarcated and 
sampled instead. Sampling fractions ranged from 0.029 to 0.803 and averaged
about 0.228. Refusal rates varied from 0 to 0.152 and averaged
0.062. Higher refusal rates generally occurred because of local political
circumstances rather than suspicions about the study per se. 

In choosing the communities, investigators sought to include a range 
of population sizes, ethnic compositions, and economic bases. Commu- 
nities were not chosen because they were thought to contain U.S. migrants, 
and the data set in fact includes a wide range of migratory prevalence 
ratios, ranging from one community where just 9% of adults have been 
to the United States to another where 60% have migrated (Massey et al. 
1994). Although the sample is not strictly representative of the states of 
western Mexico, it nonetheless incorporates a broad cross section of house- 
holds and communities in the region and yields a sample of U.S. migrants 
whose characteristics are remarkably similar to those enumerated in rep- 
resentative surveys (Zenteno and Massey 1999; Massey and Zenteno 
2000). 

Since our method focuses on the migratory behavior of sibling pairs, 
we select all households containing at least two siblings over the age of 
15, assuming that beyond this age people tend to make their own migra- 
tion decisions rather than simply following their parents. From these 
households, we select the oldest sibling as a reference point and randomly 
choose a younger sibling for the second person in the pair. To the extent 
that the age distribution of siblings in a household is related to their 
migratory behavior (via fertility effects or the age distribution of parents), 
our sample of pairs may be biased somewhat by selection, but we do not 
see this as a serious problem. We limit our analysis to siblings enumerated 
in Mexico to yield comparable measures of employment and occupational 
status for all brothers and sisters. Cases where one of the sibling pair was 
born in the United States were excluded to eliminate return migration as 
an extraneous effect. Finally, so as to focus on recent migratory experience
and minimize recall error, we deleted cases where the oldest sibling was
over age 50.


* This corollary is not only a logical deduction from social capital theory, but is also
suggested by past research (Donato, Durand, and Massey 1992; Massey and Espinosa
1997; Singer and Massey 1998). Accordingly, restrictive policies implemented in the
wake of the Immigration Reform and Control Act of 1986 (IRCA) have had little effect
in reducing the odds of leaving on an initial undocumented trip, taking an additional
trip without documents, or crossing the border surreptitiously. These outcomes could
only occur if the effect of social networks became stronger after IRCA’s implementation,
thus offsetting the increase in detection equipment and enforcement personnel
along the border. To test this idea, we will divide years of observation into those before
IRCA (1986 or earlier) and those afterward (1987+) and check whether or not the
effect of having a tie to a migrant sibling increases over time.


TABLE 1
OBSERVED TRANSITION MATRIX OF MALE SIBLING PAIRS SELECTED
FROM 39 MEXICAN COMMUNITIES


                                      STATE OF DESTINATION
STATE OF ORIGIN                     (1)     (2)     (3)     (4)

Frequencies:
  1. Neither brother migrated     2,248     324     342      24
  2. Older brother migrated         ...     438       0     206
  3. Younger brother migrated       ...     ...     307      35
  4. Both brothers migrated         ...     ...     ...     265
Crude probabilities:
  1. Neither brother migrated      .765    .110    .116    .008
  2. Older brother migrated         ...    .680    .000    .320
  3. Younger brother migrated       ...     ...    .898    .102
  4. Both brothers migrated         ...     ...     ...    1.00
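
The age-based part of the pair selection rule can be sketched as follows. This is a simplified illustration with hypothetical inputs (a plain list of sibling ages per household); it ignores the exclusions based on place of enumeration and place of birth, and the function name and arguments are ours rather than part of the MMP files.

```python
import random


def select_pair(sibling_ages, rng, min_age=15, max_oldest_age=50):
    """Return (oldest_age, younger_age) for a household, or None if excluded.

    Keeps households with at least two siblings over min_age, takes the
    oldest as the reference person, draws one younger sibling at random,
    and drops the household if the oldest sibling is over max_oldest_age."""
    eligible = sorted((a for a in sibling_ages if a > min_age), reverse=True)
    if len(eligible) < 2 or eligible[0] > max_oldest_age:
        return None
    return eligible[0], rng.choice(eligible[1:])


rng = random.Random(0)  # fixed seed so the random draw is reproducible
print(select_pair([40, 30, 20], rng))  # oldest is 40; the other is 30 or 20
print(select_pair([14, 20], rng))      # None: only one sibling over 15
print(select_pair([55, 30, 20], rng))  # None: oldest sibling over 50
```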

These deletions give us a sample of 3,258 sibling pairs. Because we 
know the date of each person’s first trip to the United States, we can 
derive a transition matrix that counts the frequency of moves associated 
with each pathway depicted in figure 1. The frequencies of this transition 
matrix are displayed in the upper panel of table 1. Beginning when the 
youngest sibling turns 15, we follow both persons in the pair and observe 
the timing of first migration. Cases of left censoring (where the older 
sibling migrated before the younger one reached age 15) originate in state 
2 and do not violate the assumption of proportional hazards. Of these 
left-censored cases, 204 remained in state 2 for the entire observation 
period and 90 proceeded to state 4 (i.e., the younger sibling migrated). 

In total, we counted 4,189 interstate transitions among sibling pairs 
during the observation period. The diagonal of the transition matrix con- 
tains instances of state immobility. In 2,248 cases, neither sibling migrated 
between the time the youngest turned 15 and the survey date. In 265 
cases, both siblings migrated in the same year; in 307 cases, the younger
sibling migrated first whereas the older never left; and in 438 cases, the 
older sibling migrated first while the younger never left. 

The off-diagonal cells, in contrast, indicate moves between states during 
the observation period. Of the 931 changes of state catalogued in the 
observation period, there were 324 moves from state 1 to state 2 (neither 
migrated to oldest migrated), 342 moves from state 1 to state 3 (neither 
migrated to youngest migrated), and 24 moves from state 1 to state 4 
(neither migrated to both migrated). In addition, 206 moved from state 
2 to state 4 (youngest followed oldest), and 35 went from state 3 to state
4 (oldest followed youngest). 
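
The counts reported in this passage can be checked directly against the upper panel of table 1. The short script below is a bookkeeping sketch: the dictionary simply transcribes the published frequencies, keyed by (state of origin, state of destination), and the variable names are ours.

```python
# Frequencies from the upper panel of table 1, keyed by (origin, destination).
FREQ = {
    (1, 1): 2248, (1, 2): 324, (1, 3): 342, (1, 4): 24,
    (2, 2): 438, (2, 3): 0, (2, 4): 206,
    (3, 3): 307, (3, 4): 35,
    (4, 4): 265,
}

total = sum(FREQ.values())
# Diagonal cells: pairs that stayed in their state of origin.
stayers = sum(n for (origin, dest), n in FREQ.items() if origin == dest)
# Off-diagonal cells: changes of state during the observation period.
movers = total - stayers

print(total, stayers, movers)  # 4189 3258 931
```

The diagonal total matches the 3,258 sibling pairs in the sample, and the off-diagonal total matches the 931 changes of state.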

Dividing cell frequencies by the row totals yields crude probabilities 
associated with each transition. These are shown in the lower panel of 
table 1. Among those pairs reaching age 15 without either sibling mi- 
grating (yielding no social capital for either person to draw upon), younger 
and older siblings had roughly the same likelihood of migrating: the prob- 
ability that the older sibling left first was 0.110, whereas the likelihood 
that the younger left first was 0.116. Having an older migrant sibling, 
however, almost triples the likelihood that the younger sibling will migrate, 
raising the exit probability from 0.116 to 0.320. This higher risk of out- 
migration means that having a tie to an older migrant sibling significantly 
lowers the waiting time to first migration. The effect of a network tie to 
a migrant sibling appears to be asymmetrical with respect to age, however, 
as having a younger sibling with U.S. experience does not increase the 
likelihood of out-migration for the older sibling. Indeed, at 0.102, the 
probability remains roughly the same as for those lacking social capital 
entirely (0.110 and 0.116). 
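
The arithmetic behind these crude probabilities can be sketched in a few lines of Python. The sketch assumes that each row total equals the number of pairs that never left the origin state plus the moves out of that state, and that the immobility counts reported above map onto states 1 to 3 as noted in the comments. The variable and function names are illustrative and are not part of the original analysis.

```python
# Counts of sibling pairs taken from the text. The assignment of the
# diagonal (immobility) counts to origin states is an assumption.
stayed = {1: 2248, 2: 438, 3: 307}  # pairs that never left the origin state
moves = {
    (1, 2): 324,  # neither migrated -> older migrated
    (1, 3): 342,  # neither migrated -> younger migrated
    (1, 4): 24,   # neither migrated -> both migrated
    (2, 4): 206,  # younger followed older
    (3, 4): 35,   # older followed younger
}


def crude_probabilities(stayed, moves):
    """Divide each off-diagonal count by the total for its origin state."""
    row_totals = dict(stayed)
    for (origin, _dest), count in moves.items():
        row_totals[origin] += count
    return {pair: count / row_totals[pair[0]] for pair, count in moves.items()}


probs = crude_probabilities(stayed, moves)
for (origin, dest), p in sorted(probs.items()):
    print(f"{origin} -> {dest}: {p:.3f}")
```

Under these assumptions the ratios agree, to three decimals, with the probabilities cited above for the 1 to 2, 1 to 3, 2 to 4, and 3 to 4 transitions.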

In general, these transition probabilities are consistent with the network 
hypothesis of social capital theory. As we have pointed out, however, they 
are also consistent with other plausible explanations. In order to isolate 
the independent effect of social capital, we need to control for the effects 
of individual and shared characteristics using the multistate hazard model 
derived above. To identify the model in equations (4)–(7), we selected a
set of tractable indicators of individual and shared characteristics from 
the MMP data set. 


Measures 


Table 2 shows means and standard deviations for individual character- 
istics of the siblings, including gender, education, and occupational status. 
Table 3 displays means and standard deviations for shared conditions. 
These include traits of the household head that are assumed to be fixed 
(gender, education, and occupational status), as well as time-varying in- 
dicators of household wealth (ownership of farmland, real estate, or busi- 
ness enterprises). 

To assess variability of individual human capital, we use the educational
and occupational status of siblings. A comparison between figures in tables 
2 and 3 shows that educational levels improve sharply between parental 
and sibling generations. Whereas nearly two-thirds of household heads 


* These probabilities are “crude” because they are calculated without proper
adjustments for competing risks.

TABLE 2
INDIVIDUAL CHARACTERISTICS OF PERSONS IN SIBLING PAIRS SELECTED
FROM 39 MEXICAN COMMUNITIES


                           YOUNGER SIBLING       OLDER SIBLING
VARIABLE                    Mean      SD          Mean      SD

Gender:
  Male ..................   .502     .500         .510     .500
Education:
  0–3 years .............   .153     .360         .211     .408
  4–9 years .............   .615     .487         .534     .499
  10+ years .............   .232     .422         .256     .436
Occupational status:
  Unemployed ............   .313     .464         .336     .473
  Unskilled .............   .236     .425         .220     .415
  Skilled ...............   .299     .458         .370     .473


NOTE: For fixed covariates evaluated at the time of the baseline survey,
gender = 1 if male, 0 otherwise; education = 1 if 4 or more years, 0 otherwise;
occupation = 1 if skilled, 0 otherwise. N of sibling pairs = 3,258.


(64.9%) had three or fewer years of schooling, among their offspring, only 
21.1% of older siblings and 15.3% of younger siblings had such low levels 
of education. Likewise, whereas roughly a quarter of all siblings had 10 
or more years of schooling (23.2% of the younger ones and 25.6% of those 
who were older), the figure for household heads was only 5.2%. By con- 
trast, the distribution by occupation is more favorable to fathers than 
siblings, probably reflecting longer labor force experience among parents. 
In fact, a substantial number of the siblings are still only teenagers, so it
should not surprise us that their unemployment rates are much higher
than their parents’. Whereas roughly a third of the siblings were unemployed
at the time of the survey (31.3% of the younger and 33.6% of the
older siblings), only 23% of household heads lacked a job. Nearly half of
all household heads (49.4%) held a skilled occupation, compared with
only 37% of older siblings and 30% of younger siblings.⁷


7 In these analyses, we consider employment and occupational status as a single
variable because the structure of the data prevents their separation. The
“unemployed” category includes not only those who are jobless but looking for
work but also homemakers, students, and people out of the labor force for other
reasons. Siblings’ and parents’ occupational status at the time of interview
are not the ideal measures of human capital we seek since they may reflect the
acquisition of skills that resulted from migration itself. To the extent that
improvements in occupational skills are related to underlying abilities and
resourcefulness, however, the indicator will perform its role, albeit crudely.
If, however, changes are strongly related to migration experience and only
partially reflect the influence of innate skills and abilities, the effects of
this variable will be inconsistent. Our efforts to include a time-dependent
version of individuals’ occupation were not successful due to the relatively
large number of cases with unknown values for periods

TABLE 3
HOUSEHOLD CHARACTERISTICS OF SIBLING PAIRS SELECTED
FROM 39 MEXICAN COMMUNITIES


Variable                              Mean      SD

Head’s gender:
  Male ............................   .850     .150
Head’s education:
  0–3 years .......................   .649     .477
  4–9 years .......................   .299     .458
  10+ years .......................   .052     .222
Head’s occupational status:
  Unemployed ......................   .229     .420
  Unskilled .......................   .277     .448
  Skilled .........................   .494     .500
Household wealth:
  Owns farm land ..................   .156     .363
  Owns real estate ................   .671     .470
  Owns business ...................   .361     .480
Social networks:
  Father a migrant ................   .350     .102
  Prevalence in community .........   .420     .110


NOTE: For fixed covariates, head’s gender = 1 if male, 0 otherwise;
head’s education = 1 if 4 or more years, 0 otherwise; head’s occupation
[...] experienced migration; prevalence of migration = 1 if prevalence of
adult migration in community exceeds [...]. N of sibling pairs = 3,258.


Although gender is not a human capital variable proper, it represents 
an important individual characteristic that proxies for the different roles 
assigned to males and females in Mexican society (Lewis 1960, pp. 54–68;
Diaz 1966, pp. 76–93; Foster 1967, pp. 55–86). Traditionally, it is men
who migrate first, and when women do migrate, they typically go either
as members of a couple or a larger family unit (Massey et al. 1987; Rouse
1991, 1992; Durand and Massey 1992). Thus, although having a Y chromosome
does not endow individuals with naturally low or high levels
of human capital, female gender nonetheless operates as a constraint
by raising the emotional and social costs of migration (Alarcón 1992;
Hondagneu-Sotelo 1994; Goldring 1995). As expected, the gender distribution
of siblings is relatively balanced.

Age is a potentially important individual variable because it reflects 


before the baseline survey. Another limitation of these measures of social
capital is that we cannot gauge joint effects of quality and quantity of social
capital since we do not have suitable measures to do so.

accumulated experience and belongs in the model as a dimension of human
capital. The ages of individual siblings are not entered as covariates
when modeling transitions originating in state 1 because the process is
assumed, by definition, to start at age 15. Thus, the baseline risk reflects
effects of age, including those associated with human capital traits that
are not measured by education or occupation. Preliminary analyses with
controls for sibling’s age at first trip in transitions originating in states 2
and 3 showed that its effects were trivial, and these controls were dropped
from further consideration.⁸

As argued earlier, gender captures unmeasured factors likely to influ- 
ence the propensity to migrate. In keeping with Mexico’s patriarchal 
culture, the vast majority of household heads in our sample are male. 
Typically females are designated as heads only when the male is absent 
because of death, divorce, or abandonment, leaving the household vul- 
nerable to risk of poverty. Other things equal, therefore, one might expect 
siblings from female-headed households to experience a higher risk of out- 
migration. 

Three of the common conditions are time-varying covariates. First, 
timing of father’s migration is entered to check the validity of the second 
corollary. It is also a condition that affects the social capital for children 
in the household and is thus expected to have its own influence on migration
risks.⁹

Second, to test for the validity of the third corollary, we include a 
measure of the prevalence of migration within the community. The MMP 
data allow the computation of a time-varying estimate of the proportion 
of community members age 15+ who have ever been to the United States 
(see Massey, Goldring, and Durand 1994). From this information, we are 
able to construct a time-dependent covariate that attains a value of “1” 
if, at the beginning of any calendar year of exposure to migration risks, 
the prevalence of migration experience among those age 15+ in the community
of residence exceeded the first quartile of the distribution. We use a
time-dependent covariate because aggregate migration
experience in the community changes over time as a result of individual 
migration experiences. The corollary asserts that if social capital theory 
holds, prevalence of migration in the community should exert an influence 


8 Other individual variables of theoretical interest that proved to be of
trivial empirical significance in the models are birth order of the youngest
child and documentation status (on first trip) of either sibling. They too were
dropped from consideration.

9 Note that father’s migration is one way of producing a female-headed household;
therefore, it is a necessary control to assess the net effects, if any, of female headedness.
As was the case for siblings’ documentation on first trip, father’s documentation was
dropped from the analysis since it did not contribute significantly to it.

on the individual propensity to migrate over and above the effects of
network ties within the household.¹⁰
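
A minimal sketch of how such a time-dependent prevalence indicator could be coded is given below. The community labels and prevalence values are hypothetical, and the choice to take the first quartile over the pooled community-years is an assumption, since the text does not say which distribution supplies the cutoff.

```python
from statistics import quantiles

# Hypothetical data: share of community members aged 15+ who had ever been
# to the United States, measured at the start of each calendar year.
prevalence = {
    ("A", 1980): 0.02, ("A", 1981): 0.05, ("A", 1982): 0.09,
    ("B", 1980): 0.12, ("B", 1981): 0.15, ("B", 1982): 0.21,
}

# Assumption: the cutoff is the first quartile of the pooled community-years.
first_quartile = quantiles(list(prevalence.values()), n=4)[0]


def high_prevalence(community, year):
    """Return 1 if prevalence at the start of the year exceeds the cutoff."""
    return int(prevalence[(community, year)] > first_quartile)


# One indicator value per community-year, so the covariate can change over time.
indicator = {key: high_prevalence(*key) for key in prevalence}
for (community, year), value in sorted(indicator.items()):
    print(community, year, value)
```

In this toy example community A switches from 0 to 1 as its migration experience accumulates, which is the kind of change the time-dependent coding is meant to capture.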

Third and finally, possession of assets in the form of property is introduced
as a control since such a household characteristic could inhibit migration
among some of the children (the youngest in a stem-family system)
and promote it among others (the older ones). Since assets are subject to 
change as a function of migration experience, the variables for possession 
of property are time dependent and defined as of the beginning of each 
year of exposure to migration. 


Model Estimation 


We estimate the model using CTM, a maximum-likelihood program de- 
veloped by George Yates, James Heckman, and James Walker precisely 
to estimate generalized continuous hazard models. We employ a Weibull 
baseline hazard to represent the time dependence of the risk of migrating, 
a flexible, monotonic functional form that requires only a level and a slope 
parameter. To test the sensitivity of our estimates to this functional form, 
we tried different specifications (piecewise exponential, Gompertz, and 
quadratic functions), but our main results proved robust to changes in 
the baseline hazard. 

The multistate model posits different baselines and effects for each 
transition. Since there are five different transitions, the simplest model 
requires 10 parameters to describe all five baseline hazards. For each 
individual variable we enter, there are potentially 10 different effects (one 
for each sibling in each of five transitions), whereas for each shared con- 
dition there are five parameters to estimate (one per transition). Estimates 
can thus proliferate very rapidly, but this turns out to be completely 
unnecessary. In fact, extensive tests of different model specifications 
showed that constrained models offer a parsimonious representation of 
the data. Constrained models are those where the effects of individual 
and shared characteristics are invariant across transitions and where the 
effects of an individual’s characteristics on his or her own migration risks
are the same as those on the sibling’s migration risks. 
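
The parameter bookkeeping described above can be checked with a short sketch. It assumes three individual indicators (gender, education, occupation) and six shared conditions (head's gender, education, and occupation plus three property dummies), which is one reading of the variables listed in tables 2 and 3; the function name and its arguments are illustrative only.

```python
N_TRANSITIONS = 5            # 1->2, 1->3, 1->4, 2->4, 3->4
BASELINE_PER_TRANSITION = 2  # Weibull level and slope


def n_parameters(n_individual, n_shared, constrained, separate_by_sibling=False):
    """Count baseline parameters plus covariate effects for one specification."""
    baseline = N_TRANSITIONS * BASELINE_PER_TRANSITION
    if constrained:
        # One effect per covariate, invariant across transitions and siblings.
        return baseline + n_individual + n_shared
    per_individual = N_TRANSITIONS * (2 if separate_by_sibling else 1)
    return baseline + per_individual * n_individual + N_TRANSITIONS * n_shared


# Baseline-only model: ten parameters for five Weibull hazards.
print(n_parameters(0, 0, constrained=False))
# One individual variable with separate effects per sibling adds ten effects.
print(n_parameters(1, 0, constrained=False, separate_by_sibling=True))
# Human capital model, effects equal across siblings: unconstrained vs. constrained.
print(n_parameters(3, 0, constrained=False), n_parameters(3, 0, constrained=True))
# Adding the six shared conditions.
print(n_parameters(3, 6, constrained=False), n_parameters(3, 6, constrained=True))
```

Under these assumed covariate counts, the totals for the human capital and shared-conditions specifications coincide with the numbers of estimated parameters listed for models 2 and 3 in table 4.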


10 In early analyses, we also introduced controls for community size as another common
characteristic, but these were ultimately dropped for lack of significance.

TABLE 4
BASIC RESULTS FOR MULTISTATE HAZARD MODELS OF INCREASING COMPLEXITY


                                       Parameters      Log
Model                           N      Estimated    Likelihood     χ²     df   p-value

Model 1 (baseline) .........  3,258       10         −2,439.0
Model 2 = 1 + human capital:
  Unconstrained ............  3,258       25         −2,215.6
  Constrained ..............  3,158       13         −2,221.2     11.6    12     .59
Model 3 = 2 + shared conditions:*
  Unconstrained ............  3,180       55         −2,205.3
  Constrained ..............  3,180       19         −2,208.1      5.6    36     .99
Model 4 = 3 + father’s migration and prevalence:*
  Unconstrained ............  3,180       63         −1,976.1
  Constrained ..............  3,180       21         −1,983.2     14.2    42     .98
Model 5 = 4 + unmeasured heterogeneity:
  Unconstrained ............  3,180       65         −1,949.1
  Constrained ..............  3,258       23         −1,958.2     18.2    42     .98


NOTE: df and χ² reported only when goodness of fit of alternative models is
carried out (unconstrained versus constrained models). A χ²-based
goodness-of-fit test across models can also be performed except for model 5,
within which none of the others nests. Constrained models force estimates of a
characteristic to be identical across all transitions. Individual
characteristics are constrained to have identical effects across siblings from
the start (tests not shown).

* Models 4 and 5 disregard transition 1 to 4.

† Models 4 and 5 are estimated on a reduced number of cases, a result of
deletions due to missing information over time on shared characteristics,
father’s date of first migration, or community of residence.


RESULTS 
Testing Model Constraints 


Table 4 displays values of the log-likelihoods, chi-square statistics, degrees 
of freedom, and p-values for the main models we estimate.¹¹ The first
model is our baseline model, and it only includes parameters for the five 
baseline hazards. The second model adds indicators of human capital; 
the third adds conditions shared by the siblings; the fourth adds two 
dichotomous (and time-varying) indicators, one for timing of father’s mi- 


11 All our tests are based on likelihood ratio statistics, a reasonable choice
when testing hypotheses that involve nested models. Alternative statistics
(Akaike criterion and BIC) were also calculated but produced identical results,
and we do not show them here. Although we are able to retrieve estimates for
all five transitions in models 1–3, the increasing complexity of models 4 and 5
leads to unstable estimates for the parameters of the transition from state 1
to state 4. This is because of the very small number of events associated with
the transition (see table 1). Throughout our discussion, we omit display of
estimates for the transition even when it was possible to obtain them.

TABLE 5
ESTIMATES OF PARAMETERS FOR BASELINE HAZARDS


                                       TRANSITION
                   1 to 2     1 to 3     2 to 4     3 to 4     1 to 4

Model 1:
  Intercept .....  −2.25*     −1.79*      −.73*     −1.96*     −5.00*
                    (.09)      (.07)      (.12)      (.27)      (.40)
  Slope .........   −.31*       .08       −.17*      −.10       −.41
                    (.06)      (.06)      (.08)      (.18)      (.23)
Model 2:
  Intercept .....  −2.66*     −2.28*     −1.13*     −2.21*     −5.00*
                    (.10)      (.09)      (.12)      (.26)      (.29)
  Slope .........   −.25*       .16       −.07       −.06       −.41
                    (.06)      (.06)      (.07)      (.18)      (.23)
Model 3:
  Intercept .....  −2.92*     −2.53*     −1.33*     −2.62*     −5.00*
                    (.06)      (.06)      (.23)      (.68)      (.39)
  Slope .........   −.25*       .16*      −.07       −.02       −.41
                    (.06)      (.06)      (.07)      (.18)      (.23)
Model 4:
  Intercept .....  −3.07*     −2.71*     −2.09*     −2.62*
                    (.16)      (.15)      (.17)      (.35)
  Slope .........   −.21*       .18*      −.07       −.09
                    (.06)      (.06)      (.08)      (.18)
Model 5:
  Intercept .....  −3.08*     −1.59*      −.11      −2.79*
                    (.18)      (.23)      (.36)      (.36)
  Slope .........   −.18*       .46*       .49*      −.004
                    (.06)      (.09)      (.15)      (.18)


NOTE: SEs are given in parentheses. Model 1 is the baseline model; 2 adds
controls for human capital; 3 adds controls for shared conditions; 4 adds
controls for social networks; 5 adds controls for heterogeneity.
* P < .05


gration and one for community migration prevalence; and the last model 
introduces a control for unmeasured shared conditions (unmeasured het- 
erogeneity). For models 2-5, we also estimate the corresponding con- 
strained model where effects of variables are set to be identical across 
transitions. In all cases, the chi-square statistic for goodness of fit leads
to acceptance of the hypothesis that effects are indeed identical across
transitions, that is, to acceptance of the constrained models. The estimates of
parameters of the baseline hazards for each model are in table 5. (Table
7, below, displays the estimates associated with each of the variables
included in models 2–5.)
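
The likelihood ratio statistics behind these comparisons can be recomputed from the log-likelihoods and parameter counts reported in table 4. The sketch below does this for models 3 to 5; the dictionary layout and function name are illustrative, and the statistic is the standard twice-the-difference form rather than output from the CTM program itself.

```python
# (log-likelihood, number of estimated parameters) as reported in table 4.
models = {
    "Model 3": {"unconstrained": (-2205.3, 55), "constrained": (-2208.1, 19)},
    "Model 4": {"unconstrained": (-1976.1, 63), "constrained": (-1983.2, 21)},
    "Model 5": {"unconstrained": (-1949.1, 65), "constrained": (-1958.2, 23)},
}


def likelihood_ratio(unconstrained, constrained):
    """Return (chi-square statistic, degrees of freedom) for nested models."""
    ll_u, k_u = unconstrained
    ll_c, k_c = constrained
    return 2.0 * (ll_u - ll_c), k_u - k_c


results = {
    name: likelihood_ratio(fit["unconstrained"], fit["constrained"])
    for name, fit in models.items()
}
for name, (chi2, df) in results.items():
    print(f"{name}: chi-square = {chi2:.1f}, df = {df}")
```

In each case the statistic is small relative to its degrees of freedom, which is why the equality constraints are not rejected.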
Testing the Main Hypothesis of Social Capital Theory


Resting on the simplified representation made possible by the parsimony 
of constrained models, we turn now to a test of the first hypothesis derived 
from the social capital theory. If social capital theory is valid, we expect 
to observe a marked difference in the risks of out-migration for individuals 
whose sibling has migrated compared with those whose sibling has not, 
even after controlling for observed individual and shared conditions. 
There are two procedures for doing this in the multistate model. The first 
is to test for the significance of differences between baseline parameters 
before and after controls for individual and shared characteristics are 
introduced. The second is to perform global likelihood ratio tests to assess 
the goodness of fit of models with constrained baseline parameters. We 
use each of these in turn. 

Our purpose is to show that prior migration of a sibling exerts an
important effect on the migration risk of the other. To evaluate this claim,
we ask the following counterfactual: How much larger would the migration
risk be for an individual whose sibling has not migrated if that sibling
had migrated? If the focal individual is a younger sibling, the answer is
given by the relative hazard, or the ratio
$\mu_{24}[t; g_{24}(X)] / \mu_{13}[t; g_{13}(X)]$, where $g_{ij}(X)$ is a
linear function of all parameters associated with the measured
characteristics of the focal individual included in vector $X$. In the
constrained models we are using, $g_{24}(X) = g_{13}(X)$, so that the relative
hazard is equal to
$\exp(\alpha_{24} + \beta_{24} \ln t) / \exp(\alpha_{13} + \beta_{13} \ln t)$,
where the $\alpha$ and $\beta$ are the Weibull parameters of the baselines.
When the $\beta$’s are similar to each other, the relative hazard is just a
function of the $\alpha$’s.
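
As a rough illustration of this ratio, the sketch below evaluates the relative hazard of the 2 to 4 transition against the 1 to 3 transition using the model 1 intercepts and slopes from table 5. The function names are illustrative, and the assumption that duration t is measured in years since the onset of exposure is not taken from the text.

```python
import math

# Model 1 baseline estimates from table 5: (intercept, slope).
baseline = {"1->3": (-1.79, 0.08), "2->4": (-0.73, -0.17)}


def weibull_hazard(t, intercept, slope):
    """Baseline hazard in the log-linear form exp(alpha + beta * ln t)."""
    return math.exp(intercept + slope * math.log(t))


def relative_hazard(t, numerator, denominator):
    """Ratio of two baseline hazards evaluated at the same duration t."""
    return weibull_hazard(t, *baseline[numerator]) / weibull_hazard(
        t, *baseline[denominator]
    )


for t in (1, 5, 10):
    print(f"t = {t:>2}: relative hazard = {relative_hazard(t, '2->4', '1->3'):.2f}")
```

At t = 1 the logarithm of t is zero, so the ratio reduces to the exponentiated difference in intercepts, exp(−0.73 + 1.79), or a little under three. Because the two slopes differ in model 1, the ratio shrinks at longer durations.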

Table 5 shows the estimated parameters for the baseline model (model 
1) along with those estimated after successive controls are introduced. We 
are interested in comparing intercepts and slopes for pairs of transitions. 
For example, comparing the intercept (slope) of the transition from state 
2 to state 4 with the intercept (slope) of the transition from state 1 to state
3 provides information about the relative magnitude of the risks of migration
among younger siblings whose older sibling migrated first and
among younger siblings whose older siblings migrated after they did (or
not at all).

The top panel of table 5 captures in parametric form the transition 
processes already described by the raw probabilities shown in table 1. 
Recall that this table offered evidence that network ties do increase the 
likelihood of first migration. The probability of out-migration among those 
with an older migrant sibling was nearly three times that of individuals 
lacking this tie. Moreover, a tie to an older migrant sibling was more 
powerful in promoting out-migration than a tie to a younger migrant 
sibling. We also see this pattern in the parameters estimated for model 1.
The underlying hazard for transition 2 to 4 (migration risk for younger
siblings whose older sibling migrated first) is considerably greater than
that for transition 1 to 2 (migration risk for older siblings whose younger
sibling has not migrated) or 1 to 3 (migration risk for younger siblings
whose older sibling has not migrated). In fact, the intercept for the 2 to 4
transition (−0.73) is significantly above either of the latter two intercepts
(−2.25 or −1.79). Moreover, the value of the intercept (−1.96) for the 3
to 4 transition (risk for older siblings if younger migrated first) is much
closer to the intercept for the 1 to 2 transition than to the intercept of the
2 to 4 transition. This may reflect the fact that a network tie to an older
sibling is more powerful in promoting out-migration than a tie to a
younger migrant sibling, an age asymmetry that would be observed if older
siblings are endowed with more resources by virtue of their migration than
are their younger siblings. The estimated slopes suggest that the higher
hazard for transition 2 to 4 decays more slowly than that for the 1 to 2
transition. The last pair of slope coefficients is not significantly different
from zero, indicating that the hazard does not change strongly as siblings age.

The remaining panels in the table show changes to baseline parameters 
as successive controls are introduced. If social capital theory is correct, 
then we expect the hazard for the 2 to 4 transition (indicating the presence 
of an older migrant sibling) to remain significantly above either the 1 to 
2 or 1 to 3 transition (where there is no migrant sibling) despite controls 
for human capital variables. This is precisely what we observe in model 
2. Whereas the 1 to 2 and 1 to 3 intercepts are well below negative two 
(−2.66 and −2.28, respectively), the 2 to 4 intercept is significantly higher
at −1.13 (P < .001). The slope coefficient for the 2 to 4 transition reveals
a hazard curve that is significantly flatter than that of the 1 to 2
transition. The slope in the latter case (−0.25) indicates a relatively rapid
decay in the hazard of out-migration with age, whereas the slope coefficient
of −0.07 for the 2 to 4 transition is not significantly different from zero and
suggests a constant hazard of out-migration with age. Thus, compared to those
who lack this source of social capital, having an older migrant sibling
exposes individuals to a higher hazard of out-migration over a longer
period of time. And although differences between intercepts for the 2 to
4 transition and for the 1 to 2 and 1 to 3 transitions are somewhat
diminished by the introduction of human capital controls, the gap is still
large and highly significant (−1.13 for the 2 to 4 transition versus −2.66
and −2.28 in the 1 to 2 and 1 to 3 transitions, respectively). The
introduction of human capital controls, however, does slightly enhance the
differences in slopes.

Adding controls for shared conditions in model 3 and two time-dependent
controls for network effects (timing of father’s migration and migration
prevalence in the community) in model 4 does not change the
estimates very much. As shown in table 5, although the value of the
intercept for the 2 to 4 transition first falls slightly to −1.33 and then to
−2.09, it remains significantly above the value of the intercepts for the
1 to 3 transition (−2.53 and −2.71) and for the 1 to 2 transition (−2.92
and −3.07). The slope coefficients hardly change.

The last model in table 5 introduces controls for unmeasured shared 
conditions that are likely to affect the migration risks of both siblings 
(unmeasured heterogeneity). To avoid estimates that are overly sensitive 
to distributional assumptions, we estimate the multistate model allowing 
nonparametric heterogeneity.¹² Specifically, we postulate the existence of
more than one latent subgroup with distinctive risks of first migration.
Despite multiple attempts at increasing their number, our best behaved and
most parsimonious model suggested the existence of only two latent subgroups:
one composing an estimated 34% of the exposed pairs with higher
than average risks of first migration, and a second subgroup composing
about 66% of the population of exposed pairs with lower than average
risks. If the differences in baseline migration risks between younger siblings
whose older siblings have (or lack) migration experience are due
to unmeasured shared conditions, the introduction of a control for such
sources will lead to attenuation of differences among the baselines. If, on
the other hand, social capital theory has some validity, one would expect
those controls to leave the differences unchanged.

Remarkably, once unobserved heterogeneity is controlled, the higher 
intercepts associated with social capital not only persist, but are
strengthened. Having an older sibling who has been to the United States
substantially increases the chances of international migration. Whereas the
intercept for the 2 to 4 transition is −0.11, those associated with transitions
1 to 2 and 1 to 3, both involving no family network ties, are considerably
lower at −3.08 and −1.59, respectively (P < .001). As before, network
effects are asymmetrical with respect to age: those with a younger migrant 
sibling share about the same risk of migration as those lacking migrant 
siblings. 

Allowing for unobserved heterogeneity has a much stronger effect on 


12 There is a different type of heterogeneity, the so-called mover-stayer type
of heterogeneity, for which we also estimated parameters. This type of
heterogeneity captures the possibility that a subset of individuals have a
zero-valued hazard of migration, that is, that they will not migrate no matter
what. We estimated models using sequentially one and the other type of
heterogeneity (we cannot estimate them both simultaneously since this creates
identification problems). The inferences drawn from each were not dissimilar,
though the values of the estimates of parameters were different. Since
unmeasured heterogeneity that does not postulate the existence of a set of
individuals with zero risk of migration is more realistic, we decided to
present only the corresponding results.

the relative size of the slopes. Indeed, the introduction of this control
completely changes the effect of duration in the 2 to 4 transition, switching
its sign from −0.07 to 0.49, a figure virtually identical to the value
estimated for transition 1 to 3. In both cases, the risk of out-migration tends
to increase with age, at least up to age 50 when our observation stops. 
This makes the task of identifying the residual effects of social capital 
very simple, as the only difference between the risks for transitions 1 to 
3 and 2 to 4 is the relative size of the constants. Thus, despite the fact 
that models 1 to 4 fail to account for the existence of the two underlying 
subgroups with different migration risks, their estimates are quite robust. 
Even if a failure to control for unobserved heterogeneity conceals an 
important difference in the shape of transitions, this simply reinforces the 
idea that there are strong social capital effects that cannot be imputed to 
unmeasured shared conditions. 
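
One familiar mechanism that can pull estimated duration dependence downward when latent subgroups are ignored is selective exit: higher-risk pairs leave the risk set first, so the pooled hazard rises more slowly than the hazard within either subgroup. The sketch below illustrates that mechanism only. It is not the CTM estimator, and it assumes a multiplicative two-point mixture; the 34% and 66% shares come from the text, while the multipliers and Weibull parameters are hypothetical.

```python
import math

# Mixing proportions reported in the text; every other value is hypothetical.
SHARES = (0.34, 0.66)      # higher-risk and lower-risk subgroups
MULTIPLIERS = (3.0, 0.3)   # hypothetical subgroup-specific risk multipliers
ALPHA, BETA = -3.0, 0.5    # hypothetical Weibull level and (positive) slope


def base_hazard(t):
    """Within-subgroup baseline hazard exp(alpha + beta * ln t)."""
    return math.exp(ALPHA + BETA * math.log(t))


def base_cumulative_hazard(t):
    """Integral of the baseline hazard from 0 to t (requires beta > -1)."""
    return math.exp(ALPHA) * t ** (BETA + 1.0) / (BETA + 1.0)


def pooled_hazard(t):
    """Hazard observed when the two subgroups are not distinguished."""
    survivors = [
        share * math.exp(-m * base_cumulative_hazard(t))
        for share, m in zip(SHARES, MULTIPLIERS)
    ]
    exits = [s * m * base_hazard(t) for s, m in zip(survivors, MULTIPLIERS)]
    return sum(exits) / sum(survivors)


for t in (1, 5, 10, 20):
    ratio = pooled_hazard(t) / base_hazard(t)
    print(f"t = {t:>2}: pooled hazard / within-group baseline = {ratio:.2f}")
```

With these assumed values the ratio falls steadily with duration, so a model that ignores the subgroups would understate how quickly the within-group risk rises.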

The second procedure to test for differences in the baseline hazards 
consists of a sequence of likelihood ratio tests that assess whether setting 
equality constraints on the estimates of the baseline hazards for pairs of 
transitions leads to changes in the goodness of fit of the model. We perform 
these tests using a slight variation of the most complete model (namely, 
model 5) and refer to it as the unconstrained model, or model U. The 
specification for U is slightly more parsimonious than model 5 in that we 
constrain all three effects of the covariates for household property to be 
identical.¹³

After estimating U, we proceed to estimate four sets of constrained 
models and to calculate chi-square statistics comparing the constrained 
model with U. The first set is for “same-sibling” comparisons and cor- 
responds to migration risks of the younger sibling only. The set includes 
models C1 and C2. The former constrains the intercept and slope of 
transition 1 to 3 to be equal to the intercept and slope of transition 2 to 
4, whereas the latter constrains only the slopes of the transitions to be 
identical. The second set is also for “same-sibling” comparisons but applies 
to the older sibling. This set includes models C3 and C4, which are anal- 
ogous to C1 and C2 but refer to transitions 1 to 2 and 3 to 4, respectively. 
The third and fourth sets are for “cross-sibling” comparisons. The third
set includes models C5 and C6 for the contrast between transitions 1 to
3 and 3 to 4. Model C5 constrains intercepts and slopes to be identical,
whereas model C6 constrains only the slopes to be equal. The fourth set
includes models C7 and C8, which are associated with transitions 1 to 2
and 2 to 4.


13 The estimated effects of each of the three dummies for household property
are very similar to each other (see table 7, model 5), so the loss in fit by
using U is trivial. The chi-square statistic for the constrained and
unconstrained models is 0.54 with 2 degrees of freedom, a value not
significantly different from 0 even at a liberal significance level of 0.05.

TABLE 6
SEQUENTIAL LOG-LIKELIHOOD RATIO TESTS


Model +        Log                  Transitions Involved          Parameters
Constraint     Likelihood    χ²     in Constraints                Constrained        df

U ..........    1,886.7             None
C1 .........    1,898.8     24.2    1–3 vs. 2–4                   Interc. + slope     2
C2 .........    1,887.7      2.0    1–3 vs. 2–4                   Slope only          1
C3 .........    1,887.4      1.4    1–2 vs. 3–4                   Interc. + slope     2
C4 .........    1,886.9       .4    1–2 vs. 3–4                   Slope               1
C5 .........    1,891.2      5.4    1–3 vs. 3–4                   Interc. + slope     2
C6 .........    1,889.4      5.2    1–3 vs. 3–4                   Slope               1
C7 .........    1,901.2     29.0    1–2 vs. 2–4                   Interc. + slope     2
C8 .........    1,897.6     21.8    1–2 vs. 2–4                   Slope               1
C9 .........    1,898.8     24.0    1–3 vs. 2–4 & 1–2 vs. 3–4     Interc. + slope     4
C10 ........    1,904.7     36.0    1–3 vs. 3–4 & 1–2 vs. 2–4     Interc. + slope     4


NOTE: See text for definition of models and contrasts. All tests based on
models that exclude transitions 1 to 4.

Table 6 displays the values of the log-likelihood of models with grad- 
ually increasing constraints. “Same-sibling” contrasts are easy to interpret: 
they reveal that the fit of the constrained model suffers greatly in the case 
of the younger sibling but not at all in the case of the older sibling. In 
fact, the chi-square statistic is 24.2 for model C1, but only 1.4 for model 
C3. It is also clear that the differences in baseline risks are overwhelmingly 
associated with differences in intercepts, not in slopes, since the log like- 
lihood of the model with a constrained slope is almost identical to the log 
likelihood of the unconstrained model. This result indicates that, as ex- 
pected by social capital theory, there are important differences in the 
hazards even after controlling for measured and unmeasured conditions. 
The degradation of goodness of fit under the constrained models reveals
the importance of kin ties for the migration of younger but not necessarily
older siblings.
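
The contrast between the two same-sibling tests can be made concrete by converting the chi-square statistics for models C1 and C3 into tail probabilities. The sketch assumes 2 degrees of freedom for each test (one intercept and one slope constrained), for which the chi-square upper-tail probability has the closed form exp(−x/2); the function name is illustrative.

```python
import math


def chi2_sf_2df(x):
    """Upper-tail probability of a chi-square variate with 2 degrees of freedom."""
    return math.exp(-x / 2.0)


# Chi-square statistics quoted in the text for the same-sibling contrasts.
contrasts = {"C1 (younger sibling)": 24.2, "C3 (older sibling)": 1.4}

p_values = {model: chi2_sf_2df(stat) for model, stat in contrasts.items()}
for model, p in p_values.items():
    verdict = "rejected" if p < 0.05 else "not rejected"
    print(f"{model}: p = {p:.6f}, equality constraint {verdict} at the .05 level")
```

Under this assumption the constraint is firmly rejected for the younger sibling and comfortably retained for the older sibling, in line with the interpretation given above.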

“Cross-sibling” comparisons are slightly more complicated to interpret 
since differences in baselines may also be a consequence of the difference 
in migration risks between the eldest sibling and any other sibling in the 
household, not just of kin effects. For example, the test for model C7, 
associated with a chi-square value of 29.0 and 2 degrees of freedom, 
reveals that there are important differences between transitions 1 to 2 
and 2 to 4, and that such differences are overwhelmingly the result of 
slope differences (positive for transition 2 to 4 and negative for transition 
1 to 2). However, this could result from (a) different social status and 
migration-related roles of eldest siblings in Mexican households; (b)
differences between the experiences of siblings with and without ties to one
with migration experience; or, lastly, (c) a combination of both these
mechanisms. Although the same considerations apply to model C5, the test
reveals that the constrained model fits the data well and that there are 
no significant differences between the slope and intercept of the corre- 
sponding transitions (1 to 3 and 3 to 4). Models C9 and C10 include 
simultaneously “same-sibling” and “cross-sibling” contrasts and are, there- 
fore, summaries of the differences just examined. 
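
To make the size of the model C7 test statistic concrete, the following sketch converts a chi-square value of 29.0 with 2 degrees of freedom into an upper-tail probability. It uses the closed form of the chi-square survival function for 2 degrees of freedom, exp(-x/2). The function name is hypothetical and the calculation is an illustration, not part of the original analysis.

```python
import math


def chi2_sf_2df(x: float) -> float:
    """Upper-tail probability of a chi-square variate with 2 degrees of freedom."""
    # With 2 df the chi-square distribution is exponential with mean 2,
    # so the survival function reduces to exp(-x / 2).
    return math.exp(-x / 2.0)


# Test statistic reported in the text for model C7 (2 degrees of freedom).
p_c7 = chi2_sf_2df(29.0)
print(f"p-value for chi-square = 29.0, df = 2: {p_c7:.2e}")
```

A probability this small is consistent with the statement that transitions 1 to 2 and 2 to 4 differ. The closed form applies only to 2 degrees of freedom, so the other test statistics would need a general chi-square routine.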

In sum, whether we use the coarse procedure of comparing baseline 
hazards or the more robust strategy of assessing goodness of fit by con- 
straining parameters, we arrive at the same conclusion: that there are 
important differences, precisely in the direction predicted by social capital 
theory, between the risks of migration of individuals whose siblings have 
and have not migrated. 


Testing Corollaries 
The first corollary of social capital theory suggests that differences in 
migration risks between those with access to social capital (having a sibling
or father who has migrated) and those without it (having no sibling or
father who has migrated) should increase during periods of stricter im- 
migration enforcement. In order to test this idea, we estimate a model 
that contains a time-dependent dummy variable that assumes a value of 
1 if the year of exposure to migration for each pair of siblings took place 
after 1986, when the Immigration Reform and Control Act was passed, 
and 0 otherwise. We expect the effects of this variable to be strong for
transitions 2 to 4 and 3 to 4 but much less so for transitions 1 to 2 and 
1 to 3. The results (not shown) indicate that although the dummy variable 
is properly signed (negative effects on migration risks in all transitions), 
it has no discernible effect on the differences between the hazards
for transitions 1 to 3 and 2 to 4, on the one hand, and between 1 to 2 and
3 to 4, on the other.

To evaluate the validity of the second and third corollaries, we use table 
7. This table displays estimates of effects (and standard errors) for the 
constrained versions of models 2 to 5. The second corollary implies that the
time-dependent variable for timing of father’s migration has a significant 
effect and that it does not alter the differences in risks between siblings 
with and without access to social capital (migration experience of sibling). 


1 A better test than the one performed would have been to estimate two models,
one for the period before IRCA and one for the period after. Regrettably, the number 
of events induced by the partition is too small in each case, and estimates are difficult 
to obtain or are unstable.


TABLE 7
PARAMETER ESTIMATES FOR CONTROLS ADDED IN SUCCESSIVE PHASES OF ESTIMATION

                            MODEL 2             MODEL 3             MODEL 4             MODEL 5
CONTROL VARIABLE            B         SE        B         SE        B         SE        B         SE

Individual traits:
  Schooling 4+              [illeg.]* .14       -.43*     [illeg.]  -.44*     .09       -.43*     .11
  Skilled occupation        -.94*     .12       -.81*     .07       -.76*     .07       -.78*     .09
  Male                      1.60*     .14       1.48*     .48       1.50*     .08       1.74*     .09
Household characteristics:
  Head schooling 4+                             .33*      .12       .15*      .07       .14       .09
  Head skilled occupation                       [illeg.]  .07       .09       .07       .12       .08
  Head male                                     .06       .10       -.24*     .11       -.27      .11
  Owns farm land                                -.04      .10       -.02      .09       -.05      .11
  Owns real estate                              -.09      .07       -.08      .07       -.07      .09
  Owns business                                 .11       .07       .08       .08       .09       .08
Social networks:
  Prevalence                                                        .61*      .07       .73*      .09
  Father a migrant                                                  .71*      .07       .89*      .09
Factor loadings for unmeasured heterogeneity:
  State 1 to state 2                                                                    -.44*     .02
  State 1 to state 3                                                                    -2.48*    .33
  State 2 to state 4                                                                    -2.99*    .32
  State 3 to state 4                                                                    [illeg.]  [illeg.]
  Probability                                                                           .34*      .04

* P < .05


We have already shown that the latter part of this proposition is indeed 
confirmed by the data (likelihood ratio test corresponding to model 4 in
table 6). The regression coefficients in table 7 show that the first part of 
the proposition is also true. Note that the estimated coefficient associated 
with having a migrant father is 0.89, meaning that the risk of migrating
for any sibling whose father has already migrated is 2.43 [exp(.89)] times
as high as the risk for any sibling whose father has not migrated. This 
large effect is likely to occur because father’s migration is also a source 
of social capital. This finding is harder to justify though not inconsistent 
with the household joint decision-making or diversification theories since 
neither suggests a positive correlation of migration risks among all mem- 
bers of a household. 

The third corollary implies significant effects of prevalence of migration 
experience in the community, above and beyond the social capital effects 
of siblings’ migration. Table 7 indicates that this is in fact the case: the 
independent effects of living in a high prevalence migration community 
are of the order of 0.73, meaning that the risks of migrating for any sibling
are about twice as high [exp(.73)] as in areas with lower prevalence of migration.
Again, one would not expect this result under the joint household decision- 
making or risk diversification models, once we control for all relevant 
household and individual conditions. 
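
The conversion from coefficients to relative risks used above can be sketched as follows. The two coefficients are the model 5 estimates quoted in the text. The combined figure assumes that the two effects enter the log hazard additively with no interaction, which is an assumption of this illustration rather than a result reported here.

```python
import math

# Model 5 coefficients quoted in the text (log-hazard scale).
coefficients = {
    "father a migrant": 0.89,
    "high community prevalence": 0.73,
}

# Exponentiating a log-hazard coefficient gives the relative risk
# for a one-unit change in the covariate.
relative_risk = {name: math.exp(b) for name, b in coefficients.items()}

for name, rr in relative_risk.items():
    print(f"{name}: relative risk = {rr:.2f}")

# Assumption: no interaction, so the log-hazard effects simply add.
combined_risk = math.exp(sum(coefficients.values()))
print(f"both conditions together (additive log hazards assumed): {combined_risk:.2f}")
```

Under that additivity assumption the joint relative risk is the product of the two individual relative risks.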


IMPLICATIONS OF RESULTS 


To illustrate the effects of social capital, we use the parameter estimates 
of model 5 in table 7 to generate three outcomes: waiting time to first 
migration, proportion not migrating by age 30, and median age at first 
migration. We estimate these quantities under two assumptions—that the 
subject does and does not have an older sibling with U.S. experience—and 
we use the parameters for transitions 1 to 3 and 2 to 4, respectively. We 
compute statistics with conventional life table methods, using the parameter
values shown in table 7. For the sake of illustration, we design four
different population profiles reflecting different combinations of values 
for the control variables (which are applied to the coefficients in table 5). 
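
The three life table quantities can be sketched in a few lines. The example below is a minimal illustration, assuming a hazard that is constant within each year of age, a starting age of 15, and made-up hazard values; none of these inputs are estimates from the tables, and the function name and the doubling factor are hypothetical.

```python
import math


def life_table_summary(hazards, start_age=15, horizon_age=30):
    """Summarize a single-decrement life table built from yearly hazard rates.

    hazards[i] is the hazard assumed constant during the year of age
    start_age + i. Returns (share nonmigrant at horizon_age, median age
    at first migration or None, expected nonmigrant years in the window).
    """
    survival = [1.0]      # survival[i] = share still nonmigrant at age start_age + i
    years_at_risk = 0.0
    median_age = None
    for i, h in enumerate(hazards):
        s_start = survival[-1]
        s_end = s_start * math.exp(-h)
        # Person-years lived in the interval under a constant hazard.
        years_at_risk += s_start * ((1.0 - math.exp(-h)) / h if h > 0 else 1.0)
        # The median lies in the first interval where survival falls to 0.5.
        if median_age is None and h > 0 and s_end <= 0.5:
            median_age = start_age + i + math.log(s_start / 0.5) / h
        survival.append(s_end)
    return survival[horizon_age - start_age], median_age, years_at_risk


# Illustrative inputs only: a flat yearly hazard of 0.10 from age 15 to 30,
# and a hypothetical doubling of that hazard for those with a migrant sibling.
baseline = [0.10] * 15
with_tie = [h * 2.0 for h in baseline]

no_tie_summary = life_table_summary(baseline)
tie_summary = life_table_summary(with_tie)
print("without tie:", no_tie_summary)
print("with tie:   ", tie_summary)
```

With any hazard multiplier above one, all three quantities move in the same direction: fewer nonmigrants at age 30, a lower median age, and a shorter expected wait.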

The first profile assumes an unskilled, uneducated male whose house- 
hold head is similarly unskilled, uneducated, and without property. More- 
over, the head has not been to the United States and resides in a com- 
munity with little migratory experience. The second profile assumes the 
same male sibling and household head, except that now we assume the 
head has been to the United States and lives in a community with many 
migrants. The next two profiles are for male siblings who are educated, 
skilled, and whose household heads are likewise educated, skilled, and 
property-owning. In the third profile, the head has not migrated to the 
United States, and the community has few international migrations. In 
the fourth profile, the head is a previous migrant, and the community
has a high prevalence of migration. 

Outcomes associated with these profiles are presented in table 8. The 
upper panel shows what happens in the absence of a tie to a migrant 
sibling, and the lower panel reveals what happens when an older sibling 
has already migrated. No matter what profile is assumed, having an older 
migrant sibling (i.e., a network tie yielding social capital) substantially 
reduces the waiting time to migration, lessens the percentage who have 
not migrated by age 30, and lowers the median age of first migration. 
The first two columns, for example, correspond to the socioeconomic 
profile of the person generally most at risk of migrating to the United 
States: an unskilled and uneducated man without property. In profile 1, 
he is assumed to lack network ties, either through the household head or 
the wider community. Under these circumstances, if one’s older sibling 
were to migrate, the waiting time to first migration would be cut in half


TABLE 8
EXPECTED LIFE TABLE PARAMETERS FOR YOUNGER SIBLINGS’ TIME TO FIRST
MIGRATION WITH AND WITHOUT AN OLDER MIGRANT SIBLING ASSUMING
DIFFERENT POPULATION PROFILES

                                          PROFILE NUMBER
                                    1       2       3       4

Without social capital:*
  Years to first migration          6.8     1.9     12.6    2.6
  % nonmigrant at age 30            24.0    0       45.0    2.0
  Median age at first migration     20.2    15.6    26.6    16.0
With social capital:*
  Years to first migration          3.1     .9      6.5     1.8
  % nonmigrant at age 30            5.0     0       24.0    1.0
  Median age at first migration     16.7    15.0    20.0    15.5

NOTE. For profiles 1 and 2, siblings = uneducated, unskilled, male; head = uneducated, unskilled,
no property. Profile 1 shows data for nonmigrant heads and low migration prevalence; profile 2 shows
data for migrant heads and high migration prevalence. For profiles 3 and 4, siblings = educated, skilled,
male; head = educated, skilled, property. Profile 3 shows data for nonmigrant heads and low migration
prevalence; profile 4 shows data for migrant heads and high migration prevalence.

* Social capital indicates an older migrant sibling.


(from 6.8 to 3.1 years), the percentage nonmigrant by age 30 would drop 
from 24% to 5%, and the median age at first migration would fall from 
20.2 to 16.7. Thus, having a family network tie substantially quickens 
the transition to international migration. 

If one assumes that the sibling lives in a household where the head has 
migrated and in a community characterized by a high prevalence of U.S. 
migrants, then out-migration becomes virtually inevitable in any event, 
although the transition is again more rapid for those who have an older 
migrant sibling than for those who do not. In the former case, the average 
number of years to first migration is just 0.9, the median age of departure 
is 15, and the percentage who have not migrated by age 30 is 0. 

A similar contrast is observed among those with more education and 
occupational skills, generally persons who would be assumed to be less 
prone to international migration. Among such people living in households 
without a migrant head and in a community with low migration prev- 
alence, the absence of a tie to a migrant sibling yields a waiting time of 
12.6 years to first migration, with 45% not migrating by age 30 and a 
median age at migration of 26.6. In contrast, the simple addition of an 
older migrant sibling lowers the waiting time to 6.5 years, reduces the 
percentage nonmigrant by age 30 to 24%, and cuts the median age at 
migration to 20. Although the presence of a migrant father and a high 
prevalence of community migration once again dominate in determining 
these statistics, the existence of a migrant sibling tie nonetheless works
to speed up the transition to international migration, reduce the age of
first departure, and lower the percentage who never migrate. 


CONCLUSION 


By specifying and estimating a flexible multistate hazard model, we sought 
to overcome some important limitations of prior research on migrant 
networks and social capital. Although earlier studies show that having a 
social tie to someone with migrant experience significantly raises the like- 
lihood of out-migration, they failed to control for the effects of common 
causes (unobserved heterogeneity), possibly yielding overestimates of ap- 
parent network effects. Prior studies have likewise been unable to elim- 
inate competing explanations derived from neoclassical economic theory 
and the new economics of labor migration, both of which predict a cor- 
relation between the migratory behavior of household members but do 
not posit the existence of social capital or network effects. 

Our work has been successful in eliminating common causality and 
selectivity as competing explanations for family-based network effects. 
Estimates from our multistate hazard model show that having an older 
sibling who has been to the United States triples the likelihood of mi- 
grating to the United States and that this differential in the odds of 
movement persists when controls for human capital, common conditions, 
and unobserved heterogeneity are introduced. Overall, the estimated ef- 
fects suggest very sharp differences in the behavior of people who are 
and are not exposed to migratory behavior through a tie to a migrant 
sibling. 

We cannot implement a strong test to rule out competing explanations 
drawn from the other two theoretical models. Nonetheless, the
fact that the apparent network effects pertain not only to close ties within 
households, but also to diffuse ties within communities confirms a pre- 
diction derived from social capital theory but not neoclassical economics 
or the new economics of labor migration. Moreover, the migration- 
inducing effect of a tie to an older migrant sibling is not reduced by high 
prevalence of migration in the community. Finally, although a father’s 
migration experience exerts powerful influences on the migration risks of 
both siblings, it does not alter the differences in risks between individuals 
with and without a migrant sibling. 

Despite these supportive findings, considerable work remains to be done 
to confirm the validity of social capital as a useful theoretical concept. 
For example, networks based on kinship are not necessarily the most 
efficient or most salient in shaping migration decisions. Indeed, networks 
based on much weaker ties of friendship or acquaintance may be equally
or more important than kinship ties in determining the odds of out-migration.
Although we have clearly demonstrated that siblings are an
important source of social capital, this connection represents
only one strand in a much larger and potentially more powerful fabric 
of social relations affecting migration.


This article examines the effects of crowding in a market center on
rates of change in organizational niche width and on organizational 
mortality. It proposes that, although firms with wide niches benefit 
from risk spreading and economies of scale, they are simultaneously 
exposed to intense competition. An analysis of organizational dy- 
namics in automobile manufacturing firms in France, Germany, and 
Great Britain shows that competitive pressure not only increases 
the hazard of disbanding but also prompts organizational transfor- 
mations that give rise to processes of resource partitioning. Em- 
phasizing the content/process distinction in conceptualizing organ- 
izational change, the article finds that the process effect of changes 
in niche width and position increases mortality hazards. We discuss 
our findings in light of the processes investigated by the ecological 
theories of density dependence, resource partitioning, and structural 
inertia, and point to the theoretical links that help to integrate these 
theories. 


INTRODUCTION 


This article investigates processes at the intersection of density-dependent 
evolution and resource partitioning. In particular, it examines the effects
of crowding in the center of a market on rates of change in organizational
niche width and on organizational mortality. 

Consider a product market defined in terms of a single dimension— 
engine capacity of automobiles in our empirical application. Producers 
locate themselves in such a market by choosing an array of products. 
Specialists offer products with a small range of variation on the dimension 
of interest, and generalists display a broad range of products. In addition 
to choosing a range (niche width), each producer also chooses a location, 
which we identify as the center of its product range. Specialists have great 
freedom in choice of location; they can locate in the center or toward 
either periphery (above and below the center). Generalists face more con- 
straint in choice of location because a wide niche covers much of the 
market; yet, they can still choose whether to locate the center of their 
range near the center of the market or toward one of the peripheries. 
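
The two choices described above, a range and a location, can be made concrete with a small sketch. It assumes that niche width is measured as the span between a producer's smallest and largest engine capacity and that location is the midpoint of that span; the class name, firm labels, and capacities are hypothetical and do not reproduce the measures used in the empirical analysis.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Producer:
    name: str
    engine_capacities: List[float]  # litres; hypothetical product array

    @property
    def niche_width(self) -> float:
        # Range covered on the single market dimension (assumed measure).
        return max(self.engine_capacities) - min(self.engine_capacities)

    @property
    def niche_center(self) -> float:
        # Location: midpoint of the product range.
        return (max(self.engine_capacities) + min(self.engine_capacities)) / 2.0


specialist = Producer("hypothetical specialist", [1.0, 1.1])
generalist = Producer("hypothetical generalist", [0.8, 1.5, 2.4, 3.2])

for firm in (specialist, generalist):
    print(f"{firm.name}: width = {firm.niche_width:.2f}, center = {firm.niche_center:.2f}")
```

In this representation a specialist can sit anywhere along the dimension, whereas a generalist's wide range necessarily covers a large part of it, which is the constraint on location noted in the text.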

The emergence of a center in a market with many producers means 
that consumer preferences have become concentrated on a relatively nar- 
row range. The concentration of consumers at a center has two conse- 
quences. First, producers who gain leadership in the center can grow very 
large, and often they can exploit economies of scale. Second, the attrac- 
tiveness of the center intensifies competition in the center (at least initially), 
as many producers seek to match their products with consumer 
preferences. 

How does crowding in the center affect the organizations in a market? 
According to density-dependence theory (and, especially, its generaliza- 
tions to localized-density models), crowding intensifies competition, which, 
in turn, elevates mortality hazards. This theory posits that crowding in- 
creases the mortality hazards of organizations located in the center. Such 
an effect should be especially pronounced for specialists located in the 
center, who have nowhere to hide. In contrast, generalists whose niches 
span the center can potentially offset some of the deleterious effects of 
crowding in the center with success in less competitive regions covered 
by their wide niches. 

Technological niches, though subject to strong inertial pressures, are 
not fixed. Crowding arguably induces organizations in the center to try 
to modify their niches. Obviously, this can mean change in niche width 
or change in niche position. So, for instance, specialists in the center might 
stay specialized but may move to a peripheral position or broaden their 
niches without changing the centers of their niches. 


erman, and the AJS reviewers for helpful comments. Earlier drafts of the paper ben- 
efited from presentations at the 2000 ASA meeting in Washington, DC, and at the 
Tulane Sociology Department. Direct correspondence to Stanislav Dobrev, Graduate 
School of Business, University of Chicago, 1101 East 58th Street, Chicago, Illinois
60637.

If change in a technological niche requires fundamental reorganization,
then such change increases the risk of mortality, at least in the short run. 
So, crowding has a possible indirect effect on organizational mortality by 
stimulating changes that heighten the risk of mortality. 

Because the effects of crowding on mortality and the risks due to 
changes stimulated by crowding depend upon an organization’s position 
and niche width, the processes involved arguably can give rise to market 
partitioning. For instance, resource-partitioning theory (Carroll 1985; Car- 
roll and Swaminathan 2000; Dobrev 2000) assumes that, if generalists are 
present in the center, then specialists can thrive only in the peripheries. 
This is an obvious potential consequence of the microprocesses described 
above. However, these processes can produce other patterns as well.²

We examine these issues in the context of the automobile industries of 
France, Germany, and Great Britain. The automobile industry has a ma- 
jor advantage for study of the dynamics of crowding and resource par- 
titioning: specialism and small size can be clearly distinguished. This 
matters because some recent research on partitioning processes treats spe- 
cialization as equivalent to small size (Boone, Bröcheler, and Carroll 2000).
Although specialists tend to be small in some industries, there is no in- 
herent connection between size and specialism. Given the current state 
of empirical knowledge, it is important to distinguish the effects of size 
from those of specialism, as our empirical setting allows. In the automobile 
industry, some specialists have grown very large. Indeed, no firm has so 
thoroughly dominated the world automobile industry as Ford in its heyday,
in which it offered a single model, the Model T.³ Examples of dominant
firms with very narrow niches in European automobile industries include 
the Great Horseless Carriage Company in the 1890s, Adam Opel in the 
1920s, Volkswagen in the 1950s and 1960s, and Renault in the 1960s. 


NICHE WIDTH, POSITION, AND LIFE CHANCES 


A fundamental sociological insight holds that opportunities and con- 
straints are inextricably linked to actors’ locations in a social structure. 
The concept of organizational niche builds on this view. A niche sum- 
marizes the ways in which social action (collective and individual) hinges 


2 It is worth noting that the possibility that small niches suitable for specialists can
persist in the center as a consequence of the geometry of sphere packing (Péli and
Nooteboom 1999) does not apply to the case we consider. The niches we consider are 
one dimensional; the densest packing is complete in this case. 

3 Various versions of the Model T, with only one engine offered at any one time, 
garnered sales of over 15 million cars during 1908-27. For some of this period, Ford 
controlled over half of the world market for automobiles.

on the position of focal actors in a social space.* Our focus on organi-
zational niches stems from our desire to understand how an organization’s 
position in an industry shapes its life chances. An analysis of niches allows 
us to understand the behavior of organizations as it reflects the competitive 
and symbiotic dynamics of interaction within a particular market segment. 

Niche width refers to an entity’s variance in resource utilization (Han- 
nan and Freeman 1989). This representation proves useful for classifying 
organizations in terms of generalist (wide niche) and specialist (narrow 
niche) competitive strategies. Claimed benefits of specialism (or general- 
ism) for organizational performance have received much attention. Con- 
ventional strategic analysis builds on an especially simple notion—for 
firms to grow, expand, and increase profitability, they must diversify 
(Chandler 1962; Steiner 1964). Entry into new markets, products, or serv- 
ices not only allows the firm to tap potentially unexplored resources, but 
also reduces risk by spreading it over several operational domains. Ad- 
ditionally, diversification generates economies of scale and scope across 
similar functions in the separate lines of business (Paine and Anderson 
1983). Thus, on this view, generalism is not merely a desirable state; it stands
as a requisite for successful organizational performance.

The purported advantages of a generalist strategy underestimate the 
importance of timing in industry evolution, fluctuation in resource
flows, market concentration, and distributions of organizational size and 
age. Environmental processes have been emphasized in the work on or- 
ganizational niche theory (Freeman and Hannan 1983; Hannan and Free- 
man 1989; Péli 1997). This theory specifies the configuration of environ- 
mental conditions under which the generalist strategy conveys advantage. 
In particular, the theory posits that generalists are favored when envi- 
ronmental variation is high and the pattern of variation is coarse. Our 
reading of the history of the industry suggests that changes in technology
and in consumer tastes have been highly uncertain throughout and
that they have occurred irregularly over time. In
other words, we think that variations in key environments of this industry 
have been highly uncertain and coarse grained. Our hypothesis about the 
main effect of niche width on mortality hazards is based upon this as- 
sumption. That is, in the case of automobile manufacturers we argue: 

HYPOTHESIS 1—An organization's hazard of mortality is a decreasing 
function of its niche width. 


* Although the constructs of organizational niche and niche-overlap density are closely 
associated with organizational ecology (Freeman and Hannan 1983; Hannan and Free- 
man 1989; Baum and Singh 1994), they have also been used in sociological research 
on networks (Burt 1993), work that combines network and ecological themes (McPherson
1983; McPherson, Popielarz, and Drobnič 1992; Podolny, Stuart, and Hannan
1996), and research on social movements (Olzak and Uhrig 1999).

Above we noted the strong connection between an entity’s niche width
and the degree to which its niche overlaps the niches of others. When an 
organization broadens its niche, it cannot decrease overlaps and generally 
increases them. So a complete analysis of the consequences of niche width 
must take account of the excess overlap entailed by broad-niche strategies. 

Our argument about the pattern of competitive intensity triggered by 
niche crowding builds on ideas about localized competition (Hannan and 
Freeman 1977; Barnett and Carroll 1987; Podolny, Stuart, and Hannan 
1996; Han 1998) and the theory of density-dependent legitimation and 
competition (Hannan and Carroll 1992, 2000). We argue that the intensity 
of competition experienced by an organization in a market segment is a 
positive increasing function of the number of niches it overlaps, that is, 
its niche-overlap density.* 

HYPOTHESIS 2.—An organization's mortality hazard is a positive increasing
function of its niche-overlap density.
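
Niche-overlap density, understood here as the number of other niches that a focal niche overlaps, can be sketched for a one-dimensional market as a count of intersecting intervals. The intervals below are hypothetical engine-capacity ranges in litres, the function names are illustrative, and treating niches as closed intervals is an assumption of this sketch.

```python
from typing import List, Tuple

Niche = Tuple[float, float]  # (lowest, highest) position on the market dimension


def overlaps(a: Niche, b: Niche) -> bool:
    """True when the closed intervals a and b share at least one point."""
    return a[0] <= b[1] and b[0] <= a[1]


def overlap_density(focal: Niche, others: List[Niche]) -> int:
    """Number of other niches that the focal niche overlaps."""
    return sum(1 for other in others if overlaps(focal, other))


# Hypothetical rivals and two versions of the same focal firm.
rivals = [(0.9, 1.2), (1.1, 1.6), (2.0, 3.0), (3.5, 5.0)]
narrow_niche = (1.0, 1.3)
broad_niche = (1.0, 2.5)   # same lower bound, wider range

narrow_density = overlap_density(narrow_niche, rivals)
broad_density = overlap_density(broad_niche, rivals)
print(f"narrow niche overlaps {narrow_density} rivals; broad niche overlaps {broad_density}")
```

Because the broad niche contains the narrow one, every rival overlapped by the narrow niche is also overlapped by the broad niche, which illustrates why broadening a niche cannot decrease overlaps.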

Niche theory, like other first-generation theory fragments in organi- 
zational ecology, deals with highly general specifications. In particular, it 
assumes that the conditions that favor generalism or specialism hold uni- 
versally (Carroll 1984). In the years since niche theory was developed, we 
have learned about diverse forms of population and industry structure 
that develop over time, as identities and positions become defined and 
solidified. Hannan (1997) builds on these arguments to propose that the
effects of density on competition (and legitimation) are intense at the early 
stages of population growth when populations and industries lack struc- 
ture and that they decrease in strength over time. As an industry con- 
solidates in form and content, the patterns of organizational interaction 
gradually become embedded in the evolving industry structure. Possibly, 
in certain cases, the increase in niche-overlap density might represent the 
type of syndicate and alliance formation among newly established part- 
ners, rather than the clustering of potential competitors. Analyses of re- 
source partitioning have also shown that relationships among specialist 
entrants tend to form on the basis of symbiotic cooperation instead of 


* Not all work on competitive intensity would agree with this proposition. For instance, 
some industrial economists have argued that the transition from monopoly to duopoly 
has crucial consequences for strategy formation. Bresnahan and Reiss (1991) show
that, in localized markets with three to five competitors, the entry of
additional competitors does not make a substantial difference. Ecologists Barnett
and Hansen (1996) show that, among Illinois banks outside of Chicago, only the arrival 
of the first competitor in the town increases the focal firm’s chances of failure signif- 
icantly. Their results suggest that “additional increases in the number of competitors 
in a local market actually decrease failure rates” (1996, p. 153). We explored this issue 
by supplementing the usual parametric analyses of the effects of overlap density with 
a more flexible specification. As we describe below, we found that the effect of overlap 
density is much stronger at higher densities.

cutthroat competition. In the case of the U.S. brewing industry, for ex-
ample, the wave of entries of small niche producers (microbrews and 
brewpubs) strongly resembles the coevolving mutualism typical of a social 
movement (Swaminathan and Carroll 1995, p. 236). In general, the del- 
eterious effect of niche crowding on survival chances ought to fade as an 
industry matures (Hannan 1997). 

HYPOTHESIS 3.—The effect of niche-overlap density on an organization's
hazard of mortality declines with industry age.


INERTIA, CHANGE, AND LIFE CHANCES 


It is important to recognize that the predicted effects of niche width and 
niche-overlap density do not fully reveal or account for the experiences 
of all, or even most, organizations in competitive markets. For example,
rarely does a firm enter the market with a certain set of product offerings 
and manage to sustain that exact same set throughout a long tenure in 
an industry. Whether as a result of innovation, copying from successful 
competitors, or obsolescence, product turnover is an inevitable part of 
keeping pace with industry development. As new products are
introduced and old ones abandoned, as mandates for technological com- 
parability and standardization intercede with financial objectives and 
limited resource availability, the line of product offerings that each firm 
can bring to market might vary substantially. So for many rea- 
sons—strategic, institutional, or random—the boundaries of an organi- 
zation’s niche and its width likely will fluctuate. We believe that such 
transitions involving change in niche width and market position have 
vital consequences for the organizations that initiate them. Thus under- 
standing the likely advantages and disadvantages conveyed by niche 
width and market position requires that we also investigate the processes 
by which organizations acquire such characteristics. We turn to this issue 
next by bringing in insights from several ecological theories, including 
structural inertia, resource partitioning, and organizational learning. 

Do efforts to broaden or narrow the array of an organization’s tech- 
nological competencies, or attempts to reposition in the market in search 
of new opportunities or of a less competitive sector, affect organizational 
performance and survival chances? Adaptation and selection theories sup- 
ply different arguments to answer the question. The former frequently 
point to the benefits that an organization can derive once the desired 
transformation has been completed, whereas the latter emphasize the 
difficulties associated with accomplishing the transition. Thus, the main 
issue in the lingering debate between the two perspectives has centered
on whether the costs of implementing organizational change outweigh the
potential benefits of its outcome, or vice versa. 

A body of recent empirical research has singled out several important 
factors that—when taken into account—help to reconcile the adaptation 
and the selection arguments. This research shows that changes in core 
features of the organization increase mortality hazards, while changes in 
peripheral attributes sometimes have the opposite effect. Hannan and 
Freeman’s (1984) structural-inertia theory, which posits that core change 
degrades structural reproducibility and hence lowers an organization’s 
ability to act as a reliable and accountable collective entity, has been 
confirmed in most empirical studies conducted using the appropriate re- 
search design (Barnett and Carroll 1995; Carroll and Hannan 2000). Ex- 
amples of studies showing that mortality hazards rise with core change 
include research on several newspaper populations (Carroll 1984; Miner, 
Amburgey, and Stearns 1990; Amburgey, Kelly, and Barnett 1993; Dobrev 
1999), telephone companies (Barnett 1994), voluntary social service as- 
sociations (Singh, House, and Tucker 1986), social movement organiza- 
tions (Minkoff 1999), banks (Han 1998), and film-production companies 
(Kim 1998, 2000). 

Barnett and Carroll (1995) criticize research on this subject for failing 
to distinguish between the content and the process of change. They argue 
that research is most informative about the hazards of change when it 
separates the effect of the characteristics of the state to which an organ- 
ization is moving from the various obstacles and impediments it undergoes 
in trying to get to that state. Understanding why organizations fail to 
achieve the desired outcome of internal transformation, or why they simply 
fail, involves understanding the process dimension of change. 

We define two types of organizational-level transformations, each per- 
taining to position in the market. Then we decouple the content from the 
process dimension in each. We begin with change in organizational niche. 
Hypotheses 1 and 2 posit that organizations with wide product niches 
will benefit from taking advantage of economies of scale, more adaptive 
resource utilization, and risk spreading, but, at the same time, will face 
stronger competitive pressures. These hypotheses concern states, not pro- 
cesses at the organization level. Therefore, from the perspective of an 
organization’s changing its niche, these hypotheses relate solely to the 
content of change. Yet, implementing a substantial change in niche width 
likely triggers additional unintended effects associated with change, per 
se. The theory of structural inertia predicts that the process effects will 
have deleterious consequences for survival chances if such changes entail 
fundamental organizational transformation. 

Why should this be so? The explanations concern both the organization 
and population levels. The loss of established routines and practices means 
losing capabilities and competencies that need to be replaced with new 
ones that take time to develop. The likely reshuffling of the political status 
quo and the internal balance of power generates individual resilience and 
inertia as well as fear of the unknown (Dobrev and Barnett 2001). Re- 
lationships that rest on trust are slow to rebuild. Fundamental change 
also disrupts the intra- and interorganizational networks among employ- 
ees as well as between the organization and its external constituents that 
act as depositories of organization-specific tacit knowledge. In short, the 
process of organizational change resets the liability of newness clock (Hannan 
and Freeman 1984; Amburgey et al. 1993; Péli, Pólos, and Hannan 
2000). 

To develop and test these arguments, we need to define market position, 
change in market position, and the difference between the content and 
process of change. One useful way to conceptualize the social structure 
of a market follows the resource-partitioning model. It construes the mar- 
ket as a reflection of an intersection of dimensions capturing the distri- 
butions of industry-relevant consumer preferences. The peaks of those 
distributions tell us where the market center lies—where resources are 
most abundant and competition is ordinarily stiffest. Market position of 
an organization then can be defined in terms of its distance from the 
market center, and change in market position corresponds to the difference 
in that distance between sequential observations. The content of change 
in this context refers to the costs and benefits associated with the position 
to which the organization is aspiring to move (as contrasted with those 
for the state it is leaving). 

By contrast, the process dimension reveals the difficulties of actually 
implementing and undergoing such a transition. The market and its center, 
of course, are dynamic constructs that can also change continuously. Al- 
though an organization can experience a change in its relative position 
without making any changes in its own product offerings, the argument 
linking the process of organizational change to increased exposure to 
selection pressures and higher likelihood of disbanding would not apply 
to such cases. Integrating the theory of structural inertia with the con- 
ceptual differentiation between content and process change yields the 
following two hypotheses predicting negative effects of process change in 
niche width and in absolute market position on survival chances of auto- 
mobile manufacturers. 

HYPOTHESIS 4.—Change in an organization's niche width increases its 
hazard of mortality. 

HYPOTHESIS 5.—Change in an organization's absolute market position 
increases its hazard of mortality. 

Another important issue in studying organizational change concerns the 
ability to discern direct and indirect effects that might impact the outcome 
of internal transformation. For example, in their research on Finnish 
newspapers, Amburgey et al. (1993) find that organizational age has pro- 
nounced effects on the rate of disbanding and on the rate of change. The 
findings suggest that young organizations are more likely to change and 
less likely to experience the negative repercussions of that change. Under- 
standing the possible compounded effect of certain organizational attrib- 
utes on internal transformation mandates analyzing their relevance for 
survival chances and also the degree to which such factors influence the 
propensity to implement change. 

Above we argued that niche-overlap density would increase mortality 
risks by stiffening competition. Yet, not all organizations fail as a result 
of intensified market-segment rivalries and depleting resources. What al- 
ternative strategy might such organizations pursue? To answer this ques- 
tion, we turn to the resource-partitioning theory (Carroll 1985). This model 
derives market segmentation as a result of strong competition for position 
in the most lucrative sector of the market. The process drives industry 
concentration high, as typically just a few generalists end up occupying 
the market center. The rest of the competitors either fail or retreat toward 
the periphery where they tend to specialize and explore untapped re- 
sources. We think the transition toward the market periphery entails sig- 
nificant reorganization. Whether aimed at meeting the demands of new 
consumer groups or at expanding emergent specialized market segments 
and creating demand by introducing novelty products, the move from a 
more to a less saturated market segment likely involves both change in 
market position and in niche width. So, based on the resource-partitioning 
process, we expect that heightened competitive intensity not only increases 
mortality hazards, but it also promotes the occurrence of internal change. 

HYPOTHESIS 6.—An organization's likelihood of changing its niche 
width and position increases with its niche-overlap density. 

The management literature generally emphasizes the value of organizations’ 
capacities to learn (Argyris 1999). Much of the sociological literature 
underscores the hazards and limitations of organizational learning 
(March 1991; Hannan and Freeman 1989). The two perspectives, however, 
are not contradictory. Organizations do indeed learn from what they do 
and eventually become better at it. Their propensity to do what they have 
learned and have become good at increases concurrently. This organi- 
zational feature, of course, need not be construed negatively as long as 
the substance of the cognitive accumulation itself constitutes the right 
course of action under the prevailing environmental conditions. However, 
the dynamic and multifaceted nature of the environment and the changing 
needs of the organization at each stage of its life cycle demand continuous 
realignment between an organization’s strategy and its external constit- 
uents. In that context, falling in a competency trap (March 1988) by relying 
more on and becoming better at an infeasible course of action is a doomed 
strategy that results from the mixture of past learning and inertia. We 
examine possible path dependence in change processes by testing the 
following hypothesis. 

HYPOTHESIS 7.—The greater an organization’s experience with a type 
of change, the greater is its likelihood of repeating the same type of change 
and the lower is its likelihood of making other kinds of changes. 

Organizational sociologists have pointed to a link between structural 
complexity and inertia (Hannan and Freeman 1984; Barnett and Carroll 
1995). If organizations with complex structures are inherently less capable 
of fundamental change, what do they learn from their experiences? As- 
sume that the capacity to learn and the type of change incurred are held 
constant for all organizations (regardless of the complexity of their structures). 
Whereas we expect organizations with simpler designs to learn how 
to change and to repeat the same changes that they have already survived, 
we expect this to be less true for complex organizations. In other words, we 
think that the “momentum of change” declines with complexity. We do 
not have a direct measure of organizational complexity. However, our 
reading of the primary sources on this industry leads us to think that 
organizational complexity for automobile manufacturers tends to be di- 
rectly proportionate to their niche width. Based on this assumption, we 
conjecture that 

HYPOTHESIS 8.—The effect of an organization’s experience with a type 
of change on its likelihood of repeating that change declines with the 
complexity of the organization. 


Completing the Model 


We agree with Barnett and Carroll (1995) that modeling internal organ- 
izational change will not be complete unless the general evolutionary 
trends of the industry are taken into account. So we try to account for 
all of the factors that earlier research on organizational change 
and ecological research on determinants of rates of organizational disbanding 
have indicated to be relevant. 

Previous research shows that organizational age and size have a pro- 
found impact on both organizational mortality and rates of organizational 
change. We introduce a measure of relative size, which makes it possible 
to interpret the effects of size in relation to a reference category, namely, 
the largest organization by year in the population. We use Hannan et al.’s 
(1998a) nonproportionate specification, which allows the effects of age 
and relative size to interact in affecting mortality hazards. 

We include a dynamic specification of the density-dependence model 
where the density terms are interacted with population age to capture the 
different cycles in the history of the industry (Hannan 1997; Hannan et 
al. 1998b; Dobrev 2001). Following Hannan et al. (1995), we also distinguish 
between the effects of system-level (European) and subsystem-level 
(country-specific) density to reveal the multilevel nature of the density-dependence 
process. This multilevel view posits that legitimation can diffuse the image and 
identity of an organizational form across national boundaries and operates 
at a broader level than competition, which is constrained geographically 
within national markets. 

Organizational age has also played an important role as an explanatory 
covariate in previous analyses focused on the effects of change on survival 
and determinants of rates of organizational change. For example, Am- 
burgey et al.’s (1993) study of Finnish newspaper mortality found that 
only old newspapers faced increased hazards when they attempted core 
change. They also show evidence that the occurrence of organizational 
change gets modified by organizational age in two important ways. First, 
the rate of change decreases with organizational age, and second, the 
momentum effect of prior change is more pronounced for younger or- 
ganizations for one type of change, but stronger for older organizations 
for another. By contrast, a study of core change and survival of social 
movement organizations in the United States reports that older organi- 
zations are more likely to change, while the momentum of change declines 
with age (Minkoff 1999). 


RESEARCH DESIGN 


This research uses data on the automobile industries in France, Germany, 
and Great Britain. It considers the full history of the automobile industry, 
starting with 1885 and ending with 1981, the last year of full coverage 
in our most comprehensive source of data. The data come from a study 
that coded histories of automobile manufacturers worldwide from reports 
of automobile historians and collectors.* The most comprehensive infor- 
mation comes from two encyclopedias: Georgano (1982) and Baldwin et 
al. (1987). Because the sources highlight automobiles as technical products, 
the resulting histories of firms pertain to spells of automobile production, 
not to lifetimes of the firms. These sources organize their reports around 
marques, not firms. Entries for marques had to be combined to create 


* Hannan et al. (1995) and Carroll and Hannan (2000) provide details on the design. 
records for firms. The records contain information about spells of automobile 
production by firms but not about their complete lifetimes, which 
sometimes involved operation in another industry before or after a spell 
of automobile production. In some cases, the sources do tell about the 
creation and destruction of the firms. Insofar as the sources permit, the 
research team reconstructed the organizational histories of the firms in 
these industries. 

This coding effort yielded data on 828 French, 373 German, and 995 
British firms. Many were small, short-lived, and obscure and left little 
historical record. However, including information on such obscure or- 
ganizations is crucial to sound inference about the causes of organizational 
mortality. Keeping short-lived firms in the analysis means relying on in- 
complete information. Therefore, we made several assumptions about 
obscure, short-lived firms to allow us to include them in the analysis, as 
we explain below. 


Entry and Exit Events 


Organizations entered automobile production by several paths, as we 
noted above. Three were especially common: (1) a firm is built de novo—it 
has no prior organizational experience at time of entry, (2) a new firm 
results from a merger of automakers or by the division of one automobile 
manufacturer into two or more firms, or (3) a firm enters from another 
industry. For the obscure firms, we know only that they began automobile 
production but not whether they migrated from other industries.* This 
set presumably includes both newly founded firms and entrants from other 
industries. The more complete American data indicate that both types of 
entry were very common. 

Previous research on these populations found that the life chances of 
lateral entrants and firms that arise by merger or fission of automakers 
were better than those of firms that began de novo or whose type of entry 
is unknown. However, using the distinction between lateral entry and 
start by merger or fission did not improve model fits significantly. There- 
fore, as in previous research, we use only one distinction about type of 
entry in this analysis: whether a firm has prior existence of any kind 
(either in the automobile industry or in some other industry). We treat 
firms with unknown entry events as not having existed previously. 

* Individual entrepreneurs surely did come from other industries, and this information 
is sometimes available. However, our unit of observation is the organization, not the 
entrepreneur. For us, the key distinction is whether the organization is new at automobile 
manufacturing. 

For ending events, the most important distinctions concern (1) disbanding, 
(2) exit to another industry, and (3) merger or acquisition. Disbanding 
has an unambiguous meaning: the firm failed as a collective actor. 
Exit to another industry also suggests a lack of success in automobile 
manufacturing. Although merger and acquisition both result in the loss 
of one or more independent collective actors, firms merge and acquire for 
diverse reasons. Sometimes a firm flounders and its owners seek to recover 
some fraction of their investment by selling the firm. In other cases, a 
thriving firm’s competencies command great value from potential ac- 
quirers or merger partners. Because of the ambiguous meaning of mergers 
and acquisitions, we concentrate on disbanding and exit to another in- 
dustry in this analysis (within the limits of the data, discussed next). 

We also do not know exactly what happened to most firms when they 
dropped from the set of producers, especially in the European populations. 
This is invariably the case when spells of automobile production were 
short.* Apparently, automobile historians rarely could reconstruct the de- 
tails about an exit unless a firm had become reasonably well established. 
Knowledge that a certain firm disbanded, was acquired, or left the in- 
dustry usually means that it persisted in the industry long enough that 
its exit event received notice in the press. 

As Hannan et al. (1998a) explain, “unknown exit” could be interpreted 
as an ending event in its own right, governed by its own transition rate. 
But they argue against this idea. Because availability of information on 
the kind of exit depends (strongly) on tenure in the industry for the Eu- 
ropean populations, this analytical strategy would confound the specifi- 
cation of the state space (the origin and destination states) and tenure. 
They note that the historical materials suggest that most exits of unknown 
type were disbandings or exits to other industries. Analysis of various 
alternative treatments of the unknown-exit cases leads Hannan et 
al. (1998a) to conclude that events of unknown type were governed by 
processes more similar to those underlying disbanding and exit. We follow 
the earlier research in treating these two events alike: the dependent 
variable in this analysis is disbanding/exit to another industry, defined to 
include events of unknown type. Firms known to have ended by other 
events (merger, acquisition, nationalization) are treated as (noninforma- 
tively) censored on the right at the times of these events. 


Dating Entries and Exits 


The archival sources contain varying degrees of precision in dating events. 
Sometimes the sources give the exact date; other times they give only the 


* This difference shows up clearly when one calculates integrated hazards of the various 
events as a function of duration (tenure in the industry). The rate of the unknown 
type of exit is much higher over the early years of tenure (up to about seven years) 
month and year or season and year. Most often, they record only the year. 
To make analysis tractable, all of the information about timing was con- 
verted into a common metric; exact dates were represented as decimal 
years. When dates were given in terms of months or seasons, decimal 
years were assigned using judgments of the midpoint of the time unit. 
For example, “early fall” was coded as occurring on October 15 (day 288 
in the calendar), giving a decimal year value of 0.79. Dates given to only 
the year were coded as occurring at the midpoint of the year. 
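
The conversion rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the coding procedure actually used by the research team: the function name `decimal_year`, the `SEASON_MIDPOINTS` lookup, the choice of day 15 as the midpoint of a month, and the fixed 365-day reference calendar are all hypothetical. Only the "early fall" example and the midyear rule come from the text.

```python
from datetime import date

# Hypothetical lookup table. The text gives only one example:
# "early fall" is coded as October 15.
SEASON_MIDPOINTS = {"early fall": (10, 15)}


def decimal_year(year, month=None, day=None, season=None):
    """Convert a date of varying precision to a decimal year."""
    if season is not None:
        month, day = SEASON_MIDPOINTS[season]
    if month is None:
        # Year-only dates are placed at the midpoint of the year.
        return year + 0.5
    if day is None:
        # Assumption: a month-only date is placed at the 15th.
        day = 15
    # Day of the year in a 365-day reference calendar (1901 is not a leap year).
    day_of_year = date(1901, month, day).timetuple().tm_yday
    return year + day_of_year / 365


early_fall = decimal_year(1905, season="early fall")
year_only = decimal_year(1905)
```

With these assumptions, "early fall" of 1905 falls on day 288 of the calendar, and a date known only as 1905 becomes 1905.5.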

What is normally called organizational age in the literature is, with few 
exceptions, actually a measure of duration or tenure in a particular or- 
ganizational population (Carroll and Hannan 2000). Tenures in auto- 
mobile production can be calculated straightforwardly when dates are 
exact or nearly exact. The challenging case involves entry and exit in the 
same year. The data we used impose the assumption that the tenure (in 
years) in the industry for such a case is bounded by 0 and 1. Such cases 
are assigned a tenure of 0.5 (the expected tenure under the assumption 
of a uniform distribution of events over the calendar year). Exits for such 
cases are coded as occurring at (just before) the end of the year. When a 
firm enters in one year and leaves in the next (e.g., a firm is reported to 
have begun production in one calendar year and ended production during 
the following year), we attach the starting time to the middle of the first 
year and the ending time to the midpoint of the next year, giving a com- 
pleted tenure of one year (which is, again, the expected tenure under the 
assumption of a uniform distribution).* These rules generalize to handle 
all of the relevant cases. 
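
A minimal sketch of these tenure rules for firms whose entry and exit are dated only to the year follows. The function name `spell_from_years` is hypothetical, the "just before the end of the year" exit is approximated by the year boundary itself, and the extension to spells longer than two calendar years (midpoint to midpoint) is our reading of how the rules generalize.

```python
def spell_from_years(entry_year, exit_year):
    """Return (start, end, tenure) in decimal years for year-only dates."""
    if exit_year < entry_year:
        raise ValueError("exit precedes entry")
    # Year-only entries are placed at the midpoint of the entry year.
    start = entry_year + 0.5
    if exit_year == entry_year:
        # Entry and exit in the same year: tenure is set to its expected
        # value of 0.5, so the exit lands at the end of that year.
        end = entry_year + 1.0
    else:
        # Otherwise the exit is placed at the midpoint of the exit year.
        end = exit_year + 0.5
    return start, end, end - start


same_year = spell_from_years(1900, 1900)
next_year = spell_from_years(1900, 1901)
```

A firm that enters and exits in 1900 receives a tenure of 0.5, and a firm that enters in 1900 and exits in 1901 receives a completed tenure of one year.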


MEASUREMENT 
Niche Width 


Adapted from the field of bioecology (Hutchinson 1957), organizational 
niche theory (Freeman and Hannan 1983; Hannan and Freeman 1989) 
decouples the construct of the fundamental niche—the multidimensional 
social space in which an organization (or an organizational form) can 
grow or at least sustain itself—from that of the realized niche, a subset 
of the fundamental niche in which an organization can sustain itself in 
the presence of competitors. We follow the lead of Podolny, Stuart, and 
Hannan’s (1996) research on semiconductor manufacturers, which ex- 
presses an organization’s fundamental niche as a function of its position 


* The general problem concerning data of this kind is known as time aggregation bias. 
These procedures are consistent with Petersen’s (1991) recommendation for dealing 
with this problem. 
TABLE 1 
MEASURES OF NICHE WIDTH, MARKET POSITION, AND CHANGE IN 
MARKET POSITION 

Variable                                        Definition 

$E^{max}_{it}$                                  Firm i's largest engine capacity at t 
$E^{min}_{it}$                                  Firm i's smallest engine capacity at t 
$E^{max}_{4t}$                                  The maximum engine capacity among the four largest firms 
$E^{min}_{4t}$                                  The minimum engine capacity among the four largest firms 
Firm's niche width ($W_{it}$)                   $E^{max}_{it} - E^{min}_{it}$ 
Midpoint of a firm's niche ($M_{it}$)           $E^{min}_{it} + (E^{max}_{it} - E^{min}_{it})/2$ 
Change in niche width ($\Delta W_{it} = 1$)     $W_{it}/W_{i,t-1} \le 0.9$ or $W_{it}/W_{i,t-1} \ge 1.1$ 
  Expansion                                     $W_{it}/W_{i,t-1} \ge 1.1$ 
  Contraction                                   $W_{it}/W_{i,t-1} \le 0.9$ 
Width of market center                          $E^{max}_{4t} - E^{min}_{4t}$ 
Midpoint of market center ($C_t$)               $E^{min}_{4t} + (E^{max}_{4t} - E^{min}_{4t})/2$ 
Position: 
  Distance above market center                  $\max(M_{it} - C_t, 0)$ 
  Distance below market center                  $\max(C_t - M_{it}, 0)$ 
Change in position: 
  Relative ($\Delta P_{it}$)                    $|M_{it} - C_t| - |M_{i,t-1} - C_{t-1}|$ 
  Any change in absolute position (dummy = 1)   $M_{it} \ne M_{i,t-1}$ 


in a technological space. Specifically, we measure the niche width of an 
automobile producer in terms of the spread of engine capacity over all 
models that a firm produces measured in cubic centimeters (cc) in a given 
year.* (Operational measurements of niche-related measures are presented 
in table 1.) Firms with a single model or more than one model with the 
same engine have a minimal niche width, which we set to 1 cc to avoid 
having to speak of niches of zero width. A firm that produces one or more 


* When such data were not available but we could find data on horsepower, we 
predicted capacity in units of cc using the coefficients from regressions of maximum 
(minimum) capacity on maximum (minimum) horsepower with controls for historical 
periods and organizational properties. 
TABLE 2 
DESCRIPTIVE STATISTICS FOR NICHE WIDTH AND NICHE-OVERLAP DENSITY 

                              France    Germany    Great Britain 

Niche width (in 1,000 cc) 
  Minimum                     .001      .001       .001 
  Maximum                     11.37     20.0       14.3 
  Mean                        1.87      1.85       1.14 
  SD                          2.06      2.24       1.46 
Niche-overlap density 
  Min                         0         0          0 
  Max                         175       106        218 
  Mean                        70.1      28.0       53.4 
  SD                          49.7      21.8       47.2 

models with a very small engine and one or more with a very large engine 
has a broad niche by this construction.* 
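
As a rough illustration of this construction, the niche-width measure can be computed from a list of engine capacities as follows. The function name `niche_width` and the example capacities are hypothetical. Only the spread definition and the 1 cc floor come from the text.

```python
def niche_width(engine_capacities_cc):
    """Spread of engine capacity (in cc) over a firm's models in one year."""
    if not engine_capacities_cc:
        raise ValueError("at least one model is required")
    spread = max(engine_capacities_cc) - min(engine_capacities_cc)
    # Single-model firms (or firms whose models share one engine) get the
    # minimal width of 1 cc rather than a width of zero.
    return max(spread, 1)


single_model = niche_width([1700])
two_same_engine = niche_width([1700, 1700])
broad = niche_width([950, 2400, 9100])
```

In this sketch a firm offering engines from 950 cc to 9,100 cc has a niche width of 8,150 cc, while any firm with a single engine size has a width of 1 cc.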

While the sources frequently provide information on the smallest and 
the largest engine capacity models, they rarely list all of the models that 
compose the firm’s set of offerings. This precluded us from constructing 
a measure of number of models produced. However, our study of the data 
sources leads us to conclude that firms with wide niches that produced 
very few models (e.g., producing only two models: a single high-engine- 
capacity model and a single low-engine-capacity model) have been prac- 
tically nonexistent throughout the history of the industry. In fact, when 
evidence is available, we find that the most intensely exploited techno- 
logical space falls at or near the center of the firm’s niche. This justifies 
our assumption that there is little, if any, unobserved heterogeneity related 
to product range within firms with similar niche width. Table 2 provides 
descriptive statistics on niche width for the three populations. 

Measuring organizational niches along a single dimension— 
technological scope, in our case—has its limitations. If our measures of 
niche width and overlap density capture only market position and com- 
petitive dynamics in technological space, they would be less than ideal 
for testing the theoretical propositions stated in the previous section. How- 
ever, we believe that our decision to define organizational niches along a 
technology dimension has compensating advantages. Most important, it 
provides us with a means of making meaningful comparisons of firms 
that have existed in disparate historical periods, thereby allowing us to 
analyze the evolution of the industry from its origins. It makes it possible 
to draw as complete a picture as possible (given the severe paucity of 
information for many of the entries in our data sets) of the overall market 


* Arguably, two firms with similar niche widths can differ from one another by the 
number of models they produce each year. 
niches of the automobile firms whose fates we analyze. In other words, 
we think that technological niches in this industry are informative about 
market position. We think this is so because, upon examination, the choice 
of engine capacity made by automobile producers over the years proves 
to be revealing not only of the ranges of their technological offerings but 
also of these firms’ strategies in product marketing and competitive pric- 
ing, customer segment targeting, supply-chain management, and inno- 
vation. We illustrate this point below with examples of well-known pro- 
ducers from each of the three European industries we investigate. 

In 1922, the renowned British automobile manufacturer Rolls-Royce 
made a clear change in direction toward expanding its customer base by 
offering some more affordable models with much smaller engines. In so 
doing, it also appealed to a customer segment with a different social 
disposition. The introduction of a smaller-engine model, nicknamed the 
Baby Rolls, complemented the powerful (big engine) and also very prof- 
itable model, known as the Silver Ghost. Interpreting the significance of 
this strategic decision, the industry historians Baldwin et al. (1987, p. 416) 
write, 


Although frowned on by the purists, as such innovations always are, [the 
Baby Rolls] brought Rolls-Royce motoring to a new market, not only those 
who could not afford a Silver Ghost but those who did not want an enor- 
mous car, or those who preferred to drive themselves. 


The obvious correspondence between difference in price range and 
difference in model engine capacity can be seen in the case of the legendary 
German automaker Mercedes-Benz. In the depression years of the 1930s, 
the company introduced the smallest Mercedes-Benz ever made until that 
time, a car with a 1.7 liter engine. Its low starting price of 4,150 marks 
made it the company’s best-selling model of the interwar era; more than 
85,000 such cars were sold in less than a decade. At the opposite end of 
the scale, we find the 7.6 liter engine model, which was 


aptly nicknamed “der Grosser Mercedes.” . . . Some of the 177 first series 
of Grossers were sold to anyone who could afford them (at 38,000 to 47,000 
marks), but the 88 second series cars went entirely to the Nazi hierarchy 
or to foreign rulers such as Joseph Stalin, Finland’s Field Marshall 
von Mannerheim, Spain’s General Franco, and Antonio Salazar of Portugal 
(Baldwin et al. 1987, p. 122) 


The technological range of products offered by an automobile producer 
also speaks to organizational designs and internal arrangements of the 
companies, as well as the resources available to them. For example, in 
the post-World War I period in France, Renault’s technological niche 
ranged from a 9.1 liter engine (in an “elephantine” car), on the high end, 
to the modest 0.95 liter engine of the Model KJ, on the low end. This 
extremely broad scope of offerings reflected an expansion of production 
facilities that allowed the company to make the supply-chain process 
internal and also clearly provided a resource advantage. According to 
Baldwin et al. (1987, p. 404), 


As a result of wartime expansion the Renault works . . . in 1919 consisted 
of 87 built up acres, including a steel foundry, . . . three iron foundries, one 
aluminum and one brass foundry, general machining shops and assembly 
shops for all components including bodywork. . . . In 1919 Renault acquired 
its own steel works . . . the most modern steel plant in Europe. 


We think these examples (only a few of the many that we came across 
during the coding process) justify our reasoning in electing to focus on 
technological niches of organizations, a choice that allows us to interpret 
not only the range of product offerings, but also the overall market strategy 
and position of the firms in our data. That said, we also suspect that the 
dimensionality of niches in this industry has increased over time, as issues 
such as safety, durability, fuel economy, and so forth have come into 
prominence. If this surmise is correct, then our measure of the technical 
niche does a better job of characterizing firms’ complete niches in the 
earlier portion of the industry history than in more recent times. This 
reasoning supplies another, slightly different, interpretation of the rela- 
tionship proposed in hypothesis 3, which states that the effect of (technical) 
niche-overlap density on mortality rates declines with industry age. By 
this interpretation, the decline is due to the increasing dimensionality of 
niches with industry age. 


Competitive Intensity 


We measure competitive intensity as proportional to niche-overlap density, 
the number of organizations present in the focal firm’s niche. Table 2 
provides descriptive statistics on niche-overlap density for the three pop- 
ulations. Notice that the ranges are extremely broad. Each population 
contains firms whose niches do not intersect any others, and they also 
contain firms whose niches overlap with hundreds of other firms. 
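
One way to make the count concrete is sketched below. It assumes that each niche is a closed interval of engine capacity and that a competitor is "present in the focal firm's niche" whenever the two intervals intersect. That reading, the function name `niche_overlap_density`, and the example intervals are assumptions made for illustration only.

```python
def niche_overlap_density(focal, others):
    """Count competitors whose (min_cc, max_cc) niche intersects the focal niche."""
    focal_low, focal_high = focal
    return sum(
        1
        for other_low, other_high in others
        # Two closed intervals intersect when each starts before the other ends.
        if other_low <= focal_high and focal_low <= other_high
    )


competitors = [(500, 900), (1500, 1500), (1900, 3000), (2500, 4000)]
generalist = niche_overlap_density((1000, 2000), competitors)
extreme_specialist = niche_overlap_density((5000, 5000), competitors)
```

Here the firm spanning 1,000 to 2,000 cc overlaps with two competitors, while the single-engine firm at 5,000 cc overlaps with none.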

Figures 1-2 show the joint distribution of niche width and overlap 
density for two time slices, 1900 and 1925, for the French population. 
(The pattern for the British and German populations is similar.) Notice 
that only extreme specialists (those whose niches are very close to zero) 
do not experience high niche overlap. Movement away from pure spe- 
cialism causes overlaps to increase very sharply. 

FIG. 1.—Niche width and niche-overlap density in the French population of automobile 
manufacturers in 1900 


Position Relative to the Market Center 


Recall that we want to address resource-partitioning theory’s claims about 
competition in the center of the market. Ideally, we would like to use 
information on the distribution of consumer tastes to identify the market 
center. Lacking such information, we use information on the locations of 
the four largest firms in the (national) market, reasoning that large size 
can be gained in the market center. We define the range of the center as 
the range between the lowest minimum and highest maximum engine 
capacity among the four largest firms and the market center as the midpoint 
of this range. 

We measure a firm’s position in the market in terms of its distance 
from this measure of the market center. We distinguish between position 
on the high end (i.e., producing luxury or sports model cars with high 
engine capacity) and position on the low end (i.e., producing economy 
models with low engine capacity). Both measures are set to zero for cases 
that fall on the other side of the market center from the direction being 
coded. 
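
The construction of the market center and the two distance measures can be sketched as follows. This is a minimal illustration under stated assumptions: niches are (min, max) engine-capacity intervals in cc, a firm's position is taken to be the midpoint of its own niche, and all names and numbers are hypothetical.

```python
from typing import List, Tuple

Niche = Tuple[float, float]  # assumed (min_cc, max_cc) engine-capacity interval


def market_center(largest_niches: List[Niche]) -> float:
    """Midpoint of the range spanned by the niches of the largest firms."""
    low = min(lo for lo, _ in largest_niches)
    high = max(hi for _, hi in largest_niches)
    return (low + high) / 2.0


def position_measures(niche: Niche, center: float) -> Tuple[float, float]:
    """Return (distance above center, distance below center).

    The measure for the side on which the firm does not fall is set to zero.
    """
    niche_center = (niche[0] + niche[1]) / 2.0
    distance = niche_center - center
    above = max(distance, 0.0)
    below = max(-distance, 0.0)
    return above, below


# Hypothetical niches of the four largest firms in one year.
four_largest = [(800, 1500), (1000, 2200), (1100, 3000), (900, 2000)]
center = market_center(four_largest)  # range 800-3000 cc, midpoint 1900 cc
luxury = position_measures((2500, 3500), center)
economy = position_measures((600, 1000), center)
```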

Fig. 2.—Niche width and niche-overlap density in the French population of automobile
manufacturers in 1925


Changes in Niche Width and in Position 


Our measures of organizational changes reflect the important substantive 
dichotomy of content vs. process that underscores our theoretical argu- 
ments. We constructed a dummy variable for niche-width change that 
equals one if the firm’s niche expanded or contracted by at least 10% 
relative to the preceding year. We think an increase or decrease in niche 
width by at least one-tenth implies a meaningful internal reorganization, 
sufficient to trigger a change in the organizational core. We agree with 
Podolny et al. (1996, p. 663) that “an organization might gradually adjust 
its underlying technological competencies through ‘local search,’ but significant
change in position . . . demands fundamental alteration of the
organization itself.” 
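
A minimal sketch of the niche-width change dummy described above. The function name and the example widths are hypothetical; the 10% threshold comes from the text.

```python
def niche_width_change(prev_width: float, curr_width: float,
                       threshold: float = 0.10) -> int:
    """Return 1 if niche width expanded or contracted by at least `threshold`
    relative to the preceding year, and 0 otherwise."""
    if prev_width <= 0:
        raise ValueError("previous-year niche width must be positive")
    relative_change = abs(curr_width - prev_width) / prev_width
    return int(relative_change >= threshold)


# Hypothetical yearly niche widths (in cc) for one firm.
widths = [1000, 1050, 1200, 1190, 900]
changes = [niche_width_change(a, b) for a, b in zip(widths, widths[1:])]
```

Because the dummy is defined on year-to-year comparisons, a firm observed for n years contributes n - 1 values.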

We use two measures of change in position. Relative change in position 
is the difference in the distance of the firm’s niche center from the market 
center between consecutive years. Because relative change in position 
might occur without change by the organization (i.e., when only the mar- 
ket dimensions are shifting), we also include a dummy variable for ab- 
solute change that captures instances of change undertaken by the or- 
ganization, regardless of whether such change modifies the firm’s position 

TABLE 3
FREQUENCY OF CHANGES IN NICHE WIDTH AND POSITION

                                                 France         Germany      Great Britain
                                                 N      %       N      %       N       %
Spells with absolute position change .......    448    8.7     382   16.8    1,478   26.5
Spells with niche width change .............    602   11.6     273   12.0    1,057   18.9
Spells with both changes ...................    306    5.9     193    8.5      833   14.9
Firms experiencing absolute position change     227   27.7     127   34.0      400   40.5
Firms experiencing niche width change ......    290   35.4     121   32.4      362   36.7
Firms experiencing both changes ............    184   22.5      95   25.5      281   29.1


relative to the market center. The frequencies of changes in niche width
and absolute position are presented in table 3.


Organizational Size: Absolute and Relative 


Research on age-varying size and life chances has conceptualized and 
measured organizational size in two ways: capacity and scale of operations 
(Barron, West, and Hannan 1994; Carroll and Hannan 2000). We follow 
the previous research on these populations and concentrate on scale of 
operations, and we measure it in terms of a firm’s annual production of 
automobiles. This measure makes good sense in the context of the industry 
and its history. Even analyses of the current situation focus on market 
share, measured in vehicles sold. Moreover, this measure is available more 
regularly than accounting measures of performance, and it is likely to be 
measured more reliably in most cases. 

An exhaustive search failed to uncover precise information on annual 
production for most short-lived firms. Coverage is worst for France and 
Germany, with precise information lacking for 81% of the firms in each 
country’s industry; these account for 68% of the firm-years in France and
65% in Germany. Information on size is most complete for the British
industry, where precise information is lacking for 62% of the firms; these
firms contribute 43% of the firm-years of observation.

Hannan et al. (1998a) argue that information about a firm’s size indicates
clearly that its scale was very small. The counts often come from
new vehicle registrations, which were presumably exhaustive, and from 
counts of surviving automobiles. Often the sources indicate that the firm 
produced only “a few” automobiles. In others, no mention is made of the
scale of production. In both cases, a firm was coded as “small.” Sometimes,
the sources suggest that production levels were substantial, but they do
not provide exact counts. For instance, this is the case when there are
gaps in the yearly series at the end of a firm’s history and previously 
reported production totals were large. In such cases, our research group 
distinguished “middle size” (the firm’s production level in a year was much 
higher than “small” but considerably smaller than the scale of the largest 
firms in the industry at the time). In a few cases (mostly occurring during 
the interwar years), it appeared that a firm’s production level approached 
those of the industry’s biggest firms; size in such cases was coded as 
“large.” 

As in the previous research, we impute absolute sizes to firms lacking
precise information, using the size distributions of firms whose production
levels are observed to assign sizes for the three categories. When
a firm’s yearly observation was coded as “small” and no more precise 
information was available, it was assigned a value from a uniform dis- 
tribution ranging from zero to the first quartile of the size distribution 
for the historical period. Firm-years coded as medium and large were 
assigned the second and third quartile values, respectively. 
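
The imputation rule can be sketched in Python as follows. This is an illustration under stated assumptions, not the original procedure's code: the observed sizes passed in are taken to be the precisely measured sizes for the relevant historical period, quartiles are computed with the default method of the standard library, and the function name, seed, and data are hypothetical.

```python
import random
import statistics
from typing import Sequence


def impute_size(size_class: str, observed_sizes: Sequence[float],
                rng: random.Random) -> float:
    """Impute annual production for a firm-year coded only by size class.

    small  -> draw from a uniform distribution on [0, first quartile]
    medium -> second quartile (median) of the observed sizes
    large  -> third quartile of the observed sizes
    """
    q1, q2, q3 = statistics.quantiles(observed_sizes, n=4)
    if size_class == "small":
        return rng.uniform(0.0, q1)
    if size_class == "medium":
        return q2
    if size_class == "large":
        return q3
    raise ValueError(f"unknown size class: {size_class!r}")


# Hypothetical observed annual production figures for one historical period.
observed = [10, 50, 200, 1000, 5000, 20000, 80000]
rng = random.Random(0)  # fixed seed so the draw is reproducible
small_value = impute_size("small", observed, rng)
medium_value = impute_size("medium", observed, rng)
large_value = impute_size("large", observed, rng)
```

Only the "small" class involves a random draw, so repeated runs with a different seed change those firm-years alone.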

Hannan et al. (1998a) analyzed the resulting data in two ways. In one, 
they merely distinguished the three size classes: small, medium, and 
large.” In the second approach, they used the natural logarithm of the 
value based on the precise measures and the imputed ones. There was 
little substantive difference in the results of the two kinds of analysis: 
both indicated that the rate of disbanding/exit is very high for the smallest 
firms and declines as a function of size. We follow the earlier research in 
using the constructed metric measure of size. We measured the relative size
of a firm as the ratio of each firm’s size to the size of the largest firm in 
the national population at the time. 


CONTROLS 


We use a common population-level clock in the various analyses: industry 
age. The clock for each country begins at zero in 1885, the time of initiation 
of the French population of automobile manufacturing firms.” This choice 


4 For this analysis, they coded firm-years with precise information as small if the level 
was below the period’s first quartile, medium if it fell in the interquartile range, and 
large if it exceeded the third quartile.

2 This choice probably has little consequence for the empirical analyses, because the 
dates of initiation of these national automobile industries differ by a decade at most.
reflects the view that processes of legitimation likely spill over national
boundaries (Hannan et al. 1995). 

We control for changes in institutional conditions and industry structure 
with period effects, and we chose to follow the prior research on these 
populations in defining historical periods.** The analyses reported below 
also contain effects of economic depression, the level of the gross domestic
product (GDP) adjusted for inflation (taken from Maddison [1991]), and
the period effects. We excluded the years of the two world wars from the
analysis because the production of motor vehicles for private use was 
stopped in all of these countries.** 


STOCHASTIC MODELS AND ESTIMATION 
The Hazard of Mortality 


Each organization in the population has some chance of disbanding/ 
exiting at any time during its tenure. The occurrence of such events is 
controlled by an instantaneous hazard. Because we lack information on 
the length of spells of prior existence for those firms that entered auto- 
mobile production from other industries, the only feasible choice of or- 
ganization-level clock, given our design, is tenure—the duration of the 
spell of automobile production. The half-life (considering all kinds of 
ending events) of an automobile producer has ranged from 1.8 years in 
the British and German populations to 2.6 years in the French. 

Given a random variable, $Y(u)$, that records whether a firm has disbanded
or exited at tenure $u$ (in which case $Y(u) = 1$) or has not
($Y(u) = 0$), the disbanding/exit hazard can be defined as:

$$\mu(u) = \lim_{\delta \downarrow 0} \frac{\Pr[Y(u + \delta) = 1 \mid Y(u) = 0]}{\delta}, \qquad u \ge 0.$$

(If a firm exits the population by some other event [e.g., acquisition], we
treat the spell as [noninformatively] censored at the observed event time.)

We represent the effects of changes in tenure with a piecewise-exponential
specification. With the breakpoints denoted as
$0 \le \tau_1 \le \tau_2 \le \dots \le \tau_q$ and $\tau_{q+1} = \infty$, there are
$q$ periods: $I_p = \{u \mid \tau_p \le u < \tau_{p+1}\}$, $p = 1, \dots, q$. We
estimate models with this general form with the method of maximum

4 Specifically, the breakpoints for Britain are those used by Torres (1995): 1915, 1919, 
1939, 1946, and 1968. The breakpoints for France and Germany are those used by 
Hannan et al. (1995): 1915, 1919, 1930, 1939, 1950, and 1970. 

“ Firms that did not resume production of automobiles after a war (often because their 
factories were destroyed) are treated as right-censored at the end of the last prewar 
year.
likelihood as implemented in TDA 6.2 (Rohwer and Pötter 1998). As in
the prior research on these populations, we break the duration scale (in 
years) at: 0.5, 1.0, 3.0, and 7.0. With this choice, the first segment 
(0,0.5] includes dated events that occur within the first six months in the 
industry as well as cases that enter and exit at unknown times within the 
same year (as discussed above). The second segment (0.5,1] includes dated 
events that occur within the second six months and cases that enter at 
unknown time in one year and exit at unknown time in the next year. 
The third and fourth segments are defined similarly. The final segment 
begins at seven years and is open on the right. 
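
The segmentation of the duration scale can be illustrated with a short Python sketch. It uses the breakpoints stated above and treats segments as right-closed intervals, as in this passage; the segment-specific rates in the example are hypothetical values chosen for illustration, not estimates.

```python
import math
from bisect import bisect_left
from typing import Sequence

BREAKPOINTS = [0.5, 1.0, 3.0, 7.0]  # tenure in years, as stated in the text


def segment_index(u: float) -> int:
    """Index of the tenure segment containing u.

    Segments: (0, 0.5], (0.5, 1], (1, 3], (3, 7], and (7, infinity).
    """
    return bisect_left(BREAKPOINTS, u)


def cumulative_hazard(u: float, rates: Sequence[float]) -> float:
    """Integrate a piecewise-constant hazard from 0 to u."""
    edges = [0.0] + BREAKPOINTS + [math.inf]
    total = 0.0
    for rate, start, end in zip(rates, edges[:-1], edges[1:]):
        if u <= start:
            break
        total += rate * (min(u, end) - start)
    return total


def survival(u: float, rates: Sequence[float]) -> float:
    """Probability of surviving beyond tenure u under the piecewise model."""
    return math.exp(-cumulative_hazard(u, rates))


# Hypothetical hazard rates per year, one for each of the five segments.
example_rates = [0.4, 0.3, 0.2, 0.1, 0.05]
two_year_survival = survival(2.0, example_rates)
```

Within each segment the hazard is constant, so covariate effects shift the level of each piece rather than its shape.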

Our models specify that the disbanding/exit hazard is a function of
tenure in the industry ($u$), industry age ($t$), a vector of variables
($\mathbf{s}_{it}$) pertaining to size and prior experience (absolute and
relative size and dummies for very small size and prior organizational
existence), a vector of other measured covariates ($\mathbf{x}_{it}$),
including GDP, depression year, and density at founding, and a function of
contemporaneous density and density-squared, as well as the interactions of
these density effects and industry age, denoted by $\varphi(\cdot)$. The key
functions for assessing the arguments made above relate to niche width,
market position, and niche-overlap density, denoted by $\psi(\cdot)$, and a
function of changes in niche width and various measures of position, denoted
by $\omega(\cdot)$. We estimate members of the general class of models of
disbanding/exit with the form:

$$\mu_i(u, t) = \exp(m_p)\,\exp(\mathbf{s}_{it}'\boldsymbol{\alpha}_p + \mathbf{x}_{it}'\boldsymbol{\beta}) \cdot \varphi(\cdot) \cdot \psi(W, NO, P) \cdot \omega(\Delta W, \Delta P), \qquad u \in I_p. \tag{1}$$


Here $m_p$ denotes a set of duration-specific effects; the log-linear link imposes
the constraint that the baseline hazards be nonnegative.

This kind of specification allows the hazard of disbanding/exiting to be 
a nonproportional function of tenure, size, and prior experience as in 
Hannan et al. (1998a). These specifications allow tenure dependence to
be negative for some size classes and positive for others. Adding effects 
of niche-related measures has hardly any impact on the effects of tenure 
and size. Therefore, the patterns summarized by Hannan et al. (1998a)
continue to hold. Because this issue is not central to the present analysis, 
we simply report the relevant estimates without commenting further. 


14 The earlier research using these data showed that the main results are fairly insen- 
sitive to these choices, presumably because so few firms experience tenures of greater 
than seven years. 
Changes in Position

For the other class of outcomes, change in niche width and market po- 
sition, we have repeated yearly observations on firms (where the number 
of observations per firm is one less than the number of years of tenure 
in the industry because we use year-to-year changes). In other words, the 
data structure is an unbalanced panel. Although much recent organizational
research analyzes such data with fixed-effect estimators (which use
only within-firm variation), we take advantage of a new class of robust 
estimators to analyze both between-firm and within-firm variation. Spe- 
cifically, we use the method of generalized estimating equations (GEE), 
developed by Liang and Zeger (1986; see also Zeger and Liang 1986). 
This approach generalizes quasi-likelihood estimation to the panel con- 
text. Like quasi-likelihood, GEE requires specification of only the first 
and second moments of the distribution of the outcome, rather than the 
full distribution as is required for maximum likelihood.” Under mild 
regularity conditions, GEE estimators are consistent and asymptotically 
normal. 

The outcome in each firm-year is a dummy variable that equals 1 if
the firm changed the relevant dimension of its niche from the previous
year and 0 otherwise. For the $i$th firm, we have $n_i$ observations; the vector
of outcomes can be written as $\mathbf{y}_i = (y_{i1}, y_{i2}, \dots, y_{in_i})'$.
The covariates vary over firms and (in some cases) over time for given firms:
$\mathbf{X} = (\mathbf{X}_1, \mathbf{X}_2, \dots, \mathbf{X}_N)$, where
$\mathbf{X}_i$ is $(n_i \times k)$ and the number of covariates is
$k - 1$. As in the usual framework for generalized linear models, one
chooses a distribution for y and a link function to connect the outcome 
to the covariates. In this analysis, we obtain a (generalized) logit regression 
by choosing the binomial distribution and the logit link. 

We expect that the disturbance process will exhibit autocorrelation of 
the usual panel type: observations for the same firm will tend to be correlated
due to permanent and gradually changing unobserved firm properties.
But we assume that observations are uncorrelated for different
firms. GEE requires a specification of a “working” correlation matrix for 
the within-organization correlation of observations. The implementation 
we used—the XTGEE routine in version 6.0 of STATA (StataCorp 
1999)—allows a menu of choices for this matrix. We experimented with 
several, including the exchangeable correlation structure familiar from 
the standard random-effects specification for panel data, as well as first- 
order and second-order serial autocorrelation. We found that there is 
negligible autocorrelation for the French and German data. However, 


Barron (1992) discusses sociological applications of QL estimators (in the context of
analysis of counts).
there is mild positive autocorrelation in the British data. For the latter,
the estimates of key substantive effects were insensitive to specification 
of the working correlation matrix. Inspection of estimates suggested to us 
that the “exchangeable” version fits well. For consistency, we used this 
specification for all three data sets. That is, the working correlation matrix 
for each organization’s observations has the following form: 


$$R(\rho)_{s,t} = \begin{cases} 1 & \text{if } s = t \\ \rho & \text{otherwise.} \end{cases}$$

Finally, we based our testing on so-called robust standard errors, using 
the “sandwich” estimator developed by Huber (1967) and White (1982). 
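
To make the exchangeable structure concrete, the following minimal Python sketch builds the working correlation matrix for a single firm's observations. The helper name is hypothetical. The range check reflects the standard condition under which an exchangeable matrix is positive definite; it is included as a safeguard and is not part of the estimation routine cited above.

```python
from typing import List


def exchangeable_correlation(n_obs: int, rho: float) -> List[List[float]]:
    """Working correlation matrix with 1 on the diagonal and rho elsewhere."""
    if n_obs < 1:
        raise ValueError("n_obs must be at least 1")
    # Positive definiteness requires -1/(n_obs - 1) < rho < 1 when n_obs > 1.
    if n_obs > 1 and not (-1.0 / (n_obs - 1) < rho < 1.0):
        raise ValueError("rho outside the range that yields a valid matrix")
    return [
        [1.0 if s == t else rho for t in range(n_obs)]
        for s in range(n_obs)
    ]


# A firm with three yearly observations and a hypothetical rho of 0.2.
R = exchangeable_correlation(3, 0.2)
```

Each firm gets its own block of this form, sized by its number of observations, with the same rho shared across firms.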


RESULTS 
Rates of Disbanding/Exit 


Estimates of our basic specifications appear in table 4. Hypothesis 1 re- 
ceives strong support: niche width has a negative effect on disbanding/ 
exit hazards in all three populations; the effect is statistically significant 
for Britain and France. Hypothesis 2 also receives strong support: overlap 
density has a significant positive effect on the hazard in all three countries. 

According to these results, the worst case (from the perspective of life 
chances) is having a narrow niche that overlaps substantially with those 
of other organizations in the population. In other words, the highest haz- 
ards are for specialists in a market center. This pattern accords well with 
the central image of resource-partitioning theory. 

Consider the situation from the perspective of a specialist on the pe- 
riphery, one whose niche does not overlap with that of any other organ- 
ization. Such an organization does not suffer the deleterious consequences 
of competition within its niche, nor does it reap any of the gains of a 
broad niche. Can organizations improve their life chances by broadening 
their niches? The answer to this question depends upon the distribution 
of organizations in the market, on the strength of the effects of niche 
width and niche-overlap density, and the possibility that change per se 
can elevate the hazard of disbanding/exit. 

We can explore the first part of the answer by examining figures 1 and 
2. Recall that movement away from pure specialism makes the number 
of overlaps increase sharply. So the specialists face a genuine dilemma. 

We can get a sense of the “content” effect of change for a pure specialist 
by considering the multiplier of the hazard at different levels of specialism 
and overlap density. We continue to focus on the French population. In 
table 4, we see that this multiplier is exp(−0.06 · W/1,000 + 0.005 · NO).

TABLE 4
ML ESTIMATES OF EFFECTS OF NICHE WIDTH AND NICHE-OVERLAP DENSITY ON THE
DISBANDING/EXIT HAZARDS OF EUROPEAN AUTOMOBILE MANUFACTURERS, 1885-1981

                                          France               Germany            Great Britain
Tenure in the industry:
  u < 0.5 .........................   −5.40  (−3.81)       −6.77  (−5.15)       −3.81  (−3.63)
  0.5 ≤ u < 1 .....................   −5.35  (−3.77)       −6.41  (−4.90)       −3.42  (−3.26)
  1 ≤ u < 3 .......................   −6.04  (−4.26)       −7.05  (−5.38)       −4.15  (−3.98)
  3 ≤ u < 7 .......................   −7.81  (−5.91)      −14.93  (−6.19)       −5.51  (−5.47)
  u ≥ 7 ...........................   −8.24  (−6.22)      −15.13  (−6.32)       −5.87  (−5.83)
ln(GDP) ...........................    1.16   (2.39)         .95   (1.28)        −.02   (−.04)
Depression year ...................     .62   (3.02)         .80   (2.03)        −.12   (−.41)
Prior existence:
  u < 3 ...........................    −.70  (−4.88)        −.83  (−4.74)        −.58  (−4.56)
  u ≥ 3 ...........................    −.20  (illegible)    −.59  (−2.45)        −.25  (−1.83)
ln(relative size):
  u < 3 ...........................   −1.22  (−2.04)       −1.63  (−2.99)        −.98  (−2.12)
  u ≥ 3 ...........................   −1.98  (−3.62)       −5.01  (−4.82)       −1.49  (−3.34)
ln(size) ..........................    −.04   (−.87)         .05    (.77)        −.04  (−1.38)
Size < 50 .........................     .62   (3.98)         .12    (.61)         .67   (4.24)
Density at entry ..................     .01   (1.54)        .005    (.85)        .001    (.63)
European density (scale factors illegible):
  Main effect .....................  (illegible) (.36)      −.01  (−1.09)        −.01  (−3.07)
  × industry age ..................     .03   (1.01)         .08   (2.05)         .05   (2.28)
  × industry age squared ..........   −.003   (−.58)        −.01  (−1.90)       −.005  (−1.32)
Country density squared (scale factors illegible):
  Main effect .....................     .22   (1.85)        3.35   (2.82)         .12   (1.27)
  × industry age ..................   −.003   (−.42)        −.26  (−3.57)        −.01  (−1.07)
  × industry age squared ..........    −.01   (−.07)        4.29   (3.83)         .10    (.85)
Niche width (W × 10⁻³) ............    −.06  (−1.92)        −.04   (−.74)        −.22  (−4.72)
Niche-overlap density (NO) ........    .005   (3.09)         .01   (3.68)         .01   (7.15)
N of spells .......................   5,171                2,272                5,582
N of events .......................     718                  309                  810
Log likelihood ....................  −1,732.0              −642.4             −1,860.9

Note: Figures in parentheses are t-statistics; u denotes tenure in the industry in years. All
specifications also contain historical period effects.


This multiplier is 1.0 for a pure specialist with no overlap.” For a pure 
specialist that is unlucky enough to find itself in the center of the market, 
NO might plausibly equal 100, giving it a multiplier of 1.65, meaning that 
its mortality hazard is 65% higher than that of the specialist in an isolated 
position on the periphery. The multipliers for center specialists in this 
scenario are 2.7 for Germany and Britain (the multipliers for peripheral 


4 Since we added one to all niche widths, the minimum value is actually 0.001, and 
the multiplier for such a case is 1.001. 
specialists in all cases equal 1.0). Clearly specialists cannot persist in
crowded market centers. 
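
The multipliers quoted above can be reproduced with a few lines of Python. This is a sketch that plugs in the rounded niche-width and niche-overlap coefficients reported in table 4; because the published coefficients are rounded, the results are approximate. The dictionary and function names are hypothetical.

```python
import math

# Rounded estimates from table 4: (niche width per 1,000 cc, niche-overlap density).
COEFFICIENTS = {
    "France": (-0.06, 0.005),
    "Germany": (-0.04, 0.01),
    "Great Britain": (-0.22, 0.01),
}


def hazard_multiplier(country: str, width_cc: float, overlap: float) -> float:
    """Multiplier of the disbanding/exit hazard implied by niche width (in cc)
    and niche-overlap density, relative to a zero-width firm with no overlaps."""
    b_width, b_overlap = COEFFICIENTS[country]
    return math.exp(b_width * width_cc / 1000.0 + b_overlap * overlap)


# Pure specialist (width of about zero) facing 100 overlapping competitors.
center_specialists = {
    country: round(hazard_multiplier(country, 0.0, 100.0), 2)
    for country in COEFFICIENTS
}
# Pure specialist on the periphery: no overlaps, multiplier of 1.0.
peripheral = hazard_multiplier("France", 0.0, 0.0)
```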

Less evident from the parameter estimates, the hazard for most gen- 
eralists also exceeds the hazard for a peripheral specialist when the center 
is crowded. For instance, if a generalist experiences NO = 100, then its 
hazard of mortality exceeds the hazard for a peripheral specialist unless 
its niche is extremely wide, greater than 8,333 cc, which is very far out 
in the tail of the distribution (table 2). The levels of niche width at which
generalism is superior to specialism for the same circumstances are 25,000
cc for Germany (far outside the observed range) and 4,500 cc for Britain 
(well below the maximum but four times the mean). Suppose, in contrast, 
that concentration among generalists has taken place, that all specialists 
have been pushed out of the center, and that only three generalists remain 
in the center. Consider a remaining generalist with W = 5,000 cc. Since 
it overlaps with only two other firms, its multiplier is exp(−.06 · 5 +
.005 · 2) = .75, which means that its hazard is 25% below that of an
isolated specialist. These results too confirm the central intuition of
resource-partitioning theory: it is reduced crowding in the center
(concentration), not simply generalism, that improves the life chances of
generalists.
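
The break-even niche widths and the generalist example follow directly from the table 4 coefficients. The sketch below solves b_width · W/1,000 + b_overlap · NO = 0 for W; it uses the rounded published coefficients, so the British figure comes out near 4,545 cc rather than exactly 4,500 cc. Names are hypothetical.

```python
import math

# Rounded estimates from table 4: (niche width per 1,000 cc, niche-overlap density).
COEFFICIENTS = {
    "France": (-0.06, 0.005),
    "Germany": (-0.04, 0.01),
    "Great Britain": (-0.22, 0.01),
}


def break_even_width(country: str, overlap: float) -> float:
    """Niche width (cc) at which the benefit of width exactly offsets the
    penalty of `overlap` competitors, i.e. the multiplier equals 1.0."""
    b_width, b_overlap = COEFFICIENTS[country]
    return -b_overlap * overlap / b_width * 1000.0


thresholds = {c: break_even_width(c, 100.0) for c in COEFFICIENTS}

# French generalist with W = 5,000 cc overlapping only two other firms.
b_width, b_overlap = COEFFICIENTS["France"]
generalist_multiplier = math.exp(b_width * 5.0 + b_overlap * 2.0)
```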

We addressed the functional form of the localized competitive effects 
of niche overlap by using a nonparametric approach, replacing the log- 
linear effect of overlap density in table 4 with three dummy variables 
indicating overlap of 1-5, 6-50, and greater than 50 (with 0 as the excluded
category). Although the results vary among the three countries, in each 
case we find that overlap density does not raise the hazards until it reaches 
a count higher than 50. 

We test hypothesis 3 by adding an interaction of niche-overlap density 
and industry age to the models whose estimates were reported in table 
4. As predicted, the estimated interaction effect is negative (and significant) 
in all three populations in table 5. For example, in France, the effect of 
niche-overlap density, strong at the inception of the industry, has been 
eliminated entirely 50 years later in the industry evolution. Competitive 
crowding and its positive effect on the hazards tend to be more pro- 
nounced in the early stages of the industry evolution. 
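
The age at which the overlap effect vanishes can be checked with a small calculation. The sketch assumes that the interaction coefficient in table 5 is scaled by 10⁻² (a reading of the table header that is consistent with the 50-year figure in the text) and uses the rounded estimates; function names are hypothetical.

```python
def overlap_effect(b_overlap: float, b_interaction_x100: float,
                   industry_age: float) -> float:
    """Net effect of one additional overlapping firm on the log hazard
    at a given industry age (interaction reported in units of 10^-2)."""
    return b_overlap + (b_interaction_x100 / 100.0) * industry_age


def vanishing_age(b_overlap: float, b_interaction_x100: float) -> float:
    """Industry age at which the net overlap effect reaches zero."""
    return -b_overlap / (b_interaction_x100 / 100.0)


# Rounded French estimates from table 5: main effect .01, interaction -.02.
france_age = vanishing_age(0.01, -0.02)
france_effect_at_50 = overlap_effect(0.01, -0.02, 50.0)
```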

Now we turn to the process effects of organizational change on mor- 
tality, net of the effects of the niche width and niche-overlap density that 
result from the change. Table 5 reports estimates of specifications of the 
effects of niche expansion and niche contraction by 10% or more, along 
with a “change clock,” which is reset to zero whenever a firm makes either 
type of change. We explored the issue of nonproportionality of these ef- 
fects, and we determined that the niche-width effects vary with industry 
tenure in France but not in Germany and Britain. Overall, we find that 

TABLE 5
ML ESTIMATES OF THE EFFECTS OF NICHE WIDTH, OVERLAP DENSITY, AND POSITION
CHANGE ON THE DISBANDING/EXIT HAZARDS OF EUROPEAN AUTOMOBILE
MANUFACTURERS, 1885-1981

                                             France              Germany          Great Britain
Niche width (W × 10⁻³) ................   −.06  (−2.01)       −.08  (−1.35)       −.21  (−4.54)
Niche-overlap density (NO) ............    .01   (2.91)        .06   (3.21)        .01   (4.38)
Niche-overlap density × industry
  age (× 10⁻²) ........................   −.02  (−2.02)       −.12  (−2.53)       −.02  (−1.95)
Change in niche width, expansion:
  u < 3 ...............................    .33   (1.67)        .53   (2.86)        .03   (1.33)
  u ≥ 3 ...............................    .22   (1.20)        .53*                .03*
Contraction:
  u < 3 ...............................    .50   (2.33)       −.03   (−.1?)        .09    (.61)
  u ≥ 3 ...............................  (illegible) (1.20)   −.03*                .09*
Change clock (niche width) ............    .03   (1.61)        .03    (.75)        .03   (1.33)
Log likelihood ........................  −1,724.6             −635.5             −1,857.8
LR test vs. model in table 4 (df) .....   14.8 (6)            13.8 (4)             6.2 (4)

Note: Figures in parentheses are t-statistics. All specifications also
contain all covariates in table 4.
* Indicates that a coefficient is constrained to equal the one immediately above.


the process effects of change are positive, that is, change increases the 
hazard of disbanding/exit, whenever the effect is statistically significant. 
Moreover, expansion appears to be more hazardous than contraction. 
Expansion elevates the hazard in all three countries, although the effect 
is not significant in Britain. In addition, the effect is nonproportionate in 
France: the effect is smaller after an organization’s tenure in the industry 
exceeds three years. The effect of niche contraction is positive and sig- 
nificant only for French firms with an industry tenure of less than three 
years (such change increases their hazards by 65% according to this result). 
The contraction effects are not significant in the German and British 
populations. 

The coefficient of the change clock tells how the hazard changes with 
the time elapsed since the organization’s most recent change. Generally,
research that uses specifications like ours finds that the deleterious effects 
of change erode over time. This pattern is usually termed negative du- 
ration dependence. This is not the case in the populations we study. In 
contrast, we find positive (but not significant) duration dependence in 
each case. 

Table 6 reports estimates of an alternative representation of the effects 
of change, looking at change in market position. These specifications, 
which include effects of measures of market position and change in market 

TABLE 6
ML ESTIMATES OF AN ALTERNATIVE SPECIFICATION OF THE EFFECTS OF NICHE
WIDTH, OVERLAP DENSITY, AND POSITION CHANGE ON THE DISBANDING/EXIT
HAZARDS OF EUROPEAN AUTOMOBILE MANUFACTURERS, 1885-1981

                                              France              Germany           Great Britain
Niche width (W × 10⁻³) ................    .004   (.10)        .11  (illegible)    −.17  (−3.14)
Niche-overlap density (NO) ............     .01  (2.50)        .06   (2.91)         .01   (3.67)
Niche-overlap density × industry
  age (× 10⁻²) ........................    −.03 (−2.23)       −.12  (−2.38)        −.03  (−2.53)
Change in niche width:
  Expansion ...........................    −.07  (−.38)        .29   (1.23)        −.32  (−2.23)
  Contraction .........................  (illegible) (−.81)   −.30   (−.88)        −.30  (−1.84)
Change clock (niche width) ............     .06   (.97)       −.02   (−.28)       −.002   (−.05)
Position:
  Distance above market center
    (× 10⁻²) ..........................    −.04 (−2.58)       −.02   (−.96)        −.01  (−1.76)
  Distance below market center
    (× 10⁻²) ..........................   −.004  (−.77)       −.01  (−1.27)        .001    (.20)
Change in relative position
  (ΔP × 10⁻²) .........................    −.01 (−1.19)       −.04  (−2.10)        .001    (.22)
Any change in absolute position
  (ΔP* = 1):
  u < 3 ...............................     .65  (4.31)        .43   (2.21)         .95   (7.66)
  u ≥ 3 ...............................     .32  (1.73)        .43*                 .34   (2.11)
Change clock (absolute change) ........    −.03  (−.53)        .06    (.68)         .04    (.65)
Log likelihood ........................  −1,711.1             −627.9              −1,824.9
LR test vs. model in table 4 (df) .....   41.8 (10)           29.0 (9)             72.0 (10)

Note: Figures in parentheses are t-statistics. All specifications also
contain all covariates in table 4.
* Indicates that a coefficient is constrained to equal the one immediately above.


position, also separate the effects of process and content change.” It ap- 
pears that in the European automobile industries, being a “high end” 
producer has been a beneficial position. The coefficient capturing distance 
of a firm’s niche center above the market center has a negative effect on 
the hazard; the effect is significant for the French and British populations. 
In France, for example, a company whose production centers around a 
model that is more powerful than the most sought after generalist model 
by 1,000 cc reduces its hazard of disbanding/exit by about a third. 

The effect of relative position change—change that does not necessarily 
involve organization-level transformation—is significant only in Germany, 
where the estimated effect implies that the hazard drops by about 30% 
if a firm repositions its niche center by 1,000 cc (i.e., whenever a firm’s


° To further investigate the effect of change in relative position, we allowed the effect 
to differ by direction (i.e., toward vs. away from the market center), but this distinction
was not justified by the results. 
“average” car becomes more similar or dissimilar to the “average” generalist
car by an engine capacity difference of 1,000 cc). So, it appears
that, while in France and Britain being a high-end specialist confers an 
advantage, the real benefit in Germany came from constant exploration, 
even if it was incidental. 

Paradoxically, as the next set of results indicates, exploration generally
comes at a very high cost and proves beneficial for life chances only as 
long as it is incidental. The estimates for the effect of absolute change 
(change that, regardless of its direction, involves repositioning by the focal 
firm) are positive and highly significant in all three countries. In France 
and Britain, these effects are also nonproportionate; they are much 
stronger for firms with less than three years of tenure in the auto industry. 
In Britain, such firms are 2.6 times more likely to fail, and they are 1.9 
times more likely to fail in France. Once their industry tenure surpasses 
three years, those disadvantages decline to a level at which they increase 
the hazard by about 40% in both countries. In Germany, the absolute 
change effects increase the hazard by roughly one half, irrespective of 
tenure. In France and Britain, where the absolute change effects hinge 
on tenure, the initial hump in mortality disappears to reveal monotonic 
negative tenure dependence. Overall, the results in tables 5 and 6 provide 
strong support for hypotheses 4 and 5. 
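
The failure multipliers cited in this paragraph are the exponentiated absolute-change coefficients from table 6. The following sketch reproduces them from the rounded estimates; the dictionary layout and key names are hypothetical.

```python
import math

# Rounded table 6 estimates for any change in absolute position,
# by industry tenure (less than three years vs. three years or more).
ABSOLUTE_CHANGE = {
    "France": {"under_3_years": 0.65, "3_years_or_more": 0.32},
    "Germany": {"under_3_years": 0.43, "3_years_or_more": 0.43},  # constrained equal
    "Great Britain": {"under_3_years": 0.95, "3_years_or_more": 0.34},
}


def hazard_ratio(coefficient: float) -> float:
    """Multiplicative change in the hazard implied by a log-linear coefficient."""
    return math.exp(coefficient)


ratios = {
    country: {period: round(hazard_ratio(b), 2) for period, b in effects.items()}
    for country, effects in ABSOLUTE_CHANGE.items()
}
```

A ratio of about 2.6 for young British firms and about 1.9 for young French firms matches the figures in the text, as do the later-tenure ratios of roughly 1.4 and the German ratio of roughly 1.5.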

Summarizing the main results presented so far, we note two sets of 
effects that stand out as robust and consistent across the three populations 
of automakers. Both competitive crowding and organizational change (its 
process dimension, in particular) significantly elevate mortality hazards. 
Earlier we also argued that a possible second-order (indirect) effect of 
competitive crowding might heighten mortality hazards by increasing the 
propensity of firms to initiate change. To test this proposition, we now 
consider estimates of models of the rates of change in niche-width and 
(absolute) market-position. 


Change in the Niche 


Table 7 reports GEE estimates of models of change in the center of the niche,
based on the panel data on change. Overlap density has
a positive effect on the propensity to change. The effects are highly sig- 
nificant for France and Britain but not Germany. The effects in the French 
and British populations are strong in substantive terms as well. For instance,
a French firm’s odds of absolute position change are 112% higher
when it overlaps with 75 firms as contrasted to the situation in
which it has no overlaps. In Britain, the odds of change at NO =
75 are 35% higher than with no overlaps. So, in accord with hypothesis 6,
we find evidence that competitive intensity not only increases mortality 

TABLE 7
GEE ESTIMATES OF POOLED LOGIT REGRESSION MODELS OF (ANY) CHANGE IN
ABSOLUTE POSITION BY EUROPEAN AUTOMOBILE MANUFACTURERS, 1885-1981

                                               France              Germany          Great Britain
Constant ...............................   −2.97  (−1.36)      −1.07  (−1.64)       .24    (.51)
Tenure in the industry .................     .01    (.86)        .01    (.45)       .09   (6.41)
Depression year ........................  (illegible) (.57)      .28    (.85)      −.25  (−1.13)
ln(GDP) ................................    −.62  (−1.41)       −.25   (−.46)       .46   (1.40)
Prior existence ........................    −.24  (−1.21)       −.37  (−2.28)       .04    (.35)
ln(relative size) ......................    −.31   (−.51)       −.09   (−.52)       .49   (2.68)
ln(size) ...............................     .05    (.50)       −.07   (−.81)     .0002   (.006)
Size < 50 ..............................     .28   (1.27)       −.04   (−.16)      −.29  (−2.15)
Niche width (W × 10⁻³) .................     .02    (.23)        .11   (1.56)      −.08  (−1.50)
Niche-overlap density (NO) .............     .01   (4.12)        .01   (1.21)      .004   (3.09)
N of prior changes in niche width ......    −.75  (−3.81)       −.93  (−2.50)      −.39  (−5.94)
Distance above market center (× 10⁻²) ..     .12   (1.14)       −.05  (−2.51)      −.02  (−3.20)
Distance below market center (× 10⁻²) ..     .01   (1.13)        .01   (2.05)     −.002   (−.94)
N of prior changes in absolute
  position (NPC) .......................    1.74   (6.01)       1.37   (3.26)       .29   (6.70)
NPC × tenure ...........................    −.04  (−3.71)       −.01  (−4.32)     −.003  (−4.76)
NPC × niche width ......................    −.06  (−1.13)       −.05  (−2.19)       .02   (2.19)
N of observations ......................   5,171               2,272              5,582
N of firms .............................     819                 373                987
Wald χ² ................................  309.90              105.45             223.09


directly, but it also does so indirectly by increasing the rate of organi- 
zational change in two of three populations studied. 
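
The percentage figures quoted for the overlap-density effect follow from the logit coefficients by exponentiation. As a minimal sketch, assuming that the French coefficient is the rounded .01 reported in table 7 and that the percentages describe a multiplicative change in the odds, the arithmetic can be reproduced as follows. The function name is hypothetical, and the .004 used in the second call is an illustrative assumption (a coefficient that would yield roughly 35%), not an estimate taken from the table.

```python
import math


def pct_change_in_odds(coef: float, overlaps: float) -> float:
    """Percentage change in the odds of a position change when niche-overlap
    density rises from 0 to `overlaps`, given a logit coefficient `coef`."""
    return (math.exp(coef * overlaps) - 1.0) * 100.0


# French column of table 7: rounded coefficient of .01, evaluated at NO = 75.
french = pct_change_in_odds(0.01, 75)

# Hypothetical coefficient, chosen only to illustrate a 35% figure.
illustrative = pct_change_in_odds(0.004, 75)

print(f"NO = 75, b = .01:  {french:.1f}% higher odds")
print(f"NO = 75, b = .004: {illustrative:.1f}% higher odds")
```

Because the effect is multiplicative in the odds, the same coefficient implies a much larger percentage change at high overlap counts than at low ones.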

Our hypotheses concerning learning and inertia hold that greater cu- 
mulative absolute-position change will trigger subsequent changes in ab- 
solute position but will deter change in niche width. Support for the first 
part of this proposition is strong; the estimates in table 7 show that ex- 
perience with the same change has a strong positive effect on the pro- 
pensity to change further. Additionally, consistent with earlier studies 
examining rates of organizational change (Amburgey et al. 1993; Minkoff 
1999), we find that the momentum of change declines with industry tenure. 

We also find support for the proposition that experience with one type 
of change deters the occurrence of another type of change. Cumulative 
niche-width change has a significant negative effect on the rate of position 
change in all three countries. 

Our final hypothesis predicts that the effect of previous changes on the 
likelihood of repeating the same type of change declines with niche width 
(regarded as a measure of complexity). The estimated effects of the product
of niche width and number of prior changes in position on the propensity
to change position are negative, as predicted, for only two of the three 
populations, and the effect is only significant for the German population. 


Effects of Other Covariates 


To this point, we have focused on the effects that bear directly on our 
arguments. However, we embedded these effects in a model that specifies 
effects of size, industry tenure, density, and several environmental covariates.
Several of these effects deserve note. First, adding the effects of
niche width and niche-overlap density does not change the pattern found
earlier without these effects (Hannan et al. 1998a). The hazards of
disbanding/exit decline with relative size and increase with industry tenure,
and the effect of each depends upon the level of the other (the effects are 
nonproportional). In particular, there is positive age dependence for small 
firms and negative age dependence for large ones. 

One set of findings, concerning the dynamic multilevel specification of 
density dependence, is new. The theory of multilevel density dependence 
holds that different levels of population density drive legitimation and 
competition. Modified to account for the nonsymmetric effect of popu- 
lation density at different stages of the industry evolution (Hannan 1997), 
the model, in the form specified here, posits that the baseline density effects 
(low counts increasing survival through legitimation and high counts increasing
mortality through competition) reverse direction as an industry
structure evolves and solidifies. But resurgence in density in the late cycle 
of industry evolution again reverses the direction of effects to the original 
pattern. This account is consistent with the resource-partitioning story 
explaining the late surge in population density with the proliferation of 
specialists on the market periphery. 

This model has previously been estimated for founding rates but has 
not, to our knowledge, been applied to mortality rates. The results in table 
4 offer strong support for the predictions of the theory. Looking at the 
main effects of density, we see that Europe-wide density has a negative 
effect on mortality rates and (the square of) country density has a positive 
effect. These results agree with the view that legitimation processes flow 
more easily over national borders than do competitive processes. Looking 
at the interactions of these effects with industry age, we see that, with 
the exception of the interaction between the squared terms of industry 
age and country density in France, the legitimation and competition effects 
have the predicted directions for all three countries.


DISCUSSION


The findings of this research expand our knowledge of the processes 
investigated by three well-developed and empirically tested theory frag- 
ments: density dependence, resource partitioning, and structural inertia. 
Our results help to establish conceptual links that integrate these theory 
fragments into a coherent research stream similar to the Mertonian type 
of theory of the middle range, thus further consolidating the broad knowl- 
edge base accumulated within the ecological framework of organizational 
analysis. 

Our contribution to density dependence here lies in elaborating the 
multilevel form of the model. Similar to models of locally partitioned 
density-dependence, we describe a process that could be termed “niche- 
localized” density dependence. 

At the core of our argument lies the assumption that a process of 
constitutive diffusion of a new product resembles the legitimation of a 
new organizational form. A single organization can rarely carry out the 
institutionalization of a new line of products or services. That many com- 
panies are willing to invest in and pursue an emerging market domain 
provides a sign that inspires trust and confidence in potential customers 
and partners alike. The presence of more than one actor in a market 
segment also reflects positively on the status of early entrants. 

Clearly, the transition from monopoly to duopoly poses constraints on 
access to resources and introduces a competitive relationship. But, at low 
levels of niche-overlap density, the disadvantages posed by the entry of
competitors are minimized (indeed, our results suggest that they are
neutralized) by the subsequent entrants’ contribution to affirming the identity
of the new product. Growth in local density also aids in defining, ex- 
panding, and fortifying the boundaries around the evolving market niche. 
This line of reasoning agrees with the implication of the resource-parti- 
tioning model that the flattening of the resource distribution over time 
results from the entry of specialists whose arrival promotes the consoli- 
dation of new market niches and changes consumer tastes. 

We find that there is a threshold level in niche-overlap density, above 
which niche crowding heightens mortality. Virtually all ecological studies 
that have analyzed the distribution of population density over time report 
a similar pattern: density increases, slowly at first, and then at a higher 
rate until it reaches a peak followed by a decline. This decline in the 
number of organizations can be explained (at least partly) by the higher 
exit rate of existing firms that fail in the competitive process once the 
limits of the population’s carrying capacity are reached. 

We find that this effect might also be indirect. Our findings suggest 
that competitive crowding propels organizations to change their niche
widths and positions. When this happens, organizations can change their
strategies to those that seek to find a viable position outside of the market 
center, a process that leads to resource partitioning. But, change in range 
of product offerings and in location proves to have vital consequences. 
Organizations that attempt such change suffer increased exposure to en- 
vironmental selection. So, it appears that density dependence breeds re- 
source partitioning; that is, competitive crowding also has an indirect 
effect on mortality by decreasing the survival chances of organizations 
exploring less competitive market segments, a process that gives rise to 
the formation of peripheral niches with thinner but untapped resources. 

The resource-partitioning model predicts that market segmentation and 
rising concentration improve the survival chances of specialists, but this 
model does not specify how specialists come to proliferate outside the 
market center. Our findings, however, imply that the survival advantages 
associated with location on the periphery might hold only for new market 
entrants, because the deleterious effects of changes in niche width and 
position might offset those advantages. This is especially true in light of 
our finding that the hazard of failure due to niche width and position 
change does not diminish with time. So it seems that the specialist ad- 
vantage comes at the cost of becoming a specialist—a finding that un- 
derscores the importance of separating the effects of content and process 
change when studying organizational transformation. 

Organizational change is precarious partly because of the difficulty of 
foreseeing the amount of related subsequent structural transformations 
that will be induced by an initial change (Barnett and Carroll 1995). Until 
all such changes are completed, the organization is unfit to operate at its 
previous levels of accountability and reliability. In other words, a “re- 
covery” period follows nontrivial transformation attempts. During this 
period, the process consequences of organizational change are hard felt. 
In this research, we found no evidence that an actual recovery from the 
negative impact of process change is certain to occur. On the contrary, it 
appears that once structural inertia triggers liability to selection, this effect 
is continuously perpetuated and amplified with time. 

We believe that our approach to defining organizational strategy along 
the generalist—specialist dimension by using technological niches of or- 
ganizations can be explored and refined in future research on resource 
partitioning. Variance in organizational niches can be used to complement 
measures of size and product characteristics (e.g., content coverage in 
newspapers), frequently used to capture this distinction in previous anal- 
yses. The effects of industry concentration, for example, can be tested on 
the subset of narrow-niche organizations, or those located outside the 
market center. Similarly, the proposed scale-based selection mechanism 
that accounts for consolidation among generalist organizations (Carroll
and Swaminathan 2000; Dobrev and Carroll 2000) can be applied to
organizations with wide niches. Overall, the niche construct
can help explicate the intricacies in the relationship between
organizational size, strategy, and resource partitioning (Dobrev, Kim, and 
Carroll 2001). 

The research presented here points to the variety of ways in which the 
niche construct could be exploited in other research fields. Our findings 
about the negative relationship between niche change and organizational 
survival, for example, can be extended to the sociological investigation 
of collective action in an interesting way. For instance, Olzak and Uhrig 
(1999) suggest that it is helpful to think about the niche of a social move- 
ment in terms of its tactical repertoire and routines for organizing, which 
eventually become embedded in the movement’s identity. Relying on the 
content/process distinction, one can test if the “fade-away” of social move- 
ments occurs when a movement attempts to broaden or diversify its tac- 
tical array. Even though new tactics might become more appealing to 
members, more socially acceptable, and thus more effective than old ones, 
the transition to different routines and tactics itself might imply that the 
movement’s identity has been lost, that it has been debauched, coopted, 
or defeated. More broadly, much sociological analysis rests on assumptions 
about embeddedness and structure. Defining and measuring the niche 
along dimensions and attributes of individual and collective social actors 
could yield the conceptual utility and the research versatility needed to 
investigate the dynamic relationship of persons and organizations to their 
positions in the social structure.


Chinese Villages and Townships as Industrial
Corporations: Ownership, Governance, and
Market Discipline¹


Yusheng Peng 
Chinese University of Hong Kong 


Public firms in a reforming socialist economy face two problems: 
the old soft-budget constraint syndrome and new principal-agent 
problems. China’s township-village government-owned enterprises 
(TVEs) outperform state-owned enterprises (SOEs) in growth rate
and productivity. Three explanations are proposed in the literature: 
(1) TVEs represent informal or ambiguous private property rights, 
which are most efficient in partial reform. (2) The small size and 
scale of township-village governments as industrial corporations al- 
lows officials to monitor TVEs directly and to limit their where- 
withal for cross-subsidizing. (3) Strict market disciplines facing 
TVEs render indirect market monitoring to mitigate agency prob- 
lems effectively. Analyses of Jiangyin data show that (a) while both 
superior to local SOEs, village and township enterprises share sim- 
ilar productivity and (b) the scale of township-village corporations 
slightly increases productivity. I conclude that local state corpora- 
tism should be revised to include market discipline as an effective 
governance mechanism. 


INTRODUCTION 


China’s departure from central planning in the industrial sector can be 
viewed from three aspects: decentralization, marketization, and property 
rights reform at the enterprise level. Since 1978, the Chinese central government
devolved control and ownership of state enterprises to lower
levels of government and granted local governments the right, in the form 
of fiscal contracts or tax farming, to derive residual income from the 
enterprises under their jurisdictions. While centralization and decentral- 
ization have been perennial problems for China’s political economy since 
the 1950s (Lardy 1975), post-Mao decentralization significantly changed 
the relationship between central and local governments and between local 
governments and their enterprises. Rather than merely playing the role 
of collection agents for the Ministry of Finance, local governments now 
actually earn income from enterprises in their jurisdiction (Wong 1992; 
Wong, Heady, and Woo 1995; Oi 1999). 

More significantly, the post-Mao decentralization occurred in parallel 
with a marketization process that gradually shifted Chinese industry from 
a planned system to a market-driven one, and created various markets, 
including those that traded securities. A key feature of the Chinese mar- 
ketization process was a dual track system in which market transactions 
and bureaucratic allocations coexisted. As the absolute volume of indus- 
trial goods allocated through planning remained fixed, the share of eco- 
nomic activities carried out in the market sector expanded as the economy 
grew. By 1993/94, the Chinese economy had basically grown out of the 
plan (Naughton 1995). 
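
The logic of "growing out of the plan" is simple arithmetic: if the planned volume stays fixed while total output grows, the market share of activity rises toward one. The sketch below illustrates this under stated assumptions. The function names and all numbers (an initial output of 100, a fixed planned volume of 80, and 10% annual growth) are hypothetical and are not drawn from the sources cited.

```python
def market_share(total_output: float, planned_volume: float) -> float:
    """Share of output transacted outside the plan, given a fixed planned volume."""
    if total_output <= 0:
        raise ValueError("total_output must be positive")
    return max(0.0, 1.0 - planned_volume / total_output)


def simulate(initial_output: float, planned_volume: float,
             growth_rate: float, years: int) -> list[float]:
    """Market share in each year while the planned volume is held constant."""
    shares = []
    output = initial_output
    for _ in range(years + 1):
        shares.append(market_share(output, planned_volume))
        output *= 1.0 + growth_rate  # the economy grows; the plan does not
    return shares


# Hypothetical illustration: plan fixed at 80 units, output starting at 100.
shares = simulate(initial_output=100.0, planned_volume=80.0,
                  growth_rate=0.10, years=10)
for year, share in enumerate(shares):
    print(f"year {year:2d}: market share = {share:.2f}")
```

In this illustration the market share rises every year even though no planned allocation is ever cut, which is the sense in which the dual track lets the economy outgrow the plan.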

Property-rights reform of the state-owned enterprises centered on re- 
partitioning control and residual income rights, in the form of granting 
autonomy to SOEs and allowing them to retain part of their profits. A 
managerial contract system was implemented in an attempt to link man- 
agers’ pay with performance. While large-scale privatization of state or 
collective assets was insignificant until recent years, initial changes in 
ownership composition proceeded by gradually liberalizing restrictions on 
private entrepreneurial activities. 

According to Kornai, the decentralization was a classic type of failed 
reform, as attempted in Eastern Europe,² and simply transferred public
ownership from single to multiple public owners. Granting state-owned 
enterprises partial autonomy put them in a dual dependency (on the bu- 
reaucracy and the market), in which bureaucratic dependence often emas- 
culated market incentives—it did not solve the problem of bureaucratic 
intervention and soft budget constraints. Without a large private sector, 
Kornai (1990a, 1990b, 1992) asserted that genuine market coordination 
was impossible. But in China, decentralization seems to have been es- 
sential and effective in transforming the economy from a centrally planned 


2 Kornai (1989) called the first generation of Hungarian reformers, who attempted to 
imitate the market system via decentralization in the 1970s and early 1980s, “naive 
reformers.”


system to a market-driven one (Walder 1995; Oi 1992, 1999; Qian and
Weingast 1997; Qian and Roland 1998). Chinese economic reforms appear 
to have successfully created a market system based not on private own- 
ership, but on decentralized public ownership. 

The spotlight has focused on the spectacular success of rural enterprises 
owned and operated by township and village governments (TVEs). From 
1985 to 1996, the share of TVEs in gross industrial output expanded from 
14.6% to 27.8%, whereas that of state-owned enterprises (SOEs) shrank 
from 65% to 28.5% (Jefferson and Rawski 1999, p. 27). In 1997, all rural 
enterprises (including private ones) contributed 60% to total value added 
of rural society and 27.7% to the national GDP (Ministry of Agriculture 
1998, p. 3). Even though the private sector and “other” sectors (mostly 
foreign investment or joint ventures) expanded the fastest, most rural 
industrial output was produced by the government-run TVEs. 

Classical property rights theory maintains that clearly assigned private 
property rights are important for the smooth operation of a market econ- 
omy for at least the following two reasons. First, private ownership of 
rights serves to internalize externalities whereas public or communal ownership
incurs large externalities (Demsetz 1967). Second, the constitutional
sanctity of private property rights fends off costly disputes and, most 
important, state predation that could dampen incentives and cripple ec- 
onomic growth (North and Weingast 1989). Thus, the collective or “public” 
nature of TVEs in China seems to challenge these basic tenets of the 
orthodox property rights theory. This puzzle has aroused the interest of 
both sociologists (Walder 1995; Lin 1995; Nee 1992; Peng 1992) and econ- 
omists (Rawski 1999; Naughton 1994; Stiglitz 1994, p. 176; Weitzman and 
Xu 1994; Chang and Wang 1994). 

Whereas economists debate over whether economic reform has im- 
proved the productivity of SOEs (Li 1997; Jefferson, Rawski, and Zheng 
1992, 1996; Woo et al. 1994; Woo 1999; Rawski 1999), there is over- 
whelming evidence that TVEs outperformed SOEs in productivity growth 
(Woo et al. 1994; Jefferson 1999; Perkins 1996; Wu 1995), despite the fact 
that TVEs generally use lower-end technology and derive large efficiency 
gains from SOE skill spillovers (Peng, Zucker, and Darby 1997). As 
Naughton (1994, p. 268) has noted, the rapid growth of the TVEs can be 
explained by external forces such as the availability of cheap labor, tax
concessions (and evasion), and distortions in state industries.³ An interesting
question remains whether TVEs represent an efficient institutional
adaptation to partial reform. The objective of this study is to explore the 


3 Indeed, even the classical socialist system was able to achieve high extensive growth
rates (Szelényi 1989).


institutional and organizational factors that account for the observed superior
productivity performance of TVEs vis-à-vis SOEs.

Broadly, three theoretical approaches emerge from the literature, each
emphasizing one or two of the three aspects of reform. (1) The informal
privatization approach emphasizes property rights reform and interprets 
the TVEs as a form of informal or disguised privatization (Nee and Su 
1996; Sachs and Woo 1997), or organizational hybrids (Nee 1992) with 
ambiguous property rights (Li 1996). In this light, TVEs were the “second 
best” adaptation to the environment of market imperfection and political 
hostility toward private property. This approach is labeled “convergence 
theory” because its proponents view Chinese TVEs as a halfway house 
in the transition toward a full-blown Western-style market economy based 
on private ownership (Nee 1992; Woo 1999). (2) The corporate governance 
approach emphasizes that the devolving and pairing of control rights and
residual income rights over enterprises to local governments turned them 
into economic corporations and public officials into boards of directors. 
This institutional arrangement became known as “local state corporatism” 
(Walder 1995, 1998; Oi 1999) or “local market socialism” (Lin 1995). The 
TVEs’ superior performance is to be explained in terms of corporate 
governance, that is, township-village governments’ commitment to hard 
budget constraints and better monitoring of TVE managers. (3) The mar- 
ket discipline approach (Lin, Cai, and Li 1998, 1999; Che and Qian 1998) 
shares the local state corporatist conviction in the feasibility of an efficient 
market system under public ownership but differs from the latter in its 
emphasis on external market environment and market disciplines rather 
than internal governance structure in explaining the success of TVEs. 
Woo (1999) terms the second and third approaches the “experimentalist 
school,” because they explicitly or implicitly believe that Chinese reform 
in the public sector, especially the success story of TVEs, represents a 
unique institutional innovation that is not only feasible but also viable, 
at least during the transition period. 

In this study I compare the productivity performance of government- 
owned enterprises at the village level, township level, and municipal level. 
Adopting the conceptual framework of local state corporatism, I incor- 
porate both enterprise and corporate-level variables in a hierarchical pro- 
duction function. Analyses of data from Jiangyin, a successful local state 
corporatist region, show that: (1) while superior to local state enterprises 
at the county level, village-owned enterprises share similar productivity 
with township-owned enterprises; and (2) the scale and size of township- 
village corporations slightly increase rather than decrease productivity. I 
conclude that a revised version of local state corporatism that incorporates 
market competition and market disciplines best explains the relatively 
superior efficiency of TVEs.


FISCAL DECENTRALIZATION AND ENTERPRISE REFORM


The reform experiences of state-owned enterprises in Chinese cities seem 
to follow Kornai’s logic about the unreformability of public enterprises 
in reforming Eastern Europe. Two problems loom large in the reform of 
the Chinese urban state industry: the old soft budget syndrome and the 
new principal-agent problem. 

The Chinese state bureaucracy has six levels. At the top are various industrial
ministries (e.g., Ministry of Textiles, Ministry of Petroleum). Prior
to the economic reforms, a ministry planned and supervised the operation 
of most large and medium public firms in a specific industry, and its 
jurisdiction cut across geographical boundaries. During the reforms, these 
ministries lost a lot of enterprises to regional governments but kept the 
large ones, and they began to behave like giant industrial conglomerates. 
In 1998, a number of these ministries were separated from the government 
bureaucracy and became officially incorporated (Forney 1998). At the next 
level are the 30 provincial governments. By the end of 1993 there were 
335 prefecture-level governments below the provinces, and nested under 
the prefectures were 2,166 county (xian) governments. Chinese cities range 
across three administrative levels: 3 provincial, 196 prefectural, and 371 
county-level cities. Below the counties there are 48,000 townships (xiangzhen),⁴
whose governments are the lowest rung in the state apparatus. At
the grassroots level there are numerous village (cun) committees that are 
not part of the formal state apparatus but perform governmental 
functions. 

Chinese public enterprises belong to different levels of government 
jurisdiction (Walder 1992). Firms directly run by ministries and those 
owned by provincial or prefecture-level governments are called “state 
owned” (guoying); those at the county level are “local state owned” (difang 
guoying); and those at the township and village levels are classified in the 
“collective” sector (jiti). The state’s treatment of enterprises at different 
levels was very different and so was their marketization process. Local 
SOEs, usually small in size, played peripheral roles in the eyes of the 
central planners and thus received less favorable treatment in terms of 
resource allocation. Rural enterprises at the township and village levels
were never part of central planning and thus not entitled to state subsidies 
such as cheap credit or low-priced inputs. In the prereform period they 
were named commune-brigade enterprises and had to struggle for survival 
in the interstice of the planning system. During the reform process, local 
SOEs were the first to be released from the planned sector into the market, 


4 Numbers of administrative units change from year to year. The figures cited here
are from the State Statistical Bureau (1994, pp. 3, 17).


while large and medium SOEs gradually grew out of the plan via the
dual track system (Naughton 1995). 

Prior to 1978, fiscal revenues and expenditure were centralized in the 
Chinese state bureaucracy. Profits and taxes collected by local govern- 
ments were handed over to the center. The expenditure of local govern- 
ments depended less on local revenue-generating efforts and more on 
bargaining with higher authorities for budgetary slack, the allocation of 
which softened budget constraints on local governments and on the SOEs 
lodged in their jurisdictions. Local governments would not hesitate to bail 
out failing enterprises because shortfalls in one year became the rationale 
for bargaining for larger allocations the next year (Walder 1995). This 
cycle gave rise to the classic soft budget constraint syndrome identified 
by Kornai (1992, pp. 489-97). Kornai saw this as an incurable disease 
afflicting state-owned enterprises because the state, as the owner, also has
objectives (e.g., maintaining employment and political stability) other than 
financial performance. On the macrolevel, soft budget constraints lead to 
overinvestment and shortages; on the firm level, they engender ex ante 
and ex post inefficiency. First, bailing out poor-performance firms ex post 
means that the mechanism of “constructive destruction” is not at work. 
Second, because they expect government bailouts, firm managers have 
little incentive to improve productivity and efficiency. 

In order to increase the revenue-generating effort on the part of local 
governments, the center not only devolved control of most SOEs to them, 
but also made local governments the residual claimants of SOE profits 
in their jurisdiction by implementing a fiscal contracting system in 1980. 
Fiscal contracting was basically a form of multilayer tax farming. At the 
first level, the center negotiated with each province a fixed lump sum or 
fixed proportion of its revenue (taxes and SOE profit remittance) to be 
handed over to the center. The provinces retained the remainder after 
fulfilling their obligations. The terms of the contracts were to remain fixed 
for three years or more, except for the three “cash cow” metropolises 
(Shanghai, Beijing, and Tianjin), for which the contract provisions were 
renegotiated annually. This arrangement was replicated down to the township
level.⁵ Fiscal contracting made the local government accountable for
its own revenue and spending. After the reform, a local government could
no longer expect slack in grant allocation to make up for its own deficits.


5 Fiscal contracting in the 1980s actually led to a boom in local budgetary and extrabudgetary
spending and a decline in the center’s revenue relative to national GDP.
This propelled the center to revamp the tax system in 1994. The resulting new tax-sharing
system separated state and local taxes but maintained the basic features of
fiscal contracting (Wong 1997).


Thus, its budget became highly dependent upon the financial health of
enterprises within its own jurisdiction (Wong 1992; Wong et al. 1995). 
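
The residual-claimant logic of fiscal contracting can be made concrete with a small calculation. The sketch below is an illustration under stated assumptions, not a description of any actual contract. The function names are hypothetical, the revenue figures are invented, and the contract is reduced to the two forms mentioned in the text (a fixed lump sum or a fixed proportion of revenue).

```python
from typing import Optional


def remittance(revenue: float, lump_sum: Optional[float] = None,
               proportion: Optional[float] = None) -> float:
    """Amount owed to the higher level under a stylized fiscal contract.

    Exactly one of `lump_sum` or `proportion` must be given.
    """
    if (lump_sum is None) == (proportion is None):
        raise ValueError("specify exactly one of lump_sum or proportion")
    if lump_sum is not None:
        return lump_sum            # owed regardless of how much is collected
    return proportion * revenue    # owed as a fixed share of collections


def retained(revenue: float, **contract: float) -> float:
    """Residual kept by the locality; negative values are its own deficit."""
    return revenue - remittance(revenue, **contract)


# Hypothetical numbers: revenue rises from 100 to 120 under each contract form.
for label, contract in [("lump sum of 50", {"lump_sum": 50.0}),
                        ("50% share", {"proportion": 0.5})]:
    gain = retained(120.0, **contract) - retained(100.0, **contract)
    print(f"{label}: locality keeps {gain:.0f} of the extra 20 in revenue")
```

Under the lump-sum form the locality keeps the whole of any additional revenue, whereas under the proportional form it keeps only its contracted share. In both cases a shortfall falls on the locality itself, which is why its budget comes to depend on the financial health of its own enterprises.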

The financial relationship between local governments and SOEs was 
shaped and reshaped by two reform processes. The first was the tax reform 
that started in 1983 with “tax for profit” (ligaishui), in which profit (and
loss) remittances were replaced by taxation. Until the 1994 overhaul of 
the tax system, the tax-for-profit reform did not create a level playing 
field because taxes and profit remittances were tailored to each enterprise 
in a profit-sharing contract or “contract responsibility system” (qiye cheng- 
bao). Tax targets and profit-sharing arrangements were individually ne- 
gotiated with the government and remained fixed for three years. The 
second was “loan for grant” (bogaidai), in which administrative grants
were replaced by bank loans. The intention of this reform was to increase 
the accountability of SOE managers: if enterprises had to pay back loans, 
they would use funds more efficiently. At the same time as this reform, 
the local branches of the state bank gained more autonomy in approving 
credit.

The soft budget constraint problem for SOEs did not disappear, how- 
ever. Although fiscal decentralization and competition hardened budget 
constraints, credit decentralization in the state banking system softened 
them (Qian and Weingast 1997; Qian and Roland 1998; Wang 1991). The
softness of SOE budget constraints took on a new form: soft grants were 
replaced by soft loans. SOEs, on the one side, found it possible, if not 
convenient, to procrastinate or default on payment. SOE managers could 
always come up with reasons to rationalize their poor performances, such 
as unfavorable prices and undue social responsibilities. The municipal 
government, on the other side, unwilling to see SOEs go under, could 
persuade or order the state bank branches in its jurisdiction to provide 
“policy loans” to failing firms or to refinance unprofitable projects ex post. 
For local officials, SOEs running in the red were more palatable than 
laid-off workers going onto the streets. Besides, if the bank could pick 
up the tab, why not? The state bank took on the characteristics of a 
public-good externality, because local branches could expect the central 
bank to come to their rescue. According to one estimate, by the mid-1990s, 
nonperforming loans in the state banking system amounted to over 40% 
of its total loans (mostly to SOEs), or 30% of the country’s GDP (Ziegler 
1997). This propelled Premier Zhu Rongji to start commercializing the
state banking system in 1994 and to cut the link between regional governments
and the state banking system in 1996 (Naughton 1997), measures
that forced many SOEs to close down or to lay off and furlough workers.


6 Externalities exist when social costs or benefits are not included in the calculation of
private persons or organizations. In the case of excessive bank credit in China, each
locality gets the full benefit of borrowing and shares the cost, inflation, with all other
localities (Qian and Roland 1998).


Principal-agent problems arose when the state (the principal) delegated 
many contractual and residual control rights to SOE managers (the agent). 
In the classical redistributive system, all SOEs were subject to central 
planning. Deprived of control rights and residual income rights, the SOE 
managers were like “puppets,” receiving not only production targets but 
also quotas for labor, capital, and raw material. All profits or losses were 
handed over to the state. In the context of planning, depriving enterprises
of autonomy was rational in terms of reducing managerial indiscretion and
ensuring the implementation of plans (Lin, Cai, and Li 1999). Agency 
costs were minimal (except for shirking), but the system suffered serious 
information distortion and stifled the incentive for efficiency and quality 
improvement, and initiatives for technological innovation on the part of 
the managers who were most informed and knowledgeable about the 
operation of the enterprises. The incentive that SOE managers did have 
was to use their information advantage to bargain with planning agencies 
for larger budget allocation.*

Thus, management reforms in the 1980s focused on expanding SOEs’
autonomy to produce outside of the plan and to self-market overquota
outputs, and on allowing them to retain partial profits for bonuses and
investment, and so on. These reform measures did increase SOEs’ profit
motives and may even have increased their productivity—at least in the
1980s—but they also ushered in serious principal-agent problems, for 
example, managerial indiscretion, insider control, and state-asset strip- 
ping. In the 1990s, SOE profits took a nosedive, even when output in- 
creased (Bai, Li, and Wang 1997; Woo et al. 1993). 

The massive entry and rapid growth of rural industries since the early 
1980s changed China’s economic landscape (Naughton 1995; Ho 1994; 
Peng 1999). The rural enterprises owned by township and village gov- 
ernments demonstrated dynamics and vitality unforeseen in the urban 
state sector. How could the “public” TVEs avoid the pitfalls of SOEs? 


* This is what Wolf (1988, p. 70) calls internality induced nonmarket failure, e.g., the
tendency of the U.S. Defense Department to justify rather than reduce costs.


PROPERTY RIGHTS, CORPORATE GOVERNANCE, AND MARKET
DISCIPLINE


Rural industrialization in China is often mapped into three famous “models”:
(1) the Southern Jiangsu model (sunan moshi), which features collective
ownership; (2) the Wenzhou model, which features private ownership;
and (3) the Pearl River Delta (Guangdong) model, which features
foreign investment and export-oriented manufacturing.* This section discusses
three theoretical perspectives pertinent primarily but not exclusively
to the Southern Jiangsu model in the 1980s and early 1990s.


Informal Privatization 


The informal privatization approach represents an attempt to reconcile 
the apparently paradoxical “progress without privatization” within the
framework of property rights theory. It may be called the “private incen- 
tive” interpretation because it focuses on the role of TVE managers and 
entrepreneurs keen on taking advantage of new market opportunities and 
maximizing personal income. 

How could private incentives align with the “collective” nature of TVE 
ownership? Proponents of this approach argue that property rights of 
TVEs are informally private (Nee and Su 1996; Nee 1992) or ambiguously 
defined (Li 1996). Informal private property rights evolve from two im- 
portant institutional changes: (1) profit sharing, which limits local gov- 
ernment’s claim on the firm, and (2) managerial contracting, which gives 
TVE managers large autonomy in controlling the operation and large 
discretion in using the above-quota profits. These rights are not backed 
by legal ownership but are based on a mutually beneficial collusion be- 
tween managers and local officials and buttressed informally by social 
networks such as family ties and kinship networks, and personal con- 
nections. Thus, the security of these rights depends on the strength and 
stability of the social network in which these rights are embedded (Nee 
and Su 1996, p. 114). 

According to Nee, informal property rights represent a hybrid organ- 
izational form that is most efficient in partial reform. Due to political 
discrimination and market imperfection, pure private ownership incurs 
very high transaction costs (e.g., difficulty in obtaining bank loans) and 
invites governmental predation (debilitating taxes, fees, levies, and bribery 
exaction). Nee (1992, p. 3) defined local corporatism as a “loosely coupled 
coalition between local government, financial institutions, and firms (col- 
lective and state owned) aimed at promoting market-oriented growth.” 
The convenient alliance between governments and enterprises was a lo- 
cally efficient solution to the problem of partial reform and weak market 
institutions, but it may have exacerbated overall inefficient resource al- 
location. Thus, Nee (1992) predicted that as the marketization process 
advances and the cost of market transactions falls, the cost of the hybrid
form and informal privatization will outweigh the benefits. This will generate
demands for legal security of private ownership and clarification of
property rights in the state sector.

* In terms of property rights regime, the Pearl River Delta region in Guangdong represents
a whole spectrum of governmental, communal, and private blends.

Nee’s view is shared by many economists. Echoing Nee’s emphasis on
manager/entrepreneur’s autonomy, Zhang (1997, p. 69) argues that under 
public ownership managerial discretion (in manipulating account books 
and hiding profits) is a “good thing” because it increases incentives and 
hardens budget constraints. Li (1996) maintains that it is the manager- 
entrepreneur who created the firm and invited government officials 
as partners in a contingent contract. Ambiguous ownership arises as a 
second-best solution when market transactions involve high costs and 
high uncertainty. Sachs and Woo (1997; Woo 1999) observed that many 
of the so-called collective TVEs were actually “red caps” for covering up 
de facto private ownership. Their observation was based on the experi- 
ences of Wenzhou in Zhejiang Province, where private economic activities 
flourished with the acquiescence of local officials (Liu 1992). 


Local State Corporatism 


The local state corporatist approach, first of all, emphasizes that township 
and village governments are the de facto owners of TVEs. They have 
the rights to appoint and dismiss managers, to make important investment 
decisions, to dispose of assets, and to claim part of the net profits (by law, 
the local governments are entitled to 40% of the profits of TVEs that 
they own). Even though the ownership of TVEs is officially classified as 
“collective” (jiti), their relationship with township-village governments is 
not very different from that between SOEs and municipal governments.*
TVE property rights are unambiguously public (Walder 1995; Che and 
Qian 1998; Naughton 1994). Fiscal contracting aligned the incentive of 
officials to local economic development. We need not assume that Chinese 
officials were altruistic and noble idealists. Personal income (bonuses and 
fringe benefits) and perks (luxury cars, travel, and large expense accounts), 
as well as bureaucratic careers, were pegged to local public revenue (Whit- 
ing 1996; Oi 1999). These benefits, according to Walder (1995), are not 
different from those enjoyed by American CEOs. The key point is that 
ownership and incentives get aligned to public offices. 

Second, the better performance of TVEs vis-a-vis SOEs was derived
not from any ownership difference, but from the greater incentive for and
capability of township-village governments to enforce their rights as owners.
The call for clarification of property rights of public assets has missed
the point, as they are already clear. What is needed is a good corporate
governance structure that alleviates bilateral bargaining and mitigates
agency cost. Three organizational features of government jurisdictions
have been identified as important for corporate governance: (1) administrative
levels in the state bureaucracy, (2) governments’ nonfinancial
objectives in enterprises such as unemployment prevention and welfare
provision, and (3) the economic scale and organizational size of the local
government corporation.

* Walder (1995) pointed out that the Chinese official distinction between “state owned”
and “collective” here was not an ownership distinction but one signaling different
privileges in the planning system. Both types indicate government ownership. The
collective sector was usually left out of the central planning system and the subsidies
associated with it. I agree with him with regard to the township jiti, but will argue
later that village jiti belongs to a different category.

A general observation of local state corporatism is that the adminis- 
trative levels of the state bureaucracy matter and that the performance 
of public enterprises progressively worsens as one moves up the ladder 
(Walder 1992, 1995). Not only do TVEs outperform SOEs, it is argued,
but village firms should outperform township firms as well (I address this
in the next section); county-level SOEs should outperform prefecture-level
SOEs, and so on. Both governments’ nonfinancial objectives and corporate
scale are correlated variables used to explain the differential efficiency
across different administrative levels. But administrative levels per se are 
seen to have an impact on the security of the public property rights of 
local governments over SOEs, hence local officials’ incentive to enforce 
their property rights. Nearness to the center implies that local govern- 
ments’ property rights are more attenuated by central intervention and 
supervision. Thus, a higher-level government has fewer incentives to en- 
force property rights over SOEs in its jurisdiction than does a lower-level 
one (Walder 1995). 

The second feature is related to local governments’ propensity to bail 
out failing firms. Township-village governments face far fewer political
constraints to maintain local industrial employment (Walder 1995, p. 286). 
TVE workers are mostly local or migrant farmers who have a piece of 
farmland to return to at home if they are laid off. For TVE workers, 
industrial jobs are a means to improving their standard of living, not a 
means of subsistence. Besides, these underprivileged former peasants were 
quite used to the vicissitudes of the labor market. Urban SOE workers, 
on the other hand, had been the aristocrats of the Chinese working class 
and were used to “iron rice bowls” and a cradle-to-grave welfare system 
that could even be passed from generation to generation. They had already 
been humiliated when they experienced a relative decline in income, fringe 
benefits, and status and became very resentful when, in the mid-1990s, 
reform deprived them of the last vestige of their privileged status—life- 
long job tenure. Thus, city governments were under much greater pressure 
to bail out unprofitable firms in order to maintain employment. 

The third feature, corporate size and scale, is seen to affect the ability
of local governments to monitor their firms and to constrain their resources 
for bailing out unprofitable firms. Township and village corporations are 
usually small in size and in scope; this enables officials to more closely 
monitor the operations of their enterprises and thus to reduce managerial 
indiscretion and lower agency costs (Walder 1995). A municipal govern- 
ment, for instance, typically has a very large number of enterprises within 
its jurisdiction, which would weaken its incentive and capacity to monitor 
each individual enterprise (Walder 1992, 1995). It has to turn to “arm’s- 
length” governance or rely on multilevel supervision, that is, creating 
secondary- or even tertiary-level monitoring agencies. Additional admin- 
istrative levels, however, also tend to increase information distortion and 
further diffuse property rights and monitoring efforts (Walder 1995, p. 
288). Managerial indiscretion and insider control eat into SOE assets and 
profits. Contrary to the informal privatization approach, which views 
managerial autonomy as an incentive-enhancing device, the corporate 
governance approach views slack in managerial contracts as generating 
agency cost. Walder (1995, p. 289) maintains that it is precisely the capacity 
of lower-level government to monitor public enterprises and enforce its 
property rights that has improved economic performance.*

The small scale of lower-level government corporations may also help 
to toughen budget constraints, by limiting officials’ ability to redistribute 
funds from healthy to failing firms (Walder 1995). It is not uncommon 
for township-village governments to pay off debts for TVEs, but their 
limited wherewithal reduces their tolerance for chronic loss makers. The 
larger industrial base in a higher-level government would give public 
officials much greater leeway to cross-subsidize failing enterprises. 

This is in fact a very strong argument because it implies that the neg- 
ative impact of scale has to be so strong that it negates the general effect 
of economies of scale. Thus, I generate the first testable hypothesis (the
diseconomies-of-scale hypothesis): 

HYPOTHESIS 1a.—The size and scale of industrial corporations at the 
township-village level are negatively correlated with firm productivity. 


Market Discipline 


Neither of the above two camps would dispute the efficiency-inducing
effect of market competition. They differ in whether the coupling of market
with nonprivate ownership is viable. Sachs and Woo (1997), for instance,
would argue that market reform can only succeed by first getting
the property rights right through swift and decisive privatization. Kornai
(1990a) described the matching of market with public ownership as “weak
linkage” that would not last.

* Nee (1992), on the other hand, criticizes this approach as “state centered” because
it ignored the fact that local bureaucrats also prey on enterprises. Note that Nee’s
interpretation of fiscal contracting and profit-sharing mirrors that of Walder. For Nee,
these institutional “innovations” are important because they restricted and limited both
the central and local state’s claim on profits, a precondition for economic growth.

Sharing the local state corporatist conviction about the viability of 
public ownership in a market system, Lin, Cai, and Li (1998, 1999) propose 
a different explanation for soft budget constraint syndrome and agency 
problems associated with public ownership by focusing on external market 
environment and market disciplines. They argue that the agency problem 
due to the separation of ownership and control in public firms is not 
different from that observed in large corporations in the West and can 
be mitigated by indirect market monitoring rather than direct bureaucratic
monitoring. According to modern agency theory, information asymmetry
arises when ownership and control are separated; managers have
intimate knowledge of the firm, whereas owners do not. The manager 
can use that knowledge to improve a firm’s profitability or for his own 
advantage, which may conflict with the interest of the owner(s). Thus, 
information asymmetry gives rise to agency problems such as moral haz- 
ard, insider control, and managerial indiscretion. Diffusion in ownership 
further weakens each individual owner’s incentive to oversee the oper- 
ation of the firm due to free-rider problems. Detailed monitoring in large 
corporations is either impossible or prohibitively costly. Market compe- 
tition overcomes information asymmetry by making relative profit rates 
a reliable summary indicator of the manager’s performance. Thus, man- 
agerial incentive can be aligned with the owners’ interest by linking the 
manager’s compensation and future wages (reputation) with the firm’s 
financial performance (Holmstrom 1982; Fama 1980). 

But this mechanism is not working in China’s SOEs (Lin, Cai, and Li 
1998, 1999). As reform progressed, state-owned firms, especially small 
local SOEs, were thrown onto the market to compete among themselves 
and with the TVEs. Intensified competition eroded the monopoly profits 
of SOEs but did not mitigate the agency problems such as insider control 
and state asset stripping. The problem is that SOE profits or losses may 
not reflect managerial competence and effort due to policy burdens such 
as irrational price control, redundant and retired workers, and bureaucratic
intervention. Unable to distinguish policy losses from mismanagement
losses, the state had to assume full responsibility. This gave the
government a rationale to yield to the manager’s demand for subsidies and
policy loans. Hence the vicious cycle of policy burdens, subsidies, agency 
problems and bureaucratic intervention. Soft budget constraints and 
agency costs seem to reinforce each other and aggravate each other’s 
effects. The availability of soft loans leaves state-asset stripping unchecked,
while managerial indiscretion and insider control amplify the debilitating
and destabilizing effects of soft budget constraints. Without a level
playing field, managerial contracts in SOEs did enhance the managers’
profit incentive, but they did not align that incentive to a focus on improving
efficiency. After all, it was much easier to maximize income by
bargaining for more favorable contractual terms and “policy loans” than
by improving efficiency and cutting losses. When bargaining became the
rule of the game, even the conscientious managers had to do it just to avoid
being left out.

If we extend this logic to TVEs, the inference is obvious: TVEs are 
subject to the harsh disciplines of market competition and carry few policy
burdens. Thus, TVE managers are held accountable for the financial 
performance of their enterprises. Knowing that they cannot blame anyone 
or anything for their losses and that they have to sink or swim under 
market competition, TVE managers focus their attention on streamlining 
production, cutting costs, and adopting new technology. Managerial con- 
tracts in TVEs have resulted in a relatively efficient incentive alignment. 

Contrary to the local state corporatist approach, which attributes TVE 
success to closer bureaucratic monitoring, the market discipline approach 
would argue that bureaucratic monitoring of managers is not only in- 
effective given the large corporate size, but it may even be counterpro- 
ductive because it provides the managers with excuses for making losses 
and bargaining for subsidies and loans. TVEs are more productive than 
SOEs because more (not less) managerial autonomy and fuller exposure 
to market disciplines achieved ex ante incentive alignment. This interpretation
is also consistent with the 1991/1992 World Bank survey data,
which show that TVE managers enjoy far greater autonomy (i.e., less monitoring)
than their SOE counterparts (Jefferson, Zhang, and Zhao 1999).

Che and Qian (1998) pointed out that even if product markets are 
perfectly competitive, an inefficient firm can still survive as long as 
it faces soft budget constraints. Instead of focusing on internal cross- 
subsidizing, however, they examined the relationship between local gov- 
ernments and local branches of the state banks to look for sources of soft 
budget constraints on firms. They noted that, as the grassroots levels in
the administrative hierarchy, township and village governments did not
have any authority over the local branches of the state banks. They could
not engage in deficit financing or tap into the seemingly fathomless resources
of state banks to finance or refinance unprofitable projects (Che
and Qian 1998; Jefferson 1998; Byrd and Zhu 1990). In the 1980s, rural
officials were able to influence Rural Credit Cooperatives in their jurisdictions
(Whiting 1996). However, these Rural Credit Cooperatives have
small funding capacity, limited to local deposits, and have to account for
their own losses and gains. While municipal governments had both incentive
and ability to harangue local branches of the state bank to pump
money into large and bleeding SOEs, township-village governments had
neither.

1 Both Walder (1995) and Lin et al. (1998) argued that nonfinancial objectives and
policy burdens (e.g., employment maintenance and welfare provision) constrained the
state’s ability to enforce financial discipline over SOEs. But their focuses differed
somewhat. For Walder, nonfinancial interests made governments dependent upon firms
for nonfinancial outputs and increased firms’ bargaining power so that the governments
could not close down the firm. For Lin et al., policy burdens distorted market competition
and worsened information asymmetry between the state and managers so that
the state could not tell a good manager from a bad one and therefore could not enforce
market disciplines.

Unlike bank loans, which looked like a “free lunch” both to city gov- 
ernments and to SOEs, internal cross-subsidizing is an increasingly strin- 
gent source for bailouts because it does not involve obvious externalities 
under conditions of fiscal contracting and profit sharing. Profit sharing 
divided SOE profits into a part handed over to the local government and 
a part retained by the SOEs. Local state officials are not eager to let go 
of their share of tax revenue and extrabudgetary revenues, the source of 
their bonuses, fancy office buildings, and imported cars. Further, these 
officials do not have legitimate rights over the share of profits retained 
by the SOEs. None of the money-making SOEs part easily with their 
retained profits without some form of compensation. When it does occur, 
“internal cross-subsidizing” usually comes in the form of the “reorganizing 
and restructuring” of assets (zichan chongsu), which may not induce ef- 
ficiency to the same degree as mergers in the Western market system, but 
does entail some loss of control on the part of the receivers. 

To summarize, if corporate size and scale impair direct monitoring but 
not indirect market monitoring and if external bank borrowing rather 
than internal cross-subsidizing is the source for soft budget constraints, 
then the diseconomies-of-scale hypothesis is unnecessary. Thus, the market
discipline approach does not contradict the economy-of-scale hypothesis.

HYPOTHESIS 1b.—The organizational size and economic scale of in- 
dustrial corporations at the township-village level do not adversely affect 
the firm’s productivity. 


COMPARING VILLAGE AND TOWNSHIP CORPORATIONS 


Even though grouped under the same category, township enterprises and 
village enterprises differ in many aspects. Most existing empirical analyses 
(except Svejnar 1990 and Dong and Putterman 1997) simply assume that 
they are the same and thus overlook many institutional and organizational 
differences that are theoretically relevant. Here, I compare village and
township enterprises from different theoretical perspectives. 

The most important distinction between townships and villages is that 
the township government is part of the formal state bureaucracy and the 
village government is not. Township cadres are on the state payroll; village 
cadres are not. Township heads, appointed by the county government, 
often come from outside of the township where they hold office (Whiting 
1996). Village heads almost always come from within the village; al- 
though their appointment has to be officiated by township officials, they 
do need to have the support of the villagers. The legitimacy of village 
leaders’ power comes from their villagers’ support, and the state will not 
normally remove them as long as they have that support. 

In the sense that townships are integrated into the state apparatus, the 
notion of local state corporatism is applicable here. A typical township 
government contains three key structures: the Party committee, government
office, and a township economic commission that oversees all business
activities within the township (see chart in Ho 1994, p. 211). Staff
heavily overlap in the three structures, however. In advanced regions like 
Southern Jiangsu, the township industrial corporation is the most im- 
portant branch in the economic commission and is usually headed by a 
party boss. All the township-owned industrial enterprises are under this 
corporation (Ho 1994; Ma et al. 1994). In the 1980s, most of these cor- 
porations adopted profit-sharing contracts with their enterprises. Market 
competition intensified in the mid-1990s and drove down the profit margins,
forcing local officials to switch to the more efficient (and more “private”)
rental contracts (Zhu 1998).

The picture of the village collective (cun jiti) is indeed ambiguous. In
Huang’s (1993) words, it is a “third realm” where the state and the society
interpenetrate and the line between the public and private spheres becomes
blurred. In sunan villages, the appropriate description ranges from
“informally private” (Nee and Su 1996) to “vaguely defined cooperatives”
(Weitzman and Xu 1994). On the one hand, property rights are partitioned
between the cadre-entrepreneurs who occupy village offices and the residents
of the village community. Whether entrepreneurs-turned-cadres or
cadres-turned-entrepreneurs, village leaders control village corporations,
which entails private profits and privileges. In return, they promise to
make a profit to share with the villagers, the nominal owners. It is a form
of communal ownership. On the other hand, village corporations are
informally private in the sense that control is notoriously entwined with
family-kinship networks (jiazu hua). Family or kinship organization often
blends with, or is even superimposed upon, the trinity of the Party, the
government, and the industrial corporation (Lin 1995, p. 339). Even
though no available data indicate the exact extent, researchers frequently
run into “collective” corporations in which the main accountant is the
wife, daughter, or mistress of the big boss. It is sometimes hard to tell
whether it is the village government or the incumbent family that controls
the village corporation.

2 According to my own fieldwork in developed regions, township heads are often
college graduates or retired military officers who are placed in township offices for
observation and evaluation before further promotion.

3 In one of my field trips to Jiangyin, I asked about the reason for switching from
profit-sharing contracts to rental contracts; the township fiscal auditor said, “How
could I know how much profit they [township enterprises] actually made?”

I illustrate the intricate relationship among the state, the village cadre, 
and the village residents with two examples. Example 1: the most famous 
village in sunan, Huaxi village, built identical tract houses for all villagers,
with a Santana—a family car made by Shanghai-Volkswagen—in every 
garage. The houses were allocated in an egalitarian fashion according to 
the size of households. The party boss, Wu Renbao, who engineered the 
Huaxi success, had four sons and one daughter helping him to run Huaxi 
Corporation. Between 1993 and 1997, the township government approved 
a 0.8-1.6 million yuan bonus for Mr. Wu each year. He declined to accept 
the bonuses, out of modesty or arrogance. 

Example 2: the architect of Daqiu village (another famous North China 
location, near Tianjin, where Nan Lin [1995] did his fieldwork), Yu 
Zuomin, built an empire around an elaborate network of family, kinship, 
and personal ties (Lin 1995). Yu was so carried away by his success and 
power that he got implicated in a murder case and blatantly blocked 
police investigations. Eventually he was arraigned in court and jailed in 
1993. In an obvious attempt to extend state power into the village, the 
county government decided to upgrade the village to a township with 
four village divisions and sent in “state cadres” to staff the new township 
government. According to one of the four village heads, this helped break 
down the family control of the corporation. 

Table 1 highlights the different interpretations of the institutional and 
performance features of village, township, and municipal enterprises from 
the three theoretical approaches discussed above. Despite their different 
characterization of the nature of ownership of TVEs, both the informal 
privatization and local state corporatist approaches would predict that
village enterprises should perform better than township enterprises. For
the informal privatization approach (Nee 1992; Nee and Su 1996), village
enterprises should do better because villages go a lot further on the road
of informal privatization. Since the security of property rights correlates
positively with the stability of social networks in which these rights are
embedded, logically, the social networks in the villages should be more
stable than those at the township level, hence the more secure informal
property rights for village entrepreneurs.

4 Weitzman and Xu (1994) describe the collective TVEs in China as “vaguely defined
cooperatives.” They emphasize the fact that residents of rural communities are the
nominal owners of TVEs and share the profits in the forms of dividends, job opportunities,
welfare provisions, etc. They argue that private rights of TVEs need not be
well defined because the Chinese can solve their collective problems via a “cooperative
cultural tradition” and “mutual trust.”

TABLE 1
DIFFERENT THEORETICAL INTERPRETATIONS OF PRODUCTIVITY
PERFORMANCE AT THREE LEVELS

                                   Village            Township      Municipal
Informal privatization:
  Private property rights          Most secure        Less secure   Least secure
  Managerial incentive             Strong             Moderate      Weak
  Efficiency                       Very high          High          Low
Local state corporatism:
  Public property rights           Least attenuated   Attenuated    Most attenuated
  Corporate scale                  Small              Medium        Large
  Bureaucratic monitoring          Strongest          Strong        Weak
  Cross-subsidizing                Limited            Moderate      Large
  Managerial indiscretion          Limited            Moderate      Large
  Efficiency                       Very high          High          Low
Market discipline explanation:
  Policy burdens                   Few                Few           Many
  Market discipline                Harsh              Harsh         Lenient
  Access to soft bank loans        No                 No            Yes
  Budget constraints               Hard               Hard          Soft
  Alignment of manager’s incentive Aligned            Aligned       Misaligned
  Efficiency                       High               High          Low

From the local state corporatist point of view (Walder 1995; Oi 1999), 
village government’s public property rights over enterprises are least at- 
tenuated by central regulations and thus most secure (Walder 1995, p. 
280). Given the small scale of village corporations, village governments 
have the greatest ability to supervise the detailed operation of their en- 
terprises and enforce their property rights. Village governments are often 
merged with enterprise management, and monitoring is very detailed. In 
such cases village leaders are the general managers and directly control 
the operation of enterprises—enterprise managers are more like branch 
heads and have little decision power. Usually the village headquarters 
controls all profit and expenditure of the enterprises, so that managerial
indiscretion and agency costs are less of a problem. Furthermore, village
corporations have the least resources for cross-subsidizing. Thus, whether
due to more secure private property rights of managers of village enterprises
or to more secure public property rights of village governments, both
approaches lead to the same prediction.

HYPOTHESIS 2a.—Village enterprises outperform township enterprises 
in total factor productivity. 

The market discipline approach, on the other hand, would be consistent 
with an alternative prediction. Here, ownership form, either public or 
communal or private, is of secondary importance. Full competition and 
financial market discipline are most important. Because both township 
and village enterprises operate under hard budget constraints and compete 
in the same input and output markets, neither can afford to be less efficient 
than the other. Therefore, information asymmetry between government 
and enterprise managers at the township level, presumably more severe 
than that at the village level, can be remedied by indirect market mon- 
itoring and incentive alignment. Empirically, Svejnar (1990) and Dong 
and Putterman (1997) found similar productivity and technical efficiency 
between township, village, and private ownership. Peng (1992) reported 
that township and village/private enterprises were governed by a similar 
wage determination equation different from that in the urban state sector. 
Thus, we have a competing hypothesis: 

HYPOTHESIS 2b.—Village and township enterprises do not differ
significantly in terms of productivity.


DATA AND VARIABLES 


The site of this research is Jiangyin in the Southern Jiangsu region. Chang- 
shu, Jiangyin, Wuxi, and Zhangjiagang made up the vintage “Southern 
Jiangsu model” (sunan moshi). The four adjacent county-level cities had 
an early start to rural industrialization in the 1970s and are now ranked
highest in rural industrial output per capita in China. They shared similar 
institutional and policy environments, with city governments playing the
role of a developmental state, township-village governments actively
engaging in setting up and managing rural enterprises, and private
entrepreneurial activities discouraged until 1994. Although Jiangyin's
experience with rural industrialization cannot be generalized to the
whole country, it is certainly representative of the sunan model.
The data set used here comes from the 1993 original statistical reports 
to the Jiangyin municipal bureau of industry and municipal bureau of
TVEs. It covers all industrial enterprises at the municipal, township, and 
village level within the city. Table 2 provides summary information about 
the data set. According to table 2, the original data set contains records
of 3,478 enterprises, out of which 56 are owned by the municipal gov- 
TABLE 2
DESCRIPTIVE DATA, LOCAL GOVERNMENT CORPORATIONS, JIANGYIN, CHINA, 1993

                                                                        Total/
                                     Village    Township    Municipal   Average
Administrative units                 489        31          1           521
                                     (537)      (31)        (1)         (569)
Total N of enterprises               2,143      819         52          3,014
                                     (2,534)    (888)       (56)        (3,478)
N of enterprises per corporation     (4.7)      (28.6)      (56)        (6)
Sales income per enterprise          3,870,000  14,784,000  54,091,000  7,702,000
Fixed assets per enterprise          835,000    3,293,000   19,951,000  1,833,000
N of employees per enterprise        56         192         676         104
Fixed assets per worker              14,900     17,150      29,500      17,600
Sales income per worker              69,100     77,000      80,000      74,060
Wage per worker (yuan)               3,160      3,270       4,750       3,390

NOTE.—Except for administrative units and total N of industrial enterprises, amounts shown are
averages. Figures in parentheses are raw data before cleaning.


ernment, 888 are rural industrial enterprises owned by 31 township gov- 
ernments (including 1 suburban district), and 2,534 are owned by 537 
village governments (including 10 village-level units). Excluding 412
enterprises with missing values and 52 cases with inconsistent data or extreme
values, 3,014 enterprises nested under one municipal, 31 township, and 
489 village governments are retained for the following analysis. Forty- 
eight villages are excluded from analysis either because no valid enterprise 
record remains after data cleaning or because the whole village is recorded 
as a single firm, when the “firm” apparently consisted of a number of 
enterprises. For instance, none of the 29 villages in the township of show- 
zhuang reported any labor or wage data on its 133 village enterprises and 
therefore all were removed. 
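
The case counts reported above and in the data-cleaning note can be checked against one another. The short sketch below only re-adds the figures given in the text and in table 2; the variable names are illustrative and not part of the original analysis.

```python
# Counts taken from the text, table 2, and the data-cleaning note.
raw_by_level = {"municipal": 56, "township": 888, "village": 2534}
retained_by_level = {"municipal": 52, "township": 819, "village": 2143}

# Cases removed at each cleaning step, in the order given in the note.
removed_by_step = {
    "missing output value": 27,
    "missing capital asset or labor": 319,
    "missing industrial code": 66,
    "inconsistent data": 29,
    "extreme regression outlier": 23,
}

raw_total = sum(raw_by_level.values())
removed_total = sum(removed_by_step.values())
retained_total = raw_total - removed_total

# The text groups the first three steps as "missing values" and the
# last two as "inconsistent data or extreme values".
step_counts = list(removed_by_step.values())
missing_value_cases = sum(step_counts[:3])
inconsistent_or_extreme = sum(step_counts[3:])

print(raw_total, removed_total, retained_total)
print(missing_value_cases, inconsistent_or_extreme)
```

The totals agree with the 412 missing-value cases, the 52 inconsistent or extreme cases, and the 3,014 retained enterprises reported in the text.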

Because they own and operate enterprises, village, township, and city 
governments are simultaneously administrative organs and industrial 
“corporations.” Thus defined, there were 569 distinct industrial corpo- 
rations in Jiangyin in 1993 (each village in Jiangyin reported at least one 
enterprise). Even though, administratively, village governments are nested 
under township governments, a township “corporation” does not include 


15 The process of data cleaning followed these steps: first, 27 observations were deleted
for missing output values; second, 319 cases were deleted for missing capital asset or labor
or both; third, 66 cases were deleted for missing industrial codes; fourth, 29 cases were
deleted due to data inconsistency; and finally, 23 extreme regression outliers were
thrown out. The total number of bad cases is 464.
enterprises owned by villages within the township jurisdiction. Similarly,
the city corporation does not include enterprises owned by townships.¹⁶

The administrative hierarchy is, nevertheless, important. Consistent 
with Walder’s (1995) observation, the size and scale of these corporations 
increases as we move up the administrative hierarchy. On average a vil- 
lage corporation owned about 4.7 enterprises; a township corporation 
owned about 29 enterprises; and the city government operated 56 factories. 
The average size and scale of individual enterprises in terms of capital 
asset, labor, and output also increase as we ascend the administrative 
ladder. But note that the average wages in village and township enter- 
prises are similar, while both are substantially lower than that in city- 
owned enterprises. These figures are consistent with published statistical 
data (Jiangyin Yearbook Committee 1993, pp. 130, 416).
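
The enterprises-per-corporation figures quoted above follow directly from the raw counts in table 2. A minimal sketch of that arithmetic, with illustrative names that are not from the original:

```python
# (raw number of enterprises, raw number of administrative units), from table 2.
raw_counts = {
    "village": (2534, 537),
    "township": (888, 31),
    "municipal": (56, 1),
}


def enterprises_per_corporation(counts):
    """Average number of enterprises owned by one government at each level."""
    return {level: n_ent / n_units for level, (n_ent, n_units) in counts.items()}


per_corp = enterprises_per_corporation(raw_counts)

# Overall average across all 569 corporations.
total_enterprises = sum(n_ent for n_ent, _ in raw_counts.values())
total_units = sum(n_units for _, n_units in raw_counts.values())
overall = total_enterprises / total_units

for level, value in per_corp.items():
    print(f"{level}: {value:.1f}")
print(f"overall: {overall:.1f}")
```

These ratios use the pre-cleaning counts, which is why the corresponding row of table 2 appears in parentheses.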


Measurement 


The original data set contains both enterprise records and records for 
township-village subtotals. Highly detailed checking was carried out to 
ensure the quality of the data. Reported township-village subtotals were 
generally consistent with those computed from enterprise level data. In 
the few cases where village subtotals were not reported, computed sums 
were substituted. To ensure reliability, three measures of output were used: 
gross value of output, sales income, and value added. 


Enterprise-Level Variables 


Gross value of output is the total value of output (before sales), including 
the value of material inputs, measured at 1993 market price. Sales income 
is the gross income after sales, excluding output value in the storage 


16 My definition of local state corporatism here differs from Oi’s (1999, p. 103) view 
of the whole jurisdiction of a county as a corporation, with county government as the 
headquarters and townships and villages as divisions and subsidiaries. Thus defined, 
the county corporation would include all the village and township enterprises. My 
definition of local state corporation uses a strict ownership criterion. The county
government does not own the township or village enterprises in any real sense of the word
because (1) it does not have any claim over their residual income except collecting 
local taxes, and (2) it is not involved in the management and disposal of those assets. 
A county government does indeed oversee all business activities (even the private ones) 
in its jurisdiction to make sure that they fall within the legal and policy boundaries 
and sometimes, if requested, may even help some large TVEs to obtain bank loans, 
technology, and market information. But this is more administrative supervision than
corporate control. Township governments collect a small management fee from village 
enterprises, usually no more than 1% of the net profits. They are not residual claimants
of village enterprises, as they are of township enterprises, in the sense of profit sharing 
or capital charges. 
inventory.¹⁷ Value added is the net value of output before sales, excluding
the cost of material inputs, measured at 1993 market prices.¹⁸ Capital
is the net value of all fixed assets at current prices. In some cases where 
the net asset value is missing but the original asset value available, the 
latter is substituted after depreciation according to village or township 
average depreciation rate. Labor is the average number of employees in 
the enterprise for the whole year. If this value was missing, the year-end 
figure was used if available. Average wage is the total wage bill (including 
bonuses) divided by the total number of employees. Most TVEs use a 
piece-rate wage system and do not tie labor pay to profits. Thus, average
wage is a good proxy for labor quality and labor cost. Industrial sector
is obtained using information on major products, equipment, and factory 
names. It is first coded into the Chinese standard two-digit classification 
of industries (Sun, Wang, and Li 1994) and then regrouped into (1) food 
processing, textile, and garments, (2) chemical, (3) metallurgy, (4) manu- 
facturing, (5) construction, transportation, and utilities. 
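
The note on missing sales income describes a regression-based (conditional mean) imputation. The sketch below applies the reported coefficients to a single hypothetical firm. It assumes natural logarithms, which the note does not state, and the function names are illustrative only.

```python
import math

# Coefficients reported in the note on imputing sales income.
INTERCEPT = -0.093
SLOPE = 0.985


def impute_log_sales_income(gross_output_value: float) -> float:
    """Predicted log sales income for a firm with the given gross output value.

    Assumes natural logs (an assumption; the note does not specify the base).
    """
    if gross_output_value <= 0:
        raise ValueError("gross output value must be positive")
    return INTERCEPT + SLOPE * math.log(gross_output_value)


def impute_sales_income(gross_output_value: float) -> float:
    """Back-transform the predicted log value to yuan.

    This is a plain exponentiation with no retransformation correction.
    """
    return math.exp(impute_log_sales_income(gross_output_value))


# Hypothetical firm with a gross output value of 1,000,000 yuan.
example = impute_sales_income(1_000_000)
print(round(example))
```

Because the slope is slightly below one, imputed sales income rises a little less than proportionally with gross output value.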


Corporate-Level Variables 


Corporate administrative level refers to the public ownership at the vil- 
lage, township, and (county) municipal levels. Corporate number of firms 
refers to the number of enterprises owned by the same village, township, 
or municipal government (according to counting before data cleaning). 
Corporate employment size refers to the total number of employees of all 
industrial enterprises within the same village, township, or municipal 
jurisdiction. Corporate capital assets refer to the total capital assets of all 
industrial enterprises within the same village, township, or municipal 
jurisdiction. 


METHODS 


Multilevel analysis is needed for exploring the effect of corporate level 
characteristics on firm performance. Ordinary least squares (OLS) regression is
inappropriate in this case because (1) firms within the same corporation 


17 Twenty-eight otherwise valid observations have missing values for sales income.
Using conditional mean imputation methods described in Little and Rubin (1987), I
estimated their log sales income according to the following regression equation:
log(sales income) = −0.093 + 0.985 × log(gross value of output), with R² = 0.946 and
N = 2,910.

18 I recalibrated the value added if the ratio of the originally reported value added
to gross output value was less than 5% or more than 95%, according to the following
equation: log(value added) = −1.13 + 0.93 × log(gross output value), with R² =
0.865 and N = 2,676. I recalibrated 371 cases this way.
are inherently clustered, that is, subject to the same disturbance; and (2)
treating corporate level variables as the same as firm-level variables in- 
flates the number of observations. Both situations would lead to an un- 
derestimation of standard errors in OLS. Thus, I specify the following 
multilevel models. 


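$$\ln y_{ij} = A_j + \beta \ln k_{ij} + \gamma \ln l_{ij} + \lambda \ln w_{ij} + \sum_{h=1}^{4} \eta_h S_{hij} + \varepsilon_{ij} \tag{1a}$$

$$A_j = \alpha + \rho C_j + \tau T_j + u_j \tag{1b}$$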


In equation (1a), $y_{ij}$ stands for per-worker gross value of output, sales
income, or value added of the $i$th firm in the $j$th corporation ($i =
1, 2, 3, \ldots, n_j$ and $j = 1, 2, 3, \ldots, 518$); $k_{ij}$ for capital input per worker; $l_{ij}$ for
total number of employees; $w_{ij}$ for average wage; and $S_{hij}$ for a set of
dummy variables for industrial sectors, with $h = 1, 2, 3, 4$. As specified, the
coefficient for labor indicates returns to scale. With the constraint $\gamma = 0$ (i.e.,
constant returns to scale), this would conform to the standard Cobb-Douglas
production function. $A_j$ in (1a) is an intercept, indicating total
factor productivity in this case. Its subscript suggests that it is a random
effect and varies from corporation to corporation. Equation (1b) specifies
it as a function of corporate administrative levels. $C_j$ and $T_j$ in (1b) are
dummy variables for county ownership and township ownership,
respectively. Both the enterprise-level error term $\varepsilon_{ij}$ and the corporate-level error
term $u_j$ are assumed to be normally and independently distributed. The
variance of $u_j$ captures the residual correlation within the same
corporation.¹⁹
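
Why the labor coefficient measures returns to scale can be seen by rewriting (1a) in levels. The following is a sketch only. It assumes that the per-worker variables are total output and total capital divided by the number of employees, and the symbols $Y_{ij}$ and $X_{ij}$ are introduced here for illustration and do not appear in the original. The wage and sector terms are unaffected and are abbreviated by dots.

```latex
% Assumed notation (not in the original):
%   Y_{ij} = total output of firm i in corporation j,
%   X_{ij} = total capital of the same firm,
% so that y_{ij} = Y_{ij} / l_{ij} and k_{ij} = X_{ij} / l_{ij}.
\begin{align*}
\ln \frac{Y_{ij}}{l_{ij}}
  &= A_j + \beta \ln \frac{X_{ij}}{l_{ij}} + \gamma \ln l_{ij} + \cdots \\
\ln Y_{ij}
  &= A_j + \beta \ln X_{ij} + (1 - \beta + \gamma) \ln l_{ij} + \cdots \\
\text{sum of capital and labor elasticities}
  &= \beta + (1 - \beta + \gamma) = 1 + \gamma
\end{align*}
```

Under these assumptions, $\gamma = 0$ corresponds to constant returns to scale, $\gamma > 0$ to increasing returns, and $\gamma < 0$ to decreasing returns.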

To explore the impact of corporate size and scale on enterprise per- 
formance, we only need add a few more contextual variables to (1b). 


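$$\ln y_{ij} = A_j + \beta \ln k_{ij} + \gamma \ln l_{ij} + \lambda \ln w_{ij} + \sum_{h=1}^{4} \eta_h S_{hij} + \varepsilon_{ij} \tag{2a}$$

$$A_j = \alpha + \rho C_j + \tau T_j + \sigma \ln N_j + \theta \ln K_j + \pi \ln L_j + u_j \tag{2b}$$
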
$N_j$ in (2b) stands for number of enterprises in the $j$th corporation; the
upper case $K_j$ for total capital assets; $L_j$ for employment size (i.e., total


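19 This will become apparent if we combine the fixed and random components into a
single equation. Substituting (1b) into (1a), e.g., would give

$$\ln y_{ij} = \alpha + \rho C_j + \tau T_j + \beta \ln k_{ij} + \gamma \ln l_{ij} + \lambda \ln w_{ij} + \sum_{h=1}^{4} \eta_h S_{hij} + u_j + \varepsilon_{ij}.$$

Thus, the total disturbance of the $i$th firm in the $j$th corporation would be $e_{ij} =
u_j + \varepsilon_{ij}$. Thus, for any two firms $i$ and $i'$ in the $j$th group, $\mathrm{cov}(e_{ij}, e_{i'j}) = \mathrm{cov}(u_j + \varepsilon_{ij}, u_j +
\varepsilon_{i'j}) = \mathrm{cov}(u_j, u_j) + \mathrm{cov}(u_j, \varepsilon_{i'j}) + \mathrm{cov}(\varepsilon_{ij}, u_j) + \mathrm{cov}(\varepsilon_{ij}, \varepsilon_{i'j}) = \mathrm{var}(u_j)$.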
industrial labor force). Both equations (1a–1b) (model 1) and equations
(2a–2b) (model 2) are estimated via the restricted maximum-likelihood
method using the SAS PROC MIXED procedure (see Singer 1998; Kreft and de
Leeuw 1998).
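
The clustering argument behind the multilevel specification can be illustrated with a small simulation. The sketch below is not the estimation procedure used here and does not reproduce PROC MIXED. It only draws disturbances of the form "corporate-level term plus firm-level term" and checks that two firms in the same corporation covary by roughly the corporate-level variance. The variance values, seed, and function name are arbitrary assumptions.

```python
import random


def within_group_covariance(n_groups, var_u, var_e, seed=0):
    """Empirical covariance between the disturbances of two firms that
    share one corporate-level random effect."""
    rng = random.Random(seed)
    sd_u, sd_e = var_u ** 0.5, var_e ** 0.5
    first, second = [], []
    for _ in range(n_groups):
        u = rng.gauss(0.0, sd_u)                   # shared corporate-level term
        first.append(u + rng.gauss(0.0, sd_e))     # disturbance of firm 1
        second.append(u + rng.gauss(0.0, sd_e))    # disturbance of firm 2
    mean_first = sum(first) / n_groups
    mean_second = sum(second) / n_groups
    cross = sum((a - mean_first) * (b - mean_second)
                for a, b in zip(first, second))
    return cross / (n_groups - 1)


VAR_U, VAR_E = 0.5, 1.0                  # illustrative values, not estimates
cov = within_group_covariance(20_000, VAR_U, VAR_E)
intraclass_corr = VAR_U / (VAR_U + VAR_E)

print(f"empirical within-corporation covariance: {cov:.3f}")
print(f"implied intraclass correlation: {intraclass_corr:.3f}")
```

Whenever the corporate-level variance is nonzero, firms in the same corporation are not independent observations, which is the reason OLS standard errors would be too small.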


FINDINGS 


Models 1 and 2 are estimated for gross value of output, sales income and 
the value added, and the results are presented in table 3. All three mea- 
sures of output yield similar coefficient estimates, indicating that the re- 
sults are robust against differences in measurement. 

The following four findings in table 3 should be noted. First, consistent 
with previous studies, TVEs show a higher total factor productivity than 
municipal SOEs. In all six equations, TVEs generate more output, sales 
income, and value added than do local SOEs with the same amount of 
capital and labor inputs. Due to the small sample size of SOEs in this 
study, the productivity differential is only significant in two out of six 
equations. 

The observed productivity differential should be qualified on two fronts. 
On the one hand, SOEs have technological advantages over TVEs. The 
finding of superior productivity of TVEs implies that SOEs’ superior 
technology and technical personnel do not translate into greater
productivity. As shown in table 2, SOEs have a much higher capital-labor ratio
than the TVEs in Jiangyin. According to Lin, Cai, and Li (1996), TVEs 
tend to employ labor-intensive technology that reflects the comparative 
advantages of China’s resource endowments. Even though SOEs have 
higher output per worker, TVEs use capital more efficiently and have 
higher total factor productivity. On the other hand, SOEs may have some 
“unfair” disadvantages, primarily in terms of redundant workers and un- 
productive investment (like housing). Jefferson (1999), for instance, found 
that the total factor productivity of TVEs (in 1990) was higher in all but 
one out of seven industrial sectors than that of SOEs. The differential 
disappears after controlling for not only nonproductive investment in 
housing, but also institutional variables such as the level of supervision, 
proportion of self-marketing, management system, and commitment of 
collateral by managers, and so on. 

Second, either with or without controlling for corporate size and scale, 
village enterprises do not outperform township enterprises in productivity, 
except in model 1 for sales income, in which township enterprises outperform
village enterprises. I have argued that local state corporatism correctly 
describes township-owned enterprises, and informal privatization is prob- 
ably a more appropriate description of village-owned enterprises. Thus, 
the fact that township enterprises are equal in productivity to village
enterprises and superior to urban SOEs suggests that ownership 
form—public or communal or private—is not an overriding factor in the 
performance of China’s rural industry, at least not at the stage under 
discussion. Privatization may be the best solution to efficiency, but a large 
efficiency gain is feasible within decentralized government ownership. 

Third, the size and scale of village and township industrial corporations 
do not seem to affect productivity adversely. The coefficients for the
number of enterprises in corporations and total size of workforce are
insignificant across the board. The corporate total capital stock seems to have
a consistently positive, albeit small, effect on the productivity of individual 
TVEs. Figure 1 visualizes the relationship of total factor productivity in 
output value and corporate size and scale. I refrain from theorizing about 
the increasing return to the corporate total assets. It suffices here to con- 
clude that the organizational size and economic scale of local government 
corporations do not adversely affect productivity. This leaves policy
burdens, market disciplines, and access to bank loans as the most likely
factors explaining the differential productivity between TVEs and SOEs. 

Last, please note that the coefficients on labor are generally positive 
except for model (2) of value added, which yields an insignificant negative 
estimate. Given the way the production function is defined in (1a) and
(2a), a positive coefficient on labor indicates slightly increasing returns
to scale at the enterprise level. As constant or slightly increasing returns
to scale are expected of this kind of data, the result should also be taken
as an indication of data quality. 


CONCLUSION AND DISCUSSION 


Even though the superior productivity performance of TVEs over SOEs 
is not a new finding, my analysis has made three contributions: (1) the 
use of local government (corporation) characteristics as contextual vari- 
ables; (2) the finding of increasing returns to corporate scale; and (3) the 
finding of similar performance between enterprises owned by township 
governments and those owned by village governments. I conclude that 
these findings are consistent with the market discipline approach, which 
emphasizes incentive alignment under fully enforced labor, product, and 
financial market disciplines. 

Accordingly, I offer two revisions of Walder’s (1995) argument. First, 
TVEs had fewer agency problems, not due to more effective direct bu- 
reaucratic monitoring, but due to more effective indirect market moni- 
toring. When ownership is separated from control, information asymmetry 
renders direct monitoring ineffective or prohibitively costly. But market 
monitoring did not work for SOEs because they were not placed on a
level playing field and shouldered idiosyncratic policy burdens. Second, 
local government’s external borrowing power from the state bank better 
explains the softness of budget constraints on SOEs. Internal cross- 
subsidizing within the local state corporations is probably limited and 
more calculated when it happens. 

Public ownership entails potential externality because it is nonexclud- 
able (too many residual claimants taking too much) and nondiminishable 
(i.e., losses are replenished by fiscal or financial subsidies). As Jefferson
(1998) points out, privatization is neither a sufficient nor a necessary 
condition for eliminating public enterprise externality; hardening budget 
constraints (cutting off fiscal/financial subsidies) is a necessary but not 
sufficient condition. Privatization may be the best remedy, but it was 
neither feasible in China during the 1980s given the general legal 
and political environment, nor was it viable in the absence of a well- 
functioning property rights market. Partitioning of property rights 
between the central state and various levels of local governments en- 
hanced profit incentive but ushered in agency problems that could have 
worsened excludability problems. The reform experience of public enter- 
prises at different levels represents a whole spectrum of responses de- 
pending on how diminishable the public assets are and how rivalrous 
their consumption is. 

On the village level, collective enterprises are definitely diminishable 
and have natural excludability due to the clear boundaries of villages. 
An efficient solution depends on whether village residents and village 
leadership (party boss, village head, and enterprise managers) can work 
out some form of income sharing arrangement, based not on legally de- 
fined private ownership, but on the informal institution of mutual trust. 
Villages with a strong collective spirit will have an advantage. On this 
account, I agree with Nee’s argument of informal privatization and Weitz- 
man and Xu’s characterization of “ambiguously defined cooperatives.” 
But it is a bit farfetched to argue that the collective spirit comes from a 
“cooperative tradition” in Chinese culture. Confucianism teaches loyalty 
among family and kinship members but not trust of strangers or respect 
for contracts with them. I suggest that collective spirit can only come 
from two sources: strong kinship networks that may have survived
communist rule (Huang 1993) and a socialist collective legacy that survived
agricultural decollectivization in some regions in China, where rural in- 
dustrialization had an early start in the 1970s. 

On the township level, informal privatization or a cooperative solution
is not feasible because, first of all, the township government is more
integrated with the state bureaucracy than with the local community. Second,
the nominal ownership by all the township residents is too diffuse to 
exercise any monitoring function or to directly benefit from township
industries—besides through public facilities and better employment op- 
portunities. Income-sharing agreements between the township leadership 
and the residents are untenable. Under such conditions, local state cor- 
poratism became a viable alternative in which a combination of local 
government property rights and hard budget constraints succeeded 
in aligning the incentive of public officials to market-oriented growth- 
promoting behavior. The experience of township industrial corporations 
suggests that public ownership externality can be reduced by strength- 
ening labor, financial, and product market disciplines. 

As SOEs gradually grew out of the plan in the early 1990s, they com- 
peted fiercely with each other and with the TVEs. They could afford to 
be inefficient and unprofitable for the time being because they still enjoyed 
subsidies and faced weak financial disciplines. The situation has been 
changing since the second half of the 1990s. The banking reform in 1996
severed ties between the state bank branches and local governments. By 
1996 the lifetime job security for Chinese urban workers was as good as 
gone (Rawski 1999, p. 145). We can only speculate as to whether large SOEs
could acquire the ability to survive on their own in market competition 
by following in the footsteps of TVEs. 

History may have vindicated Nee’s market transition theory (Nee 1991, 
1992, 1996) and other convergence theories (e.g., Sachs and Woo 1997), 
when in 1997 the Fifteenth Congress of the Chinese Communist Party 
proclaimed that private ownership is an important component of “so- 
cialist” market economy. The Chinese institutional innovation represented 
by sunan moshi is to become a fond memory of experimentalists and 
market socialists. With the recent wave of privatization sweeping China, 
even the officials in Southern Jiangsu are no longer proud of the label 
sunan moshi and actually shy away from it. The new policy regarding 
public enterprises is to retain the large and “let go” (privatize) the small 
(most of which are already insolvent or on the brink of bankruptcy). The 
glory of sunan moshi will soon fade in the memory of politicians, but its 
institutional legacy may linger in the reflective minds of academia for 
some time to come. 
Career Advancement as Party Patronage:
Sponsored Mobility into the Chinese 
Administrative Elite, 1949-1996* 


Bobai Li and Andrew G. Walder 
Stanford University 


Core features of mobility regimes are obscured by models common 
in comparative research. Party patronage in China is apparent only 
in the timing of career events. Elites are chosen from among party 
members, but only some are eventually chosen. Those who join the 
party while young enter a career path that includes sponsorship for 
adult education and more likely promotion. While the party’s pref- 
erence for youth from “red” classes has yielded to one for prior 
education, party sponsorship endures. Because patronage blurs dis- 
tinctions between politics and merit, it confounds interpretations of 
returns to individual attributes. 


Although interest in the subject has surged following recent changes in 
the world, research on stratification and mobility in communist states has 
a long history. Most of our attention is now trained on the consequences 
of these distinctive economic institutions and their recent transformations, 
but the role of political institutions—in particular, ruling communist par- 
ties—is a subject of equally vital importance. Despite decades of com- 
parative mobility studies based on survey data from these regimes, anal- 
ysis of the role of politics in career processes is still in its infancy. This is 
largely because data on individual party membership did not become
available until the mid-1980s (Szelényi 1987), by which time for most 
countries the subject had assumed largely historical significance. However, 
in those countries where communist parties have relinquished power, the 
subject still presents questions of genuine comparative and historical im- 
portance. Where the party still rules, the subject remains vital for un- 
derstanding the social consequences of current economic transformations. 

Political inequalities were by no means neglected by students of com- 
parative mobility, and the lack of data on party membership did not block 
the research agenda. The party’s control over career opportunities was 
not ignored; it instead constituted a central if implicit role in defining the 
core intellectual problem. In early studies of communist regimes, the dis- 
parity between party elites and nonparty masses was a central topic of 
interest (Marcuse 1958; Rigby 1968; Sorokin 1959). Scarce aggregate data 
nonetheless made clear the marked advantages of party members in career 
advancement and the allocation of resources, and this led generations of 
scholars to conclude that political inequalities based on party affiliation 
were central to stratification under state socialism (Bauman 1974; Connor 
1979; Feldmesser 1960; Goldthorpe 1966; Parkin 1969; Walder 1985, 
1986). When survey data from such countries as Hungary and Poland
became available in the 1970s, the lack of individual-level data on party 
membership still did not block the comparative agenda. Instead, knowl- 
edge of the party’s system of power and privilege served to frame the 
intellectual problem, that is, given that communist regimes had political 
and economic institutions that differed in fundamental ways from those 
of other industrial societies, did they nonetheless exhibit a similar pattern 
of mobility that linked educational achievement to occupational
attainment (Moore 1944, 1950; Inkeles 1950)? The first generation of
survey-based mobility studies appeared to answer this question decisively in the
affirmative (e.g., Ganzeboom, De Graaf, and Robert 1990; Giddens 1973; 
Inkeles and Bauer 1959; Parkin 1971; Meyer, Tuma, and Zagórski 1979; 
Simkus 1981; Treiman and Yip 1989). Inevitably, however, the perennial 
absence of party membership as a variable in mobility models excused 
researchers from carefully conceptualizing its role in the processes of status 
attainment. 

When individual data on party membership finally became available 
in the mid-1980s, interest in the subject revived. For the first time, it was 
possible to directly examine the relationships among family background, 
educational attainment, party membership, and occupational mobility, 
and to shed new light on old questions that had heretofore been addressed 
only indirectly. The first studies have generally conceived of party mem- 
bership as a credential to be earned by individuals and whose impact is 
analogous to, and compared with, that of education. Some would examine 
the role of family background, education, and occupation in attaining
party membership (Szelényi 1987; Walder 1995). Others would treat party 
membership as an intervening variable between family background and 
occupational outcomes (Blau and Ruan 1990; Lin and Bian 1991). Some 
studies, based on a conception of a single status hierarchy, have modeled 
the attainment of occupational prestige or an elite occupation and have 
shown that party membership has an effect independent of that of edu- 
cation (Blau and Ruan 1990; Lin and Bian 1991; Massey, Hodson, and 
Sekulic 1992). Studies of China have shown that the effects of party 
membership and education vary by type of career. Party membership is 
irrelevant for mobility into elite professions but vital for entry into the 
administrative elite, while education is paramount for professional oc- 
cupations but only moderately important for administrative posts (Walder 
1995; Walder, Li, and Treiman 2000). 

Treating party membership as a credential is a convenient modeling 
solution. It permits comparison of “returns” to party membership and 
education, and in the tradition of comparative mobility research, it pro- 
vides an accounting device for the extent to which career mobility is 
affected by principles other than the “meritocratic.” This approach makes 
two simple assumptions about party membership: first, that it operates 
as a qualification, much like a college degree, for which candidates for 
promotion into “closed” positions are screened (Sørensen 1983); and
second, that it operates as a “signal” about unobserved attributes of the
candidate, much as a college degree acts as a signal of unobserved abilities 
or other valuable personal qualities (Arrow 1973; Spence 1973; Stiglitz 
1975). Party membership is in this sense a rough indicator of loyalty to 
the regime and worthiness to receive rewards (Walder 1995, p. 312). Ac- 
cording to this minimalist approach, a candidate’s party membership 
indicates two things to the party officials who oversee personnel decisions 
in work organizations: that at some point the individual showed interest 
in party membership through efforts to meet the party’s standards for 
recruitment (participation in meetings, cooperation with party officials, 
and the display of the proper political and personal demeanor), and that 
at some point in the past a party committee in some organization certified 
that the individual’s behavior and background qualified him or her for 
membership (Walder 1995, pp. 312-14). 

These ideas provide conceptual scaffolding for estimating the effects of 
party membership, but they leave unanswered key questions about the 
attainment of party membership and the role this plays in individuals’ 
careers. How exactly does party membership operate as a credential? Is 
it obtained by individuals based on open competition, or is it systemat- 
ically granted by the party and party officials based on family background 
or other political considerations? Is it primarily an alternative career strategy
for those with little education or a qualification that enhances the
careers of all highly motivated individuals? While the regime may enshrine 
party membership as a criterion for advancement, and while we may 
gauge its effects on career mobility, the organizational decisions and in- 
dividual strategies that lead to these associations are still far from clear. 

At the core of the problem is the fact that admission into the party 
does not in itself elevate a member to elite status. Ruling communist 
parties are not small elite clubs but mass political organizations composed 
predominantly of people from ordinary occupations (Schurmann 1968, p. 
138). Aggregate tabulations show that party members enjoy substantial 
privileges over nonparty members, but this is primarily because those in 
elite occupations join at much higher rates (Rigby 1968; Szelényi 1987; 
Walder 1995). People from a variety of occupational backgrounds are 
incorporated into the party as a matter of policy. Moreover, those who 
join are well known to vary widely in their degree of commitment to the 
regime and its political program and in their motives for joining (see, e.g., 
Shirk 1982). Some are politically committed believers and join for patriotic 
motives; some are ambitious young political activists who seek party mem- 
bership simply as a means to advance their careers; and others may join 
passively in response to party recruitment efforts in their place of work. 
Some join early in life, while others join in mid- or late-career. In short, 
the relationship of individual members to the party organization may 
vary enormously in unobserved ways, and while we have strong reasons 
to suspect such variations systematically affect career outcomes, no one 
has addressed the problem with relevant data. 


SPONSORED MOBILITY THROUGH PARTY PATRONAGE 


Party membership does not fit readily into the conceptual apparatus of 
comparative mobility research. Such research has long been founded on 
the distinction between ascription (or inherited status advantages) and 
achievement (based on meritocratic competition). The core research 
agenda has been to gauge the extent to which these principles influence 
educational attainment and intergenerational status inheritance across 
types of economic and political systems, and through time, as societies 
urbanize and industrialize. Ascription is the label for advantages or dis- 
advantages attached to persons by birth, but which continue to affect 
their life chances from childhood on. Ethnicity, gender, parental education, 
and occupational status are measurable attributes of individuals that have 
long had a demonstrable impact on life chances. Achievement (or universalism)
is the label given to processes presumed to be based on competition,
in which individual motivation and ability are assessed based
on performance (obviously, this is an ideal-type). 

Party membership, in contrast, represents neither a quality given by 
birth nor an achievement-based indicator of ability. To some extent, one 
can treat party membership as a status achieved through motivation, 
effort, and competition, and that may be conferred partly according to 
perceived leadership potential.² But it is essentially a relationship to a
national political organization with branches in almost all places of work 
(and, in part, the officials who staff such branches) for which party mem- 
bers are rewarded. Particularism is the concept usually reserved for the 
principle that one’s life chances are affected by a relationship with those
with wealth or power. In this case, party loyalty is a form of “principled” 
particularism characteristic of modern political machines (Walder 1986; 
Walder et al. 2000). What has differentiated state socialist from other 
political machines is the enormous scope of the party’s patronage: at one 
point, they controlled the vast majority of the nation’s productive wealth 
and career opportunities. 

To properly relate party membership to the ideal-types that motivate 
comparative mobility research does little more than to help frame our 
questions. We still need propositions that help us to understand how this 
“particularistic” principle of advancement and reward might be related 
to the more familiar ascriptive and merit-based principles in an analysis 
of career processes. We propose a model of party-sponsored mobility that 
focuses on the timing of party membership in an individual’s life. This 
model, based loosely on ideas about elite mobility in England first offered 
by Turner (1960), posits that individuals are chosen by an elite relatively 
early in life according to some combination of ascriptive and behavioral 
characteristics, and that individuals so chosen at an early age enjoy sub- 
sequent advantages in certain forms of educational attainment and career 
advancement. It assumes that different kinds of people join the party 
early in life rather than later and that subsequent opportunities depend 
on the timing of membership. Party membership is therefore taken as a 
marker that an individual has been selected for potential sponsorship, but 
the important feature of this model is that only early party membership 
brings these career advantages. In other words, party membership does 
not operate as a credential that has the same effect regardless of the timing 
of its attainment within one’s career. If in fact the effects of party mem- 
bership vary systematically according to its timing, then standard mobility 
models may partially misconceive and mismeasure a mobility regime in 


² For this reason, party membership is treated as a form of “political capital” analogous
to “human capital” in some recent efforts to analyze income determination in planned 
and transitional economies (Liu 1999; Xie and Hannum 1996). 
which patronage and particularism play a central role through the life
course. 

We propose that party membership should be conceived not as a cre- 
dential but as something roughly analogous to membership in a club that 
can confer advantages upon members throughout their lives. In Turner’s 
(1960) conception of “sponsored” mobility, favored status is granted to 
individuals by an established elite according to the “supposed merits” of 
individuals. Candidates are selected early in life, and they are put onto 
a separate path of career advancement. Turner suggested that elite mo- 
bility in England was “sponsored” in that children of the elite were placed 
in separate and exclusive schools at an early age, schools that provide 
large subsequent advantages in entering the corporate, legal, and civil 
service elite. This pattern presumably contrasted with “contest” mobility, 
a system in which ability-based competition is predominant at each step 
in the educational ladder.³ What makes the notion of “sponsored” mobility
relevant to the case of party membership in a socialist state is the fact 
that some group exercises control over the allocation of elite status and 
that there are explicitly observed criteria of elite selection other than
educational attainment.⁴ If party members are “sponsored” in this fashion,
then the timing of party membership should have observable effects on 
individuals’ subsequent careers. 

The time-dependent implications of Turner’s concept of sponsored mo- 
bility have been developed further by Rosenbaum (1976, 1979, 1984), who 
conceives of career advancement in a corporate hierarchy as a series of 
contests, or “tournaments,” through the life course. A victory in an early 
tournament qualifies one for competition in the next, an evidently path-dependent
process in which events early in the career can alter the outcomes
of subsequent competition for career advancement.⁵ If we conceive


³ Turner’s conception has not had a large impact on comparative mobility research
except for comparisons of education in the United States and England (e.g., Kerckhoff
and Everett 1986; Kinloch 1969; Tang 1992; Turner 1975; for exceptions, see Kerckhoff
1974, 1990; Raffe 1979; Winfield et al. 1989). Research has converged on the conclusion
that the educational systems of both countries contain elements of both contest and
sponsored mobility (Rosenbaum 1976, 1979, 1984; Useem and Karabel 1986) and that
in the end mobility outcomes are largely similar in England and the United States
(Kerckhoff 1990; Treiman and Terrill 1975; Winfield et al. 1989).

⁴ As Turner (1960, p. 858) noted, “system[s] of sponsored mobility develop most readily
in societies with but a single elite or with a recognized elite hierarchy.”

⁵ In tournament mobility, “careers are conceptualized as a sequence of competitions,
each of which has implications for an individual’s mobility chances in all subsequent
selections. Although tournaments can be constructed with numerous variants in the
rules, the central principle involves an important distinction between winners and
losers at each selection point. Winners have the opportunity to compete for high levels, 
but they have no assurance of attaining them; losers are permitted to compete only 
for low levels or are denied the opportunity to compete any further at all. As in a 
of entry into the party as the first of a series of career “tournaments,” then
the analogies with the tournament model are also clear. This is apparently 
a promising conceptualization of how party membership might affect 
career outcomes under state socialism. But does the conception fit the 
observable career processes of state socialist societies? We therefore con- 
front a series of qualitatively new research questions in the form of testable 
hypotheses. 


UNRAVELING THE PROCESS OF PARTY SPONSORSHIP 


Recent research on China has shown that party membership matters only 
for the attainment of elite administrative positions but has no measurable 
impact on the attainment of elite professional positions (Walder 1995; 
Walder et al. 2000). We therefore focus our attention exclusively on the 
administrative career path and further explore the relationships between 
party membership, educational credentials, and occupational mobility, 
established in previous studies that treat party membership as a credential. 
Instead of asking how party members differ from nonmembers in career 
opportunities, we ask how individuals are recruited into the party and 
how party members are subsequently selected for positions of adminis- 
trative power. 


The Timing of Party Membership 


Are party members selected early in their careers, or is party membership 
a credential for which individuals compete as part of a career-long quest 
for advancement? While party membership has long been thought to yield 
career advantages, only a minority of party members eventually attain 
elite administrative posts, given the large number of party members and 
the limited availability of administrative positions. This fact leads nat- 
urally to the suspicion that some party members are selected for future 
leadership positions, while others are selected for different reasons. What 
determines the likelihood that a party member will eventually become an 
elite administrator? While status attainment models may suggest such 
factors as education and parental status, the concepts of sponsored and 
tournament mobility require us to look closely at the timing of party 
membership in an individual’s career. 

The central feature of sponsored mobility is that selection occurs early 
for a separate career path to eventual elite status. Turner (1960, pp. 


contest model, winners must continue competing in order to attain high levels, for 
there is no assurance; but as in a sponsored model, early selections have irreversible 
consequences for losers” (Rosenbaum 1979, pp. 222-23). 
859-60) argued that early selection is designed to cultivate the appropriate
elite manners, loyalty, and ideology. According to Rosenbaum (1990, p. 
292), however, early selection and subsequent competitions are a mech- 
anism that permits a firm to make efficient investments. By eliminating 
a large number of people from contention in the beginning, tournaments 
reduce the number of candidates that require an investment of a firm’s 
resources, providing a way of rationing and sequencing firm investments 
for an ever-smaller cohort of candidates. Both rationales can be readily 
applied to elite selection in socialist states where the cultivation of loyalty 
and the constraints of resources are even more pronounced. Moreover, 
although party membership itself signals political worth, party members 
continue to be observed and evaluated. Because loyalty and reliability 
are not immediately observable, screening involves a long process. Given 
that member recruitment is the primary channel through which the party 
selects loyal candidates for leadership positions, party members are likely 
to be sponsored in ways parallel to Turner and Rosenbaum’s accounts. 
If party members are sponsored, the party will focus recruitment efforts 
on the young. The more recruitment is focused on the young, the more 
it approximates a pattern of sponsorship; the more evenly it is spread 
over the first third to half of the career, the less it approximates a pattern 
of sponsorship. We therefore offer two plausible hypotheses, the first of 
which is more representative of a sponsorship pattern than the second, a 
null hypothesis that people join the party at a relatively constant rate 
until midcareer (a null hypothesis of constant rates across the career would 
be unrealistic).

HYPOTHESIS 1.—The party concentrates on recruiting members while 
they are still very young. Understanding that those who join while young
have the best future prospects, individuals are most interested in joining 
while still young, but their interest declines rapidly thereafter. In this 
“sponsored” pattern, observed rates of joining the party will be highest 
shortly after beginning the work career and will decline steadily afterward. 

HYPOTHESIS 2.—The party recruits members actively up through the 
midcareer, but after the midpoint in a person’s career, they are no longer 
considered good prospects. Individuals understand that party membership 
is a credential that brings roughly the same career benefits so long as they 
join before middle age, therefore they have a relatively constant interest 
in joining up to this time. In this nonsponsored pattern, rates of party 
membership are relatively high up through the midcareer, and begin to 
decline thereafter. 
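
The contrast between the two hypothesized patterns can be sketched with stylized rate curves. The following is a minimal illustration only: the function names, the functional forms, and every numeric value (starting rates, decay rate, and a midcareer point of 15 years) are assumptions made for the example, not quantities taken from the data. The `early_share` helper is a crude summary that compares summed yearly rates rather than counts of actual events.

```python
import math


def sponsored_rate(years_worked, start_rate=0.03, decay=0.10):
    """Hypothesis 1 (stylized): rate is highest at career entry, then declines."""
    return start_rate * math.exp(-decay * years_worked)


def credential_rate(years_worked, level=0.015, midcareer=15, decay=0.10):
    """Hypothesis 2 (stylized): roughly constant rate up to midcareer, then decline."""
    if years_worked <= midcareer:
        return level
    return level * math.exp(-decay * (years_worked - midcareer))


def early_share(rate_fn, early_years=5, horizon=30):
    """Share of the summed yearly rates that falls in the first `early_years` years."""
    early = sum(rate_fn(year) for year in range(early_years))
    total = sum(rate_fn(year) for year in range(horizon))
    return early / total


# All parameter values are hypothetical defaults chosen for illustration.
for name, fn in (("sponsored", sponsored_rate), ("credential", credential_rate)):
    print(name, round(early_share(fn), 3))
```

Under these assumed shapes, recruitment is more heavily concentrated in the first years of work in the sponsored pattern than in the credential pattern, which is the difference the two hypotheses are meant to distinguish.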
Do Early Entrants Have Different Characteristics from Later Ones? 


In Turner’s work, selection for sponsorship was heavily influenced by 
ascription: Children from elite families were favored in admissions to the 
elite schools. From 1949 to 1978, the Chinese Communist Party officially 
favored offspring of families headed by members of former exploited social 
classes (workers and peasants) and “revolutionary cadres, soldiers, or mar- 
tyrs” who fought for the revolution before its victory, in party recruitment, 
school admissions, and job assignments. Some research suggests that in 
fact this policy provided cover for favoritism toward the children of high 
officials, most of whom were senior enough to be counted as “revolutionary 
cadres” (Kraus 1981; Unger 1982). Therefore it seems highly plausible 
that party recruitment may have been a mechanism for elite reproduction. 

In Rosenbaum’s “tournament” model, on the other hand, mobility con- 
tests are decided based on competition and performance. This resonates 
also with the fact that, for most periods in China, it took serious effort 
to earn party membership, and especially in the Mao era, young adults 
competed vigorously to display the kind of conformity and political ac- 
tivism that was explicitly judged in the recruitment of party members 
(Shirk 1982; Unger 1982; Walder 1986). One attribute that was frequently 
judged was demonstrated leadership and organizing ability. To the extent 
that early selection into the party is determined by ascriptive character- 
istics such as “class background,” it conforms to Turner’s definition of 
sponsorship; to the extent that selection is based on individual ability or 
formal education, it conforms more closely to Rosenbaum’s definition of 
tournament mobility. Therefore there are two potential ways in which 
early and late entrants may differ: according to social background and 
according to educational attainment. 

The principle of “counterselection” according to politically favored fam- 
ily backgrounds has been understood in two distinct ways by students of 
socialist regimes. Studies of eastern Europe have usually conceived of 
counterselection as a program designed to enhance the mobility of people 
from working-class and peasant backgrounds and to remake patterns of 
career and intergenerational mobility along more egalitarian grounds (Ma- 
teju 1993; Szelényi 1998, pp. 11-15). Studies of China, however, have 
tended to view such policies as a cover for a process whereby party elites 
transfer their high status to their own children (Shirk 1982; Unger 1982; 
Lee 1991; Deng and Treiman 1997). In either form, elite selection is based 
primarily on family background, rather than on educational credentials 
or prior occupational attainment. The first possible pattern of recruitment 
is therefore one of “counterselection”: 

HYPOTHESIS 3.—Party sponsorship is a mechanism for consolidating 
revolutionary power: “counterselection” will determine the recruitment of 
young members for career sponsorship. Those from politically designated
family categories will be favored over others, while prior educational at- 
tainment will be unimportant. To the extent that those with higher levels 
of education are recruited, they will tend to join later in their careers. 

A second pattern is a “technocratic” one, in effect the opposite of the 
first. Many studies of mature communist regimes have noted a trend 
toward the recruitment of people with higher education. It has been de- 
scribed variously as the formation of a new technocratic elite (Bailes 1978; 
Lee 1991) or as a merger of the intelligentsia with the political ruling class 
(Konrád and Szelényi 1979). In this pattern, the party recruits heavily 
from those with higher education, and college graduates who join while 
relatively young are favored for important administrative posts. 

HYPOTHESIS 4.—Party sponsorship is a mechanism for the formation 
of a technocratic elite: early members are selected heavily from among 
those with higher educations. Those from “red” households are not selected 
preferentially at an early age. 

We state these two competing hypotheses at the outset because it has 
long been assumed that the first will characterize the initial years of a 
communist regime, while the second will emerge in a later period. Our 
model posits only that early members will be sponsored, but it does not 
require any assumptions about who becomes those early members. The 
question of whether party membership exhibits the characteristics of spon- 
sorship is prior to the question of whether selection of individuals for 
sponsorship is based on counterselection or technocratic principles. One 
obvious possibility, long heralded by many observers of communism, is 
that technocratic patterns supplant counterselection through time. 


Career Attainment and the Timing of Party Membership 


To the extent that career advancement takes the form of party patronage, 
those who are selected as members early in their lives should enjoy sub- 
sequent career advantages. Joining the party early signals a higher degree 
of commitment and also permits a more extended process of observation, 
cultivation, and training. Therefore, other things being equal, those who 
join the party while young enjoy higher odds of subsequent promotion 
into an elite administrative position than those who join the party in 
midcareer. If, on the other hand, party membership enhances the odds of 
promotion regardless of the age at which it is earned, those who enter 
the party at midcareer should enjoy the same subsequent career advan- 
tages as those who join near the beginning of their career. Indeed, the 
desire to obtain a career-capping promotion to a leadership position may 
be a primary reason why someone would seek to join the party in 
midcareer. 
HYPOTHESIS 5.—If the party seeks to identify candidates for career
sponsorship early and indeed does treat early entrants more favorably,
those who join the party early in their careers will increase their odds of
advancement into elite administrative (cadre) positions. Those who join
the party later in their careers will not increase their odds of such a
promotion.

If those who join the party early in life are more likely to be promoted 
into leading posts, they may consider party membership as more central 
to their identity and may as a consequence of their success feel stronger 
loyalties to the party organization. Those who join later, however, will 
be more likely to have been admitted because of prior success in their 
careers and their rising prominence in their work organization. For them, 
party membership may be little more than a career accessory or an hon- 
orary award, while the party’s motivation may be primarily to coopt 
people who have emerged as important figures in their places of work. 


Party Membership and Educational Opportunity 


To the extent that the party recruits young members according to the 
pattern of “counterselection”—favoring those from politically designated 
family backgrounds at the expense of those with higher education—it will 
be faced with a pool of candidates for sponsorship into elite positions who 
are insufficiently educated. A relatively straightforward solution to this 
problem is to send the younger recruits back to school for continuing 
education. This practice conforms to Turner’s notion of sponsored mo- 
bility in that it would be individuals selected according to ascriptive stan- 
dards that are allocated educational opportunity. It also resonates with 
Rosenbaum’s tournament model, in which those who win early compe- 
titions are selected to receive further investments in training. After the 
revolution, the Chinese communist government established an extensive 
system of adult education (e.g., party schools, worker’s colleges, television/ 
correspondence/night colleges, and vocational training programs) in which 
working adults were sent back to school for further education (Ministry 
of Education 1984, pp. 575-627). Adult education provides a potential 
channel through which the party invests in the education of current and 
future administrators. Therefore, early selection into the party may lead 
to career advancement through opportunities for adult higher education. 

HYPOTHESIS 6.—Young party members who have not already attended 
college will improve their chances of returning to school for college edu- 
cation—an important mechanism through which sponsorship affects ca- 
reer outcomes. 

The sponsoring of young party members for continuing education is a 
potentially important mechanism whereby the demand for political loyalty 
may be reconciled with the educational requirements for leadership and
management. If this hypothesis is supported by the evidence, we will have 
altered the ways in which we have posed questions about the relative 
role of education and party membership in career advancement. For we 
will no longer be able to interpret a positive association between a college 
education and a higher administrative position as a straightforward mea- 
sure of a meritocratic process, even in the kinds of event-history models 
used in recent studies (see, e.g., Walder et al. 2000). If early recruitment 
into the party brings enhanced educational opportunities, then the treat- 
ment of party membership and education as conceptually equivalent cre- 
dentials becomes highly problematic, and gauging their relative effects 
on career mobility may be more complicated than comparing the mag- 
nitudes of their respective coefficients. The reason is that educational 
attainment may itself be part of the process of sponsorship enjoyed by 
those who join the party while young. Moreover, estimates of the effect 
of education on recruitment into the party may mask two qualitatively 
different processes: early recruitment of those without a college education 
who will be sponsored subsequently for education and career advance- 
ment, and later recruitment of those with college education who have 
already attained professional success. If in fact the relationship between 
party membership, education, and promotion into administrative posi- 
tions is so highly dependent on the timing of events in the career, then 
there is one final logical implication for what we should observe in career 
patterns: 

HYPOTHESIS 7.—Those who receive a college education in this “spon- 
sored” fashion will be much more likely to be promoted into cadre positions 
than those who receive a formal college education before entering the
workforce. 

In other words, if we divide college graduates into two groups—those 
who receive higher education through the normal educational ladder and 
those who join the party while young and are subsequently sent back to 
college by their workplaces to complete their education—we should ob- 
serve that the latter group is more likely to be promoted eventually into 
an elite administrative position. 


DATA AND RESEARCH DESIGN 


Uncovering the processes of party-sponsored mobility evidently requires 
life-history data and event-history analysis. Cross-sectional data and con- 
ventional regression models will not do because we need to make dis- 
tinctions about the timing of different career events, especially the timing 
of higher education, party membership, and promotion into an elite administrative
post. Because recent work has demonstrated that career patterns
in China have been altered due to shifting state policies across
historical periods (Walder et al. 2000; Zhou, Tuma, and Moen 1996), life-history
data are essential in order to distinguish the effects of timing within
the career from period effects. Our analyses employ career and educational 
history data from a 1996 survey of a nationally representative sample of Chinese
adults. All regions of the People’s Republic of China except
Tibet were included in the sampling frame. The survey used a multistage 
sampling design, and the primary sampling unit was the county-level 
(xian ji) jurisdiction as defined by the Chinese census bureau. Through 
multistage sampling procedures, the survey obtained a representative 
sample of all adult residents (ages 20-69) registered as “urban” or “rural” 
nationwide. Field interviews were conducted for a total number of 6,473 
cases (for details, see Treiman 1998). We employ only the urban sample 
of 3,087 cases because rural society has few party members and few 
organizations with career lines that can be organized by party committees. 

The data set contains detailed information about respondents' educa- 
tional and career histories, which enables us to model life events in the 
ways suggested by the conceptions of sponsored and tournament mobility. 
We use duration-dependent event-history models to investigate whether 
recruitment into the Chinese Communist Party exhibits the time-depen- 
dent patterns suggested by the sponsored mobility model (hypotheses 1 
and 2) and whether early and late recruitment into the party are based 
on different standards (hypotheses 3 and 4). We then use nonparametric 
models to determine whether young recruits have qualitatively different 
careers than later recruits (hypotheses 5, 6, and 7). Because our analyses 
involve different types of event-history models and different sets of var- 
iables, we will describe them separately as we introduce each of the anal- 
yses to follow. 


THE TIMING OF RECRUITMENT INTO THE PARTY 


Our first question is about the temporal pattern of party recruitment. Are 
rates of joining the party highest during the first few years of work, 
something that would suggest a pattern of sponsorship (hypothesis 1)? Or 
are rates of joining relatively constant up through individuals’ midcareers (i.e.,
during the first 10-15 years of work), a pattern more consistent with the 
proposition that party membership is a credential whose value is relatively 
independent of its timing (hypothesis 2)? To explore these hypotheses, we 
examine recruitment into the party regardless of the historical period in
which it takes place. Figure 1 graphs the hazard rate of joining the party 
along two time dimensions: age and labor force experience. The line of 
FIG. 1.—Age dependence and career dependence of party recruitment


age dependence, which is the hazard function by respondents’ natural 
age, shows a left-skewed bell-shaped pattern, with the hazard rate in- 
creasing rapidly in the late teens, reaching its peak at age 22 or 23, and 
declining steadily thereafter. When the time clock is set to individuals’ 
labor force experience (measured in years after entering the first job), the 
pattern becomes even clearer: The hazard rate is highest at the beginning
of the career and declines almost monotonically over time. 

Table 1 statistically confirms the career-stage dependence pattern of 
party recruitment with Gompertz models, which are suitable for the monotonic
hazard function in figure 1.⁶ The general form of the models is
given as 


$$ r(t) = \exp(\alpha X_{\alpha} + \beta X_{\beta} t), \tag{1} $$


where r(t) is the hazard rate at time t, $X_{\alpha}$ is a vector of covariates with
corresponding time-constant effects α, and $X_{\beta}$ is a vector of covariates with
corresponding time-varying effects β.⁷


⁶ The hazard rate of labor force duration dependence in figure 1 is approximately
monotonic and therefore has to be estimated by parametric models, especially given
our interest in the timing of joining the party. Other models, such as the Weibull and
log-logistic models, are also suitable for such a monotonic function (see Blossfeld and
Rohwer 1995, chap. 7). We employ Gompertz models because they are easier to interpret,
especially for nonproportional models with time-varying effects (e.g., model 3
of table 1).

⁷ A closely related model is the Makeham model, an extension of the Gompertz model
that allows estimates of an extra positive constant (i.e., a Makeham term, say γ > 0)
in addition to the Gompertz terms (i.e., α and β). The model assumes that the hazard
rate is always greater than zero regardless of the length of duration, an unrealistic 
assumption for the case of party recruitment because the rate of joining will become 
zero at advanced ages (e.g., after retirement). 
TABLE 1
MAXIMUM LIKELIHOOD ESTIMATES OF THE GOMPERTZ MODELS FOR PARTY
RECRUITMENT IN URBAN CHINA, 1949-96


                               MODEL 1             MODEL 2             MODEL 3
VARIABLES                  α Term   β Term     α Term   β Term     α Term   β Term
Constant ...............   -4.265    -.037     -6.477     .030     -7.412     .045
                           (.067)   (.005)     (.288)   (.028)     (.321)   (.028)
Gender (male) ..........     ...      ...       1.256     ...       1.203     ...
                                               (.100)              (.101)
Age (in 10 years) ......     ...      ...        .306    -.014       .446    -.017
                                               (.055)   (.005)     (.058)   (.005)
Red background .........     ...      ...        ...      ...        .623     ...
                                                                   (.111)
High school education ..     ...      ...        ...      ...        .498     ...
                                                                   (.102)
College education ......     ...      ...        ...      ...        .243     .046
                                                                   (.238)   (.015)
χ² .....................        66.1               286.5               368.1
df .....................          1                   4                   8


NOTE. SEs are given in parentheses. Unweighted N = 3,079; number of events is 525.


In its simplest specification (i.e., the model without any covariates), if
the constant β = 0, it becomes an exponential model with a constant log
hazard rate of α. If β < 0, however, the log hazard rate is decreasing from
the initial value α (at t = 0), with an annual rate of β; and the opposite is
true if β > 0. Thus, to support hypothesis 1, that party recruitment occurs
mainly in the early career, we need to have a significantly negative β
term.
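
The role of the sign of β can be illustrated with a minimal sketch that evaluates the covariate-free hazard r(t) = exp(α + βt) at a few durations. The function name and the numerical values below are illustrative assumptions, not estimates reported in this article.

```python
import math

def gompertz_hazard(t, alpha, beta):
    """Covariate-free Gompertz hazard: r(t) = exp(alpha + beta * t)."""
    return math.exp(alpha + beta * t)

# Hypothetical values chosen only to show the three cases discussed in the text.
ALPHA = -4.0
for beta in (-0.05, 0.0, 0.05):
    rates = [gompertz_hazard(t, ALPHA, beta) for t in (0, 10, 20)]
    # beta < 0: rates fall with duration; beta = 0: constant (exponential model);
    # beta > 0: rates rise with duration.
    print(beta, [round(r, 4) for r in rates])
```

With β = 0 the three printed rates are identical, which is the exponential special case; a negative β produces the declining pattern that hypothesis 1 requires.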

The models in table 1 require further clarification. In all models, ob- 
servation starts at the first job, with two exceptions. First, for those who 
started working before age 18, the period of observation begins at 18. 
This is because an individual has to be 18 or older to qualify for party 
membership.⁸ Second, for individuals who entered college before entering
the labor force, the observation begins at the time of college entrance.⁹


8 Some people did join the party before age 18, but this was rare and occurred largely
in schools and in the army, rather than in workplaces.

9 College education includes (1) specialized college education, including part-time adult
educational programs through television, correspondence, on-the-job training, night
schools, and professional training, which if completed confer credentials equivalent to
a two-year college degree; (2) regular university education (4 years or more); and (3)
graduate studies at either masters or doctoral level. Here college entrance is defined
as the first time the individual moves directly from secondary school (academic or
vocational) to either of the first two types of tertiary education listed above (graduate
study is excluded).
This is because, as past studies have shown, college students may join
the party at significant rates (Szelényi 1987). Moreover, those who joined
the party before the period of observation are not treated as censored,
but as joining immediately after the initial time (i.e., duration = .5 years).¹⁰
These models are estimated by Transitional Data Analysis (Rohwer
1997).¹¹

Model 1 is the null Gompertz model with two constant coefficients. As 
expected, this model provides a much better fit than the null exponential 
model (χ² = 66.2; df = 1). The β term is significantly negative (β =
−.037; p < .001), indicating that the rate of joining the party is the highest
at the very outset of the work career and declines over time. The initial
rate (i.e., the rate at time 0) is .014 (e^{−4.265} ≈ .014), and it declines at an
annual rate of about 4% (e^{−.037} ≈ .964) for every additional year. In our
data, 17.3% of the sample reported having joined the party, and half of
all party members joined by their eighth year in the labor force. If we
ignore right-censored observations and assume that every individual has
a maximum duration of 40 years, model 1 predicts that about a quarter
of all individuals would eventually join the party, and accordingly, the
predicted median time of joining the party is after 12 years in the labor
force.¹²
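
The figures quoted for model 1 can be checked with a short sketch. It assumes the covariate-free Gompertz hazard r(t) = exp(α + βt) and the corresponding survivor function S(t) = exp[−(e^α/β)(e^{βt} − 1)]; the function names are hypothetical, and the coefficients are those reported for model 1 of table 1.

```python
import math

ALPHA = -4.265  # constant in the alpha term, model 1 of table 1
BETA = -0.037   # constant in the beta term, model 1 of table 1

def hazard(t, a=ALPHA, b=BETA):
    """Gompertz hazard r(t) = exp(a + b * t)."""
    return math.exp(a + b * t)

def survivor(t, a=ALPHA, b=BETA):
    """Gompertz survivor function, valid for b != 0."""
    return math.exp(-(math.exp(a) / b) * (math.exp(b * t) - 1.0))

def time_to_share(p, a=ALPHA, b=BETA):
    """Duration at which the cumulative share 1 - S(t) reaches p.

    Raises ValueError if p exceeds the share ever reached under b < 0.
    """
    return math.log(1.0 - b * math.log(1.0 - p) / math.exp(a)) / b

initial_rate = hazard(0)               # rate at time 0
annual_multiplier = math.exp(BETA)     # year-on-year change in the rate
joined_by_40 = 1.0 - survivor(40)      # predicted share joining within 40 years
median_years = time_to_share(joined_by_40 / 2.0)  # half of eventual joiners

print(round(initial_rate, 3), round(annual_multiplier, 3),
      round(joined_by_40, 2), round(median_years, 1))
```

Under these assumptions the sketch gives an initial rate near .014, an annual multiplier near .964, roughly a quarter of individuals joining within 40 years, and a median of about 12 years among eventual joiners, in line with the text.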

The time dependence revealed in model 1 may be an artifact of unob- 
served heterogeneities (Blossfeld and Rohwer 1995, chap. 10). To minimize 
this, we add two control variables. In model 2, gender is a dummy variable 
with male coded as “1.” Age is measured by respondents’ age (in 10 years) 
in 1996 and is a time-constant variable used to account for cohort effects. 
Model 2 improves the fit over model 1 significantly (χ² = 220.4; df = 3),


10 In our sample, there are 27 such cases, some 5% of the total number of party members.
Technically, these cases are left censored and should be excluded from the analysis.
However, these young party members compose a substantial proportion of the early
recruits who become candidates for sponsorship. For this reason, we retain these cases
in the models. The best way of doing this, given that our time clock is the duration
in the labor force, is to treat these people as joining the party immediately after
beginning the first job (duration = .5 years).

11 The Gompertz models in tables 1 and 2 are estimated by TDA, while subsequent
models are estimated by STATA (StataCorp 1997). The estimations from TDA do not 
adjust for the effects of our survey’s multistage sampling design. 

12 These predicted rates are calculated based on the survivor function,

$$S(t) = \exp\left[-\frac{e^{\alpha}}{\beta}\left(e^{\beta t} - 1\right)\right],$$

where t is duration and α and β are the constant coefficients estimated in the null
Gompertz model. At time 40, the proportion of party members in the sample is 1 −
S(40) = .25. Solving the equation 1 − S(t) = .125 yields the median duration t = 12 years.
indicating strong gender and cohort effects.¹³ Model 2 shows that men
have a substantial advantage (see also Rigby 1968; Szelényi 1987) and
that it persists through the whole career. The positive effect of the age
variable in the α term suggests that earlier cohorts are more likely to join
the party early in their careers than later cohorts. However, the negative
sign of the β term indicates that as the career proceeds the rate of joining
the party declines much more rapidly for older cohorts than for younger
ones. After controlling for cohort effects, the intercept in the β term
becomes insignificant. All of this implies that the time-dependence in party 
recruitment is much more pronounced for earlier age cohorts. It also hints 
that these relationships change across historical periods, something we 
will investigate below. 

These results confirm the patterns illustrated in figure 1, and they sup- 
port hypothesis 1, the pattern consistent with sponsorship, rather than 
hypothesis 2, the pattern consistent with credentialism. People are most 
likely to join the party when very young, and this likelihood declines 
steadily with age. Viewed cross-sectionally, in any given period young 
adults are much more likely to join the party than their older counterparts. 
Party recruitment is concentrated among the very young, as we would 
expect in a sponsored pattern. 


SELECTION CRITERIA FOR PARTY RECRUITMENT 


Do those who join the party while young have different social back- 
grounds from those who join later? Two background characteristics are 
of particular interest. The first is parental status, especially whether young 
party recruits came predominantly from households that the regime 
sought to favor. From 1949 to 1977, party policy explicitly favored people 
from “red” backgrounds, which included workers, peasants, and those 
from households headed by people who had joined the party or Red Army 
before 1949, or who were from elite cadre families (e.g., Kraus 1981; Unger 
1982). Did these ascriptive characteristics influence recruitment into the 
party, and did they do so equally regardless of the age at which they 
joined the party? To the extent that they did, we observe a pattern of 


13 The effect of gender is not statistically significant in the β term and thus is excluded
from the model. The reported model yielded the best fit among all possible model spec-
ifications for the available variables. The same applies to red background and high
school education in model 3.

14 Some previous studies based on aggregate or cross-sectional data have found a
positive relationship between age and party membership in the Soviet Union (Rigby 
1968) and Hungary (Szelényi 1987). Our analysis indicates that these cross-sectional 
correlations are the likely result of longer exposure of older respondents to the risk of 
party membership, which may nonetheless be highest among the young, not the old. 
“counterselection” consistent with hypothesis 3. The second background
characteristic of interest is the level of prior educational attainment. Has 
the party recruited preferentially from among college students and young 
college graduates? To the extent that it has, we observe a “technocratic” 
pattern consistent with hypothesis 4. 

To answer these questions, we add three variables—red background, 
high school education, and college education—to the previous model. Red 
background is a dummy variable with three categories of individuals 
coded as “1”: those from revolutionary families (i.e., revolutionary cadre, 
soldier, and martyr; see Unger 1982, pp. 13-14), those whose fathers were 
elite administrators when the respondent was 14 years old, and those 
whose fathers were party members.¹⁵ Using this measure of red status,
739 cases, or 24% of our sample, come from such red families. College 
education is a time-constant dummy variable including only those who 
went to college before entering the labor force; individuals who took 
college-level courses after working for a period of time (i.e., through fur-
ther education) were excluded. We exclude these individuals because, as
we will show later, there are qualitative differences between regular ed- 
ucation and continuing education that have direct consequences for our 
analysis. Because only 179 cases, or 6% of our sample, have college-level 
education, we include high school education, those who attended senior 
high school before entering the labor force (either academic or technical), 
as an additional measure of education. A total of 829 individuals, or 28% 
of our sample, had some high school-level education. 

The selection model is reported in model 3 of table 1. This is again the 
best fit model (improving from model 2 with χ² = 81.6; df = 4) among
all possible model specifications. People from a red family background
have higher odds of joining the party throughout their careers. Strikingly,
while the party did recruit at higher rates from those with a high school
education (as suggested by the positive and significant coefficient of high
school education in the α term), a college education does not increase the
odds of joining the party early in the career (as indicated by the small
and nonsignificant coefficient for college education in the α term). Unlike
red background and high school education, however, the effect of college
education increases, at an annual rate of about 5% (e^{.046} = 1.047), as
an individual’s career proceeds. At this rate, the effect of college education
will surpass that of red background after 13.5 years (.623/.046 = 13.5) in


15 Past research has suggested that children from working-class and peasant households
did not enjoy the same advantages as the children of the political elite (Unger 1982).
Adding worker and peasant households would mean that more than 80% of our sample
is in the red category. We limit our examination of “counterselection” to those house-
holds with demonstrably close ties to the regime.
the labor force. While red background is important in the early career
and thereafter, late in the career, a college education becomes the para- 
mount predictor of party membership. 
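
The crossover arithmetic can be reproduced with a brief sketch. It follows the calculation in the text, which compares the time-constant red background effect with the time-varying college effect and treats the nonsignificant college coefficient in the α term as zero; that simplification and the variable names are assumptions of this illustration.

```python
import math

RED_ALPHA = 0.623     # red background, alpha term (model 3, table 1)
COLLEGE_BETA = 0.046  # college education, beta term (model 3, table 1)

# Multiplicative growth of the college effect per additional year of career.
annual_multiplier = math.exp(COLLEGE_BETA)

# Years until the accumulated college effect (COLLEGE_BETA * t) equals the
# constant red background effect, ignoring the college alpha coefficient.
crossover_years = RED_ALPHA / COLLEGE_BETA

def log_effects(t):
    """Return (red background, college) effects on the log hazard at duration t."""
    return RED_ALPHA, COLLEGE_BETA * t

print(round(annual_multiplier, 3), round(crossover_years, 1))
```

Before the crossover point the red background effect is the larger of the two; afterward the college effect dominates, which is the career-dependent reversal described above.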

These results strongly support the counterselection model predicted
by hypothesis 3. The party recruits preferentially those young people from 
red households, but older recruits are selected according to different cri- 
teria. Among those who join at later stages in the career, college education 
surpasses red background as a criterion for recruitment. The relationship 
between education and party membership observed in past studies ap- 
pears to mask a sharply differentiated career-dependent pattern. If the 
entire period from 1949 to 1996 is considered as a whole, the college 
educated have been recruited into the party at later career stages than 
those from red households. Those from red households have been spon- 
sored, while the college educated are coopted into the party at a later
career stage.


CHANGING PATTERNS THROUGH HISTORICAL TIME 


The results so far summarize an aggregate pattern for the entire period 
from 1949 to 1996. Although the cohort effects revealed by the age variable 
hint that the time-dependent pattern of party recruitment was stronger 
in the Mao era (1949-76) than in subsequent periods, the extent to which 
these patterns are confounded with temporal shifts in party policies is not 
yet clear. Of particular interest is the question of whether the “counter- 
selection” pattern of recruitment has given way to the “technocratic” one 
so often thought to characterize eastern European regimes and the Soviet 
Union during the 1960s and 1970s. We will now investigate the extent to 
which “counterselection” is restricted to the Mao years. 

Table 2 reports results from the same Gompertz models employed in 
table 1. The models are modified from the earlier ones in the following 
ways. First, we split the durations into two parts, with those occurring 
in the Mao period coded as “0” and the reform period as “1.” Second, we 
include reform period as a dummy variable and its interactions with our 
key variables, red background, high school education, and college edu- 
cation, into the model. Finally, we estimate the model with different def-
initions of historical periods and compare the results.¹⁶


16 Examining historical effects, especially timing-dependent ones, is technically difficult
with Gompertz models due to the conflict between different time clocks (for a brief 
discussion of models with several domains, see Rohwer 1997, p. 145). Therefore, the 
strategies employed here are by no means perfect solutions. The effects of the period 
dummy and its interactions with other key variables can serve only as proxy estimates 
of the differences between the two historical periods. And different periodizations are 
TABLE 2
HISTORICAL VARIATIONS OF PARTY RECRUITMENT IN URBAN CHINA, 1949-96


                                       MODEL 1                  MODEL 2
                                (1949-77 vs. 1978-96)    (1949-87 vs. 1988-96)
VARIABLES                        α Term     β Term        α Term     β Term
Constant .....................   -6.421      .017         -7.424      .072
                                 (.451)     (.032)        (.379)     (.033)
Gender (male) ................    1.202       ...          1.209       ...
                                 (.101)                   (.101)
Age (in 10 years) ............     .311     -.015           .456     -.021
                                 (.080)     (.005)        (.068)     (.006)
Red background ...............     .835       ...           .733       ...
                                 (.164)                   (.130)
High school education ........     .095       ...           .334       ...
                                 (.161)                   (.120)
College education ............    -.425      .073          -.576      .087
                                 (.447)     (.036)        (.348)     (.019)
Reform period ................   -1.098      .049          -.721      .014
                                 (.262)     (.013)        (.344)     (.015)
Reform × red background ......    -.430       ...          -.511       ...
                                 (.219)                   (.248)
Reform × high school .........     .876       ...           .877       ...
                                 (.216)                   (.254)
Reform × college education ...    1.704     -.061          2.733     -.122
                                 (.549)     (.040)        (.551)     (.036)
χ² (df) ......................      409.7 (14)               401.1 (14)


NOTE. SEs are given in parentheses. Unweighted N = 3,079; N of events is 525.


In model 1, 1978 (the year when decisive shifts in party policy against
Maoism began) is designated as the starting point for the reform period.
The results suggest that the patterns of party recruitment reported in table
1 apply largely to the Mao period and have been reversed since 1978.
Holding other factors constant, the positive effect of the reform period in
the β term (β = .049; p < .001) indicates that the pattern of career-stage
dependence in party recruitment has been weakened in the reform period.
Moreover, while the importance of red background has declined, the ef-
fects of education, especially college education, have become increasingly
pronounced in the reform period, as suggested by interaction effects in


employed in order to examine internal heterogeneities within the reform period
(1978-96), on the one hand, and to simplify the model, on the other. In table 2, we
report only the results for the years 1978 and 1988 because 1978 marks the beginning of
the reform period and 1988 presents the sharpest contrasts between the two post-Mao
periods. Moreover, we report the best fit models, which include only statistically sig-
nificant effects and effects directly related to our arguments. Results for other period-
izations are available on request.
the α term. Finally, in the reform period, college education becomes sig-
nificant in early selection, but its effect declines (though not significantly)
over time.

While model 1 considers the post-Mao period as a whole, prior research 
shows that change accelerated in urban areas after a lag of roughly a 
decade (Bian and Logan 1996; Walder et al. 2000). If so, the career-stage 
effects observed in model 1 may still be confounded with period effects. 
After estimating the same model with different starting dates for the 
“reform” period, we have found, consistent with past research, that 1988 
yields the clearest contrasts with least sacrifice of the overall model fit. 
The results for this periodization are shown in table 2, model 2. 

There are two interesting contrasts between models 1 and 2. On the 
one hand, the main effects of reform period, both the magnitude and the 
significance level, are considerably reduced in both terms of model 2. 
These imply that deviation from the time-dependent pattern in party 
recruitment occurred mainly in the early reform period (1978-87). The 
late reform period returns to a pattern of sponsorship observed in the 
Mao era. On the other hand, the main effects of college education and 
its interaction effects with reform period are larger in both vectors in 
model 2. Combining the early reform period with the Mao period in fact 
appears to strengthen the negative effect of college education, while main- 
taining those of red background and high school education in early party 
selection. This suggests that there is no qualitative difference between 
party recruitment patterns in the Mao period and the early reform period 
in terms of selection standards for early recruits. By the same token, the 
interaction effect between the reform period and college education in- 
creases from 1.70 to 2.73, almost triple in terms of odds ratios, in model 
2, meaning that the party did not begin to recruit heavily from among 
young college graduates until the late reform period (1988-96). Moreover, 
in the late reform period, unlike the Mao and early reform periods, college 
graduates have become less rather than more likely to join the party as 
they age. 
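
The comparison of the two periodizations can be made explicit with a small sketch that combines main and interaction coefficients as read from table 2. The dictionary layout and function name are hypothetical, and the sketch takes the usual approach of adding a main effect and its interaction to obtain the effect for the period coded “1.”

```python
import math

# (alpha term, beta term) coefficient pairs as read from table 2.
MODEL_1 = {"college": (-0.425, 0.073), "reform_x_college": (1.704, -0.061)}
MODEL_2 = {"college": (-0.576, 0.087), "reform_x_college": (2.733, -0.122)}

def reform_college_effect(model):
    """Sum main and interaction terms: the college effect in the period coded 1."""
    a = model["college"][0] + model["reform_x_college"][0]
    b = model["college"][1] + model["reform_x_college"][1]
    return a, b

# Change in the interaction effect between periodizations, as a ratio of odds ratios.
odds_ratio_change = math.exp(
    MODEL_2["reform_x_college"][0] - MODEL_1["reform_x_college"][0]
)

late_alpha, late_beta = reform_college_effect(MODEL_2)
print(round(odds_ratio_change, 2), round(late_alpha, 3), round(late_beta, 3))
```

The ratio is a little under three, matching the “almost triple” description, and the combined β coefficient for the late reform period is negative, which corresponds to college graduates becoming less likely to join as their careers proceed.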

The results herald a rather dramatic shift from a sponsored pattern of 
“counterselection” characteristic of the Mao period to a “technocratic” 
pattern that emerges after 1988. The contrast between the two periodi- 
zations leads us to infer that the early reform period was a transitional 
one in which no clear patterns of sponsorship are observed and the long- 
established phenomenon of time dependence in party recruitment was 
temporarily suspended. Such a suspension may be largely due to the 
disruptive effects of the Cultural Revolution, during which party organ- 
izations were severely damaged. To recover, the party had to recruit any- 
one qualified without any bias toward young loyalists. When the time- 
dependent pattern reemerges after 1987, it is the technocratic rather than 
the counterselection pattern. The party has resumed its targeting of in-
dividuals from specific backgrounds for early recruitment. But it is now 
college students and college graduates rather than those from red back- 
grounds who are preferred. 


THE CAREER CONSEQUENCES OF PARTY MEMBERSHIP 


Career-stage differences in party recruitment may be due either to the 
party’s preferences, to individual preferences, or both. On the one hand, 
party branches may consciously seek to identify young party loyalists. On 
the other hand, young adults whose future careers are still far from de- 
termined may pursue party membership more assiduously than their older 
counterparts. Or perhaps those who failed to join early may abandon 
their efforts through time. In any case, we already know from prior re- 
search that party membership is associated with promotion into elite ad- 
ministrative (though not professional) positions (Walder 1995; Walder et 
al. 2000). Is it early rather than later membership that brings such career 
advantages (hypothesis 5)? How much advantage, if any, does early mem- 
bership bring? If early membership brings substantial career rewards, 
then mobility fits the sponsored pattern; if the advantages accrue relatively 
independently of timing, then party membership could be properly treated 
as a credential. 


Timing of Party Recruitment and Selection of Elite Administrators 


To answer these questions, we examine the effect of early party recruit- 
ment on entry into the cadre elite. We use logistic regression instead of 
event-history models in this analysis because standard event-history mod- 
els will produce biased estimates of the effects of early party selection. 
Because those who joined the party earlier would have a longer risk
period (given that early party membership does not mean early pro-
motion into cadre positions), the actual effect of early selection would
be offset and the main effect of party membership would be exaggerated
in standard event-history analysis.¹⁷ Logistic regression models, on the
other hand, do not have similar drawbacks because the duration of the 
risk period is ignored. Therefore, we estimate logistic models with careful 
treatment of the timing of events and censored observations. 

Table 3 reports the relative odds ratios of becoming an elite cadre from 


17 This problem could be avoided by normalizing the duration of party membership
with reference to the potential mean length of risk period. We did try such a procedure
in Cox models, which produced results similar to the logistic models reported here.
logistic regression and robust maximum likelihood estimation.¹⁸ The de-
pendent variable is whether the individual has ever attained an elite cadre
position. Elite cadres in this and subsequent analyses are defined as the 
heads of work organizations, which include individuals with decision- 
making and managerial positions in public agencies and their first-level 
subunits. The survey recorded 13 broad occupational categories defined 
by the Chinese census bureau, among which “middle-level management” 
and “high-level management or leader” are coded as elite cadre occupa- 
tions. With this definition, 284 cases or 9.3% of our current urban sample 
had ever held elite cadre positions from 1949 to 1996.¹⁹ The independent
variables are age, gender, high school education, college education, party 
membership, and the timing of joining the party. Age and gender are 
identical to those used in the Gompertz models employed earlier. In order 
to compare their relative effects, high school education and college edu- 
cation are recoded in a new way such that the high school education is 
the reference group of college education. Party membership is a dummy 
variable coded “1” for those who joined before becoming a cadre (including 
those who never became a cadre). Those who joined after becoming a 
cadre are treated as nonmembers. The timing of joining the party is 
measured as the number of years from the time the individual entered 
the work force to the time of joining the party.²⁰ Those who were not
party members are coded as “0” such that this variable can be regarded 
as the interaction between party membership and the duration of waiting 
time. In order to assess the effects in different historical eras, we estimate 
the models separately for three periods: the Mao period (1949-77), the
early reform period (1978-87), and the late reform period (1988-96). Only


18 Standard maximum-likelihood estimation (MLE) may be biased due to the multistage
sampling design of our data. Respondents were selected from households with different
numbers of adults and therefore do not have equal probabilities of selection. Moreover,
the clustering of observations complicates the proper calculation of standard errors.
To correct these problems, we first apply a case weight, which is the inverse of the
probability that an individual was selected, and then employ robust variance estimation
(Royall 1986; Lin and Wei 1989; StataCorp 1997) in which the standard errors are
calculated based on the primary sampling units (i.e., county-level jurisdiction in our
data), rather than on individual observations. Robust MLE does not change the point
estimates, but it ensures unbiased standard errors. Except in tables 1 and 2, all the
models employ robust MLE.

19 Six individuals who attained cadre positions before 1949 are excluded from the
analysis. 

20 The timing of joining the party is measured with reference to the timing of labor
force entry: (timing) = (year joined party) − (year began first job). For those who
worked before age 18, year began first job is set to birth year plus 18. The timing of
joining the party is set to 0 for those who joined the party before the first job or before
age 18. For those who have never joined or joined the party after becoming a cadre,
the timing of joining is set to 0.
those at risk in a particular period are included in the analysis. In other
words, those who entered the labor force in later periods, or those who
had already obtained cadre positions in a previous period, are excluded
from the analysis in the current period.
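
The coding rules for party membership and its timing can be summarized in a short sketch. The function and argument names are hypothetical, and the treatment of a respondent who joined the party and became a cadre in the same calendar year (counted here as a member) is an assumption, since the text does not address that case.

```python
from typing import Optional, Tuple

def party_variables(birth_year: int,
                    first_job_year: int,
                    joined_year: Optional[int],
                    cadre_year: Optional[int]) -> Tuple[int, int]:
    """Return (party_member, timing) under the coding rules described in the text."""
    # Never joined, or joined only after becoming a cadre: coded as a nonmember,
    # with timing set to 0.
    if joined_year is None:
        return 0, 0
    if cadre_year is not None and joined_year > cadre_year:
        return 0, 0
    # The clock starts at the first job, but no earlier than age 18.
    start_year = max(first_job_year, birth_year + 18)
    # Joining before the first job or before age 18 gives timing 0.
    timing = max(joined_year - start_year, 0)
    return 1, timing

# Began work at 16 (1966), so the clock starts at 18 (1968); joined in 1972.
print(party_variables(1950, 1966, 1972, None))
```

Because nonmembers are coded 0 on both variables, the timing variable behaves as the interaction between membership and waiting time, as noted above.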

The key variables in table 3 are party membership and timing of mem- 
bership. As shown in model 1, party membership substantially increases 
the odds of becoming an elite cadre across all three periods. The effect 
of party membership is strongest in the early reform period, although it 
declines afterward. Moreover, consistent with previous studies (Walder et 
al. 2000), a high school education is sufficient for promotion into cadre 
positions, while a college education does not bring much additional ad- 
vantage (the effects are even negative) in any of the three periods. When 
we add the timing of membership to the model, we see subtle changes in 
the effects of party membership. In the Mao period, the effect of party 
membership depends on the timing of joining: the earlier the individual 
joins, the more likely it is that she or he will become an elite cadre. When 
an individual joins the party in the early career (i.e., timing equals 0), she 
or he is 6.5 times more likely to become an elite cadre than is a nonparty 
member. The advantage of joining while young, however, declines at a 
rate of 8% every year thereafter; after 24 years in the labor force, the 
advantage of joining disappears. Clearly, the career advantages of party 
membership accrue primarily to those who joined the party early in their 
careers. This is consistent with our suspicion that there are generic ad- 
vantages attached to early selection into the party and conforms to hy- 
pothesis 5. 
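
The decay of the membership advantage with delayed joining can be sketched as follows. The calculation uses only the rounded figures quoted in the text (an odds ratio of 6.5 at timing 0 and a decline of 8% per year), and the function names are hypothetical; the underlying coefficients are in table 3.

```python
import math

def membership_odds_ratio(timing_years, base_or, annual_decline):
    """Odds ratio relative to nonmembers for someone who joins after timing_years."""
    return base_or * (1.0 - annual_decline) ** timing_years

def break_even_years(base_or, annual_decline):
    """Delay at which the odds ratio falls to 1, i.e., no advantage remains."""
    return math.log(base_or) / -math.log(1.0 - annual_decline)

# Rounded Mao-period figures as quoted in the text.
years = break_even_years(6.5, 0.08)
print(round(membership_odds_ratio(10, 6.5, 0.08), 2), round(years, 1))
```

With these rounded inputs the break-even point falls at a little over 22 years, close to the 24 years reported in the text; the difference presumably reflects rounding of the underlying coefficients.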

The timing effect of party membership is not present, however, in the 
early reform period. The timing variable does not improve the fit of the 
model at all, nor does it have any effect on the odds of becoming an elite 
cadre. As a result, the main effect of party membership does not change 
significantly from model 1. This suggests that all party members, regard- 
less of their age at membership, enjoyed advantages over nonmembers 
to the same degree. This is consistent with the results of table 2, where 
we found that the early reform period did not conform to the general 
pattern of recruiting heavily from the young (we will elaborate on this 
point later). In contrast, the late reform period exhibits a pattern similar 
to the Mao years, although both the model and the timing effect just fail 
to meet the 0.05 level of statistical significance (in part because of the 
small number of events). Enhanced opportunities are enjoyed by those 
who join while young, and every year of delay decreases the odds of 
promotion by 4%. 

Except for the early reform period, which has emerged as transitional, 
the results in table 3 are a decisive answer to the question of whether 
early incorporation into the party brings substantial career advantages. 
Whether it is the party’s preference or individuals’ preferences that lead
to such high rates of incorporation early in the career, early incorporation 
opens doors into the cadre elite. 


Party-Sponsored College Opportunities 


If the party-state created a cadre elite in the Mao period by drawing on 
such a low proportion of the college educated, how was it able to minimize 
the long-noted problems of having uneducated reds exercise authority 
over highly educated professionals (Schurmann 1968)? The solution is 
disarmingly simple. Those who join the party early are given further 
education as adults before promotion into leadership positions. 

The Chinese regime has created various channels through which work- 
ing adults are sent back to school for further education. This, along with 
the rapid expansion of the regular educational system, has resulted in an 
extensive system of adult further education, which includes peasant, 
worker, and cadre education in the Mao era and adult higher education 
such as correspondence, television, and night college in the reform period. 
The impacts of adult further education are surprisingly large. On the one 
hand, a large portion of the population has received its final education 
through certain kinds of adult education. On the other hand, credentials 
earned through adult education have been officially treated as equivalent 
to those earned through the regular schooling system (Ministry of Edu- 
cation 1984, pp. 575-627). 

Table 4 shows the large overall impact of adult further education on 
the attainment of educational credentials. Among 3,087 cases in our sam- 
ple, about two-thirds did not have a chance to go to regular high school, 
and only 5.8% attended college before their first job. Consistent with the 
above analyses, party members and elite cadres do not have significantly 
better educations than other individuals through the regular schooling 
system. When we include adult education attained after a period of em- 
ployment, we can see that for the whole sample, the percentage of college 
educated almost doubles, from 5.8% to 11.3%, while the proportion with 
less than high school education decreases only slightly from 67.4% to 
62.1%. More dramatic changes are evident among party members and 
elite cadres. Due to continuing education for adults, the proportion with 
college education increased from 9% to 24% among party members and 
from 6.6% to 26.6% among elite cadres, while the proportion with less 
than high school education decreased substantially for both (from 59.6% 
to 46.7% for party members and from 61.7% to 40.7% for elite cadres). 
These figures show that adult education has been the main channel 
through which the party has promoted the education of those it has se- 
lected as candidates for eventual leadership positions. 


TABLE 4
DISTRIBUTIONS OF EDUCATIONAL LEVELS IN URBAN CHINA, 1949-96

                       REGULAR EDUCATIONAL LEVEL      FINAL EDUCATIONAL LEVEL
EDUCATIONAL                        Party    Elite                Party    Elite
LEVEL                   Overall  Members   Cadres     Overall  Members   Cadres

Below high school         2,079      319      179       1,918      250      118
                         (67.4)   (59.6)   (61.7)      (62.1)   (46.7)   (40.7)
High school                 829      168       92         821      158       95
                         (26.9)   (31.4)   (31.7)      (26.6)   (29.5)   (32.8)
College                     179       48       19         348      127       77
                          (5.8)    (9.0)    (6.6)      (11.3)   (23.7)   (26.6)
Total                     3,087      535      290       3,087      535      290

NOTE.—Regular educational level includes only schooling attained before entering the labor force,
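
As a quick check on the college shares discussed in the text, they can be recomputed from the cell counts in table 4. The following is only an illustrative sketch: the dictionary layout and the function name `college_share` are assumptions introduced here, not part of the original analysis, and other recomputed cells may differ from the printed percentages in the last digit.

```python
# Cell counts from table 4, ordered as
# (below high school, high school, college).
# The data layout and names are illustrative assumptions.
TABLE4 = {
    "regular": {
        "overall": (2079, 829, 179),
        "party_members": (319, 168, 48),
        "elite_cadres": (179, 92, 19),
    },
    "final": {
        "overall": (1918, 821, 348),
        "party_members": (250, 158, 127),
        "elite_cadres": (118, 95, 77),
    },
}


def college_share(counts):
    """Return the college-educated percentage of a column, to one decimal."""
    below_hs, high_school, college = counts
    total = below_hs + high_school + college
    return round(100.0 * college / total, 1)


# Print the college share before (regular) and after (final) adult education.
for group in ("overall", "party_members", "elite_cadres"):
    before = college_share(TABLE4["regular"][group])
    after = college_share(TABLE4["final"][group])
    print(f"{group}: {before}% -> {after}%")
```

Each column sums to the totals shown in the table (3,087, 535, and 290), so the change in the college share between the two panels reflects movement of the same individuals across educational levels through adult education.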


Unlike the regular schooling system based largely on local, provincial, 
and national entrance examinations, access to adult education is granted 
by workplaces that select candidates and provide financial sponsorship. 
Although some types of adult education require entrance examinations, 
selection standards are usually low. More important, because there is no 
separate job placement system linked to adult education, people usually 
take leave from their jobs, continue to receive salary and benefits, and 
return to their workplace after finishing adult education. Thus, continuing 
education is sponsored in a quite literal sense of the term. The specific 
arrangements made for adult education therefore permit local party or- 
ganizations to direct these opportunities preferentially to those being 
groomed for administrative positions.²⁰

Table 5 reports tests of the proposition that opportunities for continuing 
higher education are allocated preferentially to party members and cadres, 
which in turn improves the overall educational level of the political elite. 
The models in table 5 are piecewise exponential models defined as 


$$r_p(t) = \exp(\beta_p' X_p), \qquad (2)$$


where $r_p(t)$ is the hazard rate of going back to college for continuing
education in period $p$, and $X_p$ is a vector of covariates (including a
constant intercept) with corresponding coefficients $\beta_p$ for period $p$.
We provide estimates for three periods: the Mao period (1949-77), the early
reform period (1978-87), and the late reform period (1988-96). The analysis
includes only those without a college education before entering the labor
force; those who already had a college education are treated as censored
and omitted from the sample.²¹ The observation period begins at the time of
entering the labor force and ends whenever the individual went to college
for continuing education (for those who began work before age 16, the
initial time is set at age 16). Our main interests are the relative effects
of party membership, cadre occupation, prior education, and age; gender,
father’s education, and father’s cadre status are treated as control
variables.

20 It is also possible that adult education occurs before the attainment of
party membership and cadre positions; that is, some people may be sponsored
for adult education first and then recruited later into the party or
promoted into cadre positions.

Model 1 reports the estimates without including a variable for cadre 
occupation. We can see that the chances for adult higher education are 
far from equal. First, the negative effects of age across three periods 
strongly indicate that opportunities for continuing higher education are 
given predominately to young adults—each additional year of age de- 
creases the odds by about 10%, 4%, and 8% in successive periods. Second, 
party members and those who already had a high school education enjoy 
roughly the same large advantages. In the Mao period, party members 
are 4.4 times more likely than nonmembers to attend college through 
adult education. The effect of party membership is even more pronounced 
in the late reform period. When we include cadre occupation in model 2, 
we find that except in the Mao period, cadres are about 4 times more 
likely to obtain adult college education, holding other factors constant. 
In general, being young, a party member, or a cadre greatly enhances 
one’s chance of attaining a college degree through adult education.
Therefore, those who join the party early are more likely to attend college
as adults than are those who join later.²²
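
To make the multiplicative reading of these estimates concrete, the sketch below evaluates a hazard of the form exp(β′X) within a single period. It is a minimal illustration under stated assumptions, not the authors' estimation code: the intercept is a hypothetical value, the age coefficient is chosen to match the roughly 10% decline per year of age, and the party coefficient is chosen to match the 4.4-fold advantage reported for the Mao period.

```python
import math


def hazard(beta, x):
    """Hazard within one period under the exponential form exp(beta' x)."""
    return math.exp(sum(b * v for b, v in zip(beta, x)))


# Covariate order: (constant, age in years, party member indicator).
# The intercept (-6.0) is hypothetical; the other two coefficients are the
# logs of the multiplicative effects quoted in the text for the Mao period.
beta_mao = (-6.0, math.log(0.90), math.log(4.4))

member_25 = hazard(beta_mao, (1.0, 25.0, 1.0))
nonmember_25 = hazard(beta_mao, (1.0, 25.0, 0.0))
nonmember_30 = hazard(beta_mao, (1.0, 30.0, 0.0))

# Party members versus nonmembers of the same age: a factor of 4.4.
ratio_member = member_25 / nonmember_25

# Five additional years of age: 0.90 ** 5, or about 0.59.
ratio_five_years = nonmember_30 / nonmember_25

print(round(ratio_member, 2), round(ratio_five_years, 2))
```

Because the effects multiply, the intercept cancels out of every ratio, which is why the comparison between groups does not depend on the hypothetical baseline chosen here.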

We should also note some potential ambiguities in interpreting the relative
effects of party membership and cadre position on opportunities for adult
college education. In the sponsored mobility model, party-sponsored adult
college education occurs either before or after young party recruits are
promoted into cadre positions. The relative importance of party membership
and cadre status therefore reflects these two potential sequences of
events. In the Mao period, the coefficients suggest that sponsored adult
college education took place largely before young party recruits were
promoted into cadre positions. There are two reasons why cadres did not
acquire further college education at a higher rate. First, cadres in the
Mao period were less educated, as many of them were revolutionaries who
joined the party or the army before 1949, and thus could obtain only
low-level adult education. Second, the Cultural Revolution may have exerted
a negative effect on cadre education, not only because the educational
system was disrupted but also because many party officials and government
bureaucrats were purged (Deng and Treiman 1997). In the early reform
period, however, further college education occurred mainly after promotion
into cadre positions. In this period, as revealed in preceding analyses,
the party began to recruit members during mid- to late-career and to
promote them quickly thereafter into cadre positions without a long period
of screening, cultivation, and training. As a result, further education
occurred mainly after rather than before individuals were promoted. Not
until the late reform period are the effects of party membership and cadre
position roughly equal. Both cadres and young party recruits are being sent
to college, either to augment their education or to prepare for future
promotions.²³ Despite these complexities, it is fairly clear that for most
of its history the party has tended to recruit young members in workplaces
who will subsequently be sponsored for further education, rather than
recruiting those who already have a college degree while they are still
young.²⁴

21 Our treatment of continuing college education does not take into account
the disruptive effects of the Cultural Revolution (1966-76). During this
period, colleges and universities initially were closed (1966-72) and
afterward (1973-76) only admitted students who had worked for a period. If
people in these cohorts attended regular universities by taking the
entrance examination after it was restored in 1977, they are treated as
having attended continuing college education. Uncovering the disruptive
effects of the Cultural Revolution on continuing college education would
require separate and more focused analyses (see Zhou and Hou 1999). Only a
small number in this age cohort in our sample (24 cases, or 13% of those
who would attend college) returned to college through regular national
entrance examination from 1977 to 1979.

22 Due to the very large impact of age, the timing of joining the party was
not significant and therefore not included in the models. Although we can
infer that early party members will have an advantage over later joiners,
it should be noted that the advantage of early joiners is, statistically
speaking, not produced by the timing of joining per se, but by the
independent effects of age.


23 This, along with the historic shift toward recruitment of young college
graduates as party members, indicates the strong commitment to education
that has characterized party policy for more than a decade (Lee 1991). As
the older cohorts of party cadres retire, and as younger cohorts enter with
higher educational levels, we would expect the role of adult college
education eventually to decline.

24 The estimated effect of prior high school education is of equal
magnitude to that of party membership. This indicates that despite the
preferential access enjoyed by young party recruits to the system of adult
college courses, a party member cannot take advantage of the opportunity
without high school education, and that the system is large enough to
permit large numbers of nonmembers to receive adult college training. It is
also possible that one variant of party-sponsored mobility undetected by
our models is at work: some with a high school education may be selected
for adult college education first and recruited into the party and promoted
into cadre positions later.




Career Consequences of Adult Education 


Party-sponsored adult education has changed the landscape of career
mobility dramatically, for education is not merely a cause but also an
outcome of mobility. If, further, those sponsored for adult college
education are more likely to be promoted into administrative positions than
those who attended college before working, we should observe two groups of
college-educated individuals with different career destinations: those who
obtain college education via adult education should be promoted into
administrative positions at higher rates than those who earned a regular
college degree before working (hypothesis 7).

Table 6 estimates the attainment of elite cadre positions using piecewise
exponential models with period-specific effects, as defined in equation
(2). In all the models, we include age, gender, and party membership as
control variables, and our main purpose is to compare the effects of regular 
and adult college education. Consistent with the results reported in table 
3, those who earned their college education through the regular system 
are not promoted into administrative positions at a substantially higher 
rate than people without a college education, other things being equal. 
Not only are the estimated effects small, but they are not statistically 
significant in most of the models. On the contrary, those with an adult 
college education are much more likely to be promoted into administrative 
positions, with odds ratios at least double that for regular college in all 
periods. 

The gap between the two college-educated groups is much larger if we 
consider the fact that some obtained adult college education after being 
promoted into administrative positions, as shown in table 5. Table 4 
showed clearly that the high percentage of college-educated party mem- 
bers and officials is primarily due to adult higher education. Of the 179 
respondents with formal college education, only 27% ever joined the party. 
In contrast, 48% of the 169 respondents who attained college education
through the adult continuing education system were party members. The
contrast is even more striking for elite administrators: While only 11% 
(19) of those with formal college education have ever held an adminis- 
trative position, 34% (58) of those with an adult college education are 
elite cadres. Party sponsorship therefore is not merely compatible with a 
“meritocratic” emphasis upon higher education; it is intimately connected 
to the process of educational attainment for those who enter the party 
while young and who are subsequently sponsored for elite administrative
posts.


TABLE 6
ROBUST MLE OF THE EFFECTS OF REGULAR AND ADULT COLLEGE EDUCATION ON THE
ATTAINMENT OF ELITE CADRE POSITIONS IN URBAN CHINA, 1949-96

                                    Overall   Mao Period Early Reform  Late Reform
Variable                            1949-96      1949-77      1978-87      1988-96

Age                                     .99          .97         1.00          .98
                                    (-1.76)      (-3.43)       (-.11)      (-1.66)
Gender (male)                          3.41         3.03         5.04         2.55
                                     (6.61)       (2.91)       (4.70)       (2.88)
Party members                          5.37         7.25         5.97         2.69
                                     (9.59)       (7.58)       (6.58)       (2.68)
Regular college education              1.55         2.24         1.17         1.74
                                     (1.27)       (1.89)        (.30)       (1.12)
Adult college education                3.50         4.44         2.46         4.92
                                     (4.89)       (4.41)       (2.51)       (3.94)
χ²                                    359.1        166.1        159.8         51.6
df                                        5            5            5            5
N of events                             281          112          105           64

NOTE.—Coefficients are relative odds ratios. Numbers in parentheses are
z-scores. N = 3,058. All models use the method of robust estimate of
variance to account for the effects of the special sampling design
(including case weight and cluster effects).


SUMMARY 


Our findings conform very closely to the hypothesized characteristics of 
political sponsorship. Recruitment into the party is concentrated in the 
earliest years on the job, and hazard rates decline steadily thereafter 
—party membership is not a credential earned at relatively constant rates 
until the midcareer (hypothesis 1). In two of the three historical periods 
we have examined, those who join while young have different back- 
grounds than those who join later in life. In the Mao period, party re- 
cruitment conformed closely to the pattern of “counterselection”: early 
entrants were recruited heavily from among those with red backgrounds, 
while those with a college education tended not to join until later in their 
careers (hypothesis 3). In the late reform period, we observed a dramatic 
shift toward a “technocratic” pattern, in which individuals from red house- 
holds are abandoned in favor of young college graduates (hypothesis 4). 
Only in the intervening early reform period, which emerges as a transi- 
tional one between two different patterns of sponsorship, are there no 
significant differences between early and late party recruits. 

Are those who join the party early in their careers more likely to attain 
elite administrative positions (hypothesis 5)? In the Mao period, this was 
clearly the case, but not in the transitional early reform period. The late 
reform period returned to the pattern of promoting early recruits
preferentially, but the estimated coefficients are just short of statistical
significance at the 0.05 level. This pattern of sponsorship is accomplished
in part through the preferential sponsorship of party members with high 
school educations for adult college education. Party members are much 
more likely to return to school for college courses than nonmembers in 
all three historical periods (hypothesis 6). And those who take college- 
level courses in this fashion as working adults are far more likely than 
those with a regular college education to be promoted into elite admin- 
istrative posts (hypothesis 7), although the advantages are smaller in the 
early reform period. 

For most of its five decades in power, the Chinese Communist Party 
has recruited preferentially from among the very young. These young 
recruits have had different characteristics from older recruits; they have 
been promoted into administrative posts at higher rates; they have been 
given remedial college-level training at higher rates; and this remedial 
college training has greatly increased the odds of promotion into an elite 
administrative post, whereas a regular college education has not. The 
clear exception to this pattern is the early reform period (1978-87), which 
turns out to mark a transition between the Mao-era sponsorship of reds 
at the expense of college-educated experts and an emerging technocratic 
late-reform pattern that refocuses early recruitment efforts onto young 
college graduates while continuing to offer adult college education and 
preferential promotions to party members who still lack a higher edu- 
cation. The temporary suspension of party sponsorship in the early reform 
period appears to be part of a recovery from the damages wrought by 
the Cultural Revolution on the party, the bureaucracy, and individual 
careers. 


INTERPRETATION OF THE RESULTS 


Sponsored mobility has permitted the party to balance its role as an elite 
organization with its simultaneous role as a mass party. The practice of 
sponsorship has enabled the incorporation of a broad “united front” of 
members from all walks of life, while further selecting a subset of members 
for promotion into positions of power. Our search for evidence of a form 
of party sponsorship led us to distinguish party members according to the 
age at which they join, and this distinction has revealed previously un- 
detected complexities in the relationships among party membership, ed- 
ucation, and occupational mobility. It turns out that the Chinese Com- 
munist Party attempted to reconcile the classic red versus expert dilemma 
in a fashion that was surprisingly subtle—especially for a regime that 
adopted such openly anti-intellectual policies for much of the later Mao 
period. 

On the one hand, in the Mao period, the party emphasized different
principles of recruitment at different career stages: demands for political 
loyalty (and ascriptive markers for the same) were emphasized when re- 
cruiting the young, while demands for educational attainment (and pre- 
sumably professional competence) were directed primarily to those in mid- 
and late-career. After the selection of early members according to political 
criteria, later party recruitment was based on more open competition in 
which the educated professionals were incorporated into the party in mid- 
or late-career. Unlike their younger counterparts, however, these older 
party recruits, though well educated, were not sponsored for advancement 
into the administrative elite. For them, party membership may have been 
an intrinsic reward, or may have proffered status advantages in other 
areas of life. Through the late recruitment of more highly educated people, 
the party was able to improve the overall educational level of party mem- 
bers. And the party was able to incorporate the college educated into their 
organization in a largely symbolic way, signaling its openness to a broader 
constituency while not placing these educated party members in positions 
of power. 

On the other hand, during the Mao era and afterward, the red versus 
expert dilemma was handled through the systematic sponsorship of party 
members and officials for adult college courses. If young party members 
lacked a college education at the time of entry (which they did over- 
whelmingly in the Mao era), they were sponsored by their workplace 
leaders for continuing adult education. Such continuing education may 
have led to an actual improvement of the individual's ability to perform 
in an administrative post, or it may have served largely a symbolic purpose 
of legitimating those chosen for leadership without educational qualifi- 
cations (or both). Educational credentials earned in this way greatly in- 
creased the odds of entering into the administrative elite, even over those 
who had earned regular college degrees. The existence of a large system 
of adult education has permitted young party loyalists to receive higher 
education in order to prepare them for the exercise of leadership. This 
means that much of the association observed between higher education, 
party membership, and occupational attainment in China turns out to be 
due to the two phenomena we have revealed in this study: the late and 
largely symbolic incorporation of educated professionals into the party 
and the sponsorship of party members and cadres for adult college courses. 
Through these mechanisms, the Chinese Communist Party delayed for 
almost two generations the emergence of a “technocratic” pattern of the 
variety once heralded in European communist regimes in the 1970s (Kon- 
rád and Szelényi 1979; Lee 1991). 


CONCLUSIONS


Party membership is neither a homogeneous elite status nor a credential 
analogous to an educational degree. Strong associations between party 
membership and elite positions are masked by elite selection processes 
that are left unspecified in the models commonly employed in comparative 
research on social mobility. While party members appear to enjoy certain 
career advantages, it is early entry into the party, not party membership 
per se, that leads to promotion into an administrative post. Although 
party membership itself signals political worth, party members continue 
to undergo observation and training. Early membership allows an indi- 
vidual to go through such an extended process of screening, cultivation, 
and training, permitting the evaluation of such perceived attributes as 
loyalty, reliability, and “ability” that otherwise are difficult to judge. Party 
membership obtained later in the career does not provide such career 
advantages and is largely a symbolic reward. Treating party membership 
as a credential thus would prevent us from detecting the institutional and 
organizational forces through which political standards are brought to 
bear on career mobility. 

Party sponsorship, observed in the timing of career events, alters the 
nature of the association between education and career advancement as 
it has been commonly conceptualized in decades of research on compar- 
ative mobility. The political sponsorship of young party members for 
continuing college education clearly indicates that for a large portion of 
the upwardly mobile, educational attainment is as much an outcome as 
a cause of career advancement. While party sponsorship produces a strong 
correlation between education and elite occupation, such a correlation 
cannot be interpreted as a straightforward indicator of meritocratic prin- 
ciples, for it is partially the result of prior political selection processes. 
These processes conform to certain models of sponsored and tournament 
mobility based on observation of elite school systems and corporate hi- 
erarchies in market economies. However, when these familiar processes 
are orchestrated by a ruling party that has branches in all urban work 
organizations of any consequence, they take on a macrosocietal signifi- 
cance that is absent when they are orchestrated by a plurality of uncon- 
nected organizations or well-connected elite families. Through its formal 
designation of “members,” a ruling party inadvertently creates a category 
that turns out to provide an unusually convenient measure with which 
to gauge career processes that some researchers believe are common in 
social hierarchies throughout the world. Models of comparative mobility 
that have long sought to differentiate nations according to the returns to 
different kinds of individual credentials are too blunt an instrument to 
gauge the impact of politics on mobility. 


This article shows that Medical Innovation, the landmark study
by Coleman, Katz, and Menzel, and several subsequent studies
analyzing the diffusion of the drug tetracycline have confounded 
social contagion with marketing effects. The article describes the 
medical community’s understanding of tetracycline and how the 
drug was marketed. This situational analysis finds no reasons to 
expect social contagion; instead, aggressive marketing efforts may 
have played an important role. The Medical Innovation data set is 
reanalyzed and supplemented with newly collected advertising data. 
When marketing efforts are controlled for, contagion effects dis- 
appear. The article underscores the importance of controlling for 
potential confounds when studying the role of social contagion in 
innovation diffusion. 


INTRODUCTION 


Researchers from various disciplines have long studied how innovations
diffuse through populations of individuals, households, and organizations.
Sociologists have offered the important insight that innovation diffusion
may be driven by social contagion, another way of saying that actors’
adoption behavior is a function of their exposure to other actors’
knowledge, attitude, or behavior concerning the innovation. Researchers
have offered different theoretical accounts of social contagion, each
describing a different causal mechanism of social influence.

We benefited from comments by Wayne Baker, Hans Baumgartner, Albert Bemmaor,


Watkins, the AJS reviewers, and audience members at the 1999 INSNA Sunbelt
Conference, the 1994 and 1998 INFORMS Marketing Science Conferences, the
Australian Graduate School of Management, Carnegie Mellon University,
Columbia University, Cornell University, Duke University, Harvard
University, Catholic University of Leuven, University of Michigan,
Northwestern University, Pennsylvania State University, Stanford
University, University of North Carolina at Chapel Hill, University of
Texas at Austin, and the Wharton School. We thank Thomas Valente for
providing us with the Medical Innovation data set prepared by Ronald Burt.
Financial support from Penn State’s Institute for the Study of Business
Markets and the Richard D. Irwin Foundation is gratefully acknowledged.
Direct correspondence to Christophe Van den Bulte, Wharton School,
University of Pennsylvania, 3620 Locust Walk, Philadelphia, Pennsylvania
19104-6371. E-mail: vdbulte@wharton.upenn.edu

Information transfer.—The social influence process may simply consist
of information transfer. Actors may become aware of the existence of the
innovation through word of mouth from previous adopters (Katz and
Lazarsfeld 1955). Actors may also update their beliefs about the costs and
benefits of adopting the innovation after discussing it with previous
adopters or after observing the outcomes of adoption (e.g., increased
status or profits). Bayesian updating through social learning under risk
aversion (e.g., Chatterjee and Eliashberg 1990; Roberts and Urban 1988) and
Bandura’s (1986) vicarious learning and modeling concepts are examples of
this influence process, as is the modeling process posited by DiMaggio
and Powell (1983).

Normative pressures.—The social influence process may take the form
of normative pressures, such as when actors experience dissonance and
hence discomfort when peers whose approval they value have adopted
an innovation, but they have not (e.g., Coleman, Katz, and Menzel 1966;
Davis, Bagozzi, and Warshaw 1989; DiMaggio and Powell 1983).

Competitive concern.—Social contagion may also be driven by the
competitive concern that one's rivals who have adopted the innovation might
be able to gain a competitive edge unless one adopts as well (e.g., Burt
1987; Hannan and McDowell 1987).

Performance network effect.—Some innovations exhibit a performance
network effect when the benefits of use, and hence of adoption, increase
with the number of prior adoptions. This effect may be direct, as with
point-to-point communication devices like telephones and fax machines,
but it may also be indirect, operating through the increased supply of
complementary products, as with videocassette recorders and prerecorded
tapes (Katz and Shapiro 1994), or through the increased supply of
supporting infrastructure, such as video rental stores (Brown 1981;
Delacroix and Rao 1994).

2 Some may prefer to label such indirect endogenous feedback mediated
through suppliers’ decisions “ecological influence” rather than “social
contagion” to distinguish it from more direct interpersonal or
interorganizational influence (Marsden and Friedkin 1994).

Although these social contagion mechanisms are conceptually distinct,
their expressions in data may often be indistinguishable, making it
impossible to identify the exact nature of the mechanism at work (DiMaggio
and Powell 1983; Montgomery and Casterline 1996). In this article, we
emphasize and empirically illustrate another fundamental challenge in the
study of social contagion in innovation diffusion processes: how social
network effects may be confounded with common contextual effects.

The danger of confounding common contextual effects for social con- 
tagion has long been recognized (e.g., Collier and Messick 1975; Erbring 
and Young 1979; Thibleson 1974). Classic studies by Katz and Lazarsfeld 
(1955) and Coleman et al. (1966), for instance, documented that awareness 
of and attitude toward new products can be affected not only by social 
contagion but also by mass media exposure and by companies’ marketing 
efforts. More recent research has further challenged the empirical support 
for the role of social contagion in innovation diffusion by showing that 
S-shaped diffusion curves—often interpreted as evidence of social con- 
tagion—can result from population heterogeneity rather than contagion 
(Bemmaor 1994; Bonus 1973; Thirtle and Ruttan 1987). For instance, 
when a product’s price decreases linearly over time and reservation prices 
are normally distributed over the population, the diffusion curve will be 
the normal cumulative density function. These results support concerns 
voiced by England (1998) and Haunschild and Miner (1997) that the 
positive relationship between the prevalence of prior adoption among 
one’s network alters and the likelihood of one’s own adoption—typically 
interpreted as evidence of social contagion—is often produced by factors 
that grow over time but that are excluded from the model.³
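
The heterogeneity argument can be illustrated numerically. The sketch below assumes, as in the example above, that price falls linearly over time and that reservation prices are normally distributed, with each person adopting once the price drops to or below his or her reservation price. All parameter values and function names are hypothetical and chosen only for illustration. The resulting adoption curve is S-shaped even though no individual responds to any other.

```python
import math


def normal_cdf(x, mu, sigma):
    """Cumulative distribution function of a normal(mu, sigma) variable."""
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))


def adoption_share(t, p0, slope, mu, sigma):
    """Share of the population whose reservation price is at least the
    price at time t, when price falls linearly: price(t) = p0 - slope * t."""
    price = p0 - slope * t
    return 1.0 - normal_cdf(price, mu, sigma)


# Hypothetical values: starting price 100, falling 5 per period;
# reservation prices distributed normal(60, 10).
P0, SLOPE, MU, SIGMA = 100.0, 5.0, 60.0, 10.0

curve = [adoption_share(t, P0, SLOPE, MU, SIGMA) for t in range(17)]

# The same curve is a normal CDF in time, with mean (P0 - MU) / SLOPE
# and standard deviation SIGMA / SLOPE.
as_time_cdf = [normal_cdf(t, (P0 - MU) / SLOPE, SIGMA / SLOPE)
               for t in range(17)]

for t, share in enumerate(curve):
    print(t, round(share, 3))
```

Under these assumptions half of the population has adopted by period 8, the point at which the price reaches the mean reservation price, and adoption accelerates and then decelerates around that point without any contagion term in the model.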

This article provides additional support for such concerns about
confounding social contagion with the effect of omitted contextual
variables. We show that prior evidence of social contagion gained from the
Medical Innovation study by Coleman et al. (1966) is an artifact arising
from omitting the effect of marketing efforts.⁴ Medical Innovation is a
study on the role of social networks in the diffusion of the broad-spectrum
antibiotic tetracycline among physicians in four communities in the
American Midwest during the mid-1950s. It is often credited with
documenting innovation diffusion as a social process in which adoption is
driven by social contagion (Rogers 1995). The study has more than just
historical interest, though. Its data on the diffusion of tetracycline have
become “a strategic research site for testing new propositions of how
social structure drives contagion” (Burt 1987, p. 1301) and for assessing
the performance of new modeling techniques (Marsden and Podolny 1990;
Strang and Tuma 1993; Valente 1996). While recent reanalyses of the data
have found social contagion effects to be rather small (Burt 1987; Burt and
Janicik 1996), sensitive to model specification (Strang and Tuma 1993), or
even insignificant (Marsden and Podolny 1990), these reanalyses were
primarily designed to investigate which social contagion mechanism was
operating (information sharing among cohesive actors vs. competitive
mimesis among positionally equivalent actors) or to assess the performance
of new models in capturing contagion patterns in the data. With such
objectives, previous reanalyses assumed that contagion was truly at work.
Therefore, none started with a detailed account of how the market for the
new drug operated nor included variables capturing time-dependent
nonnetwork mechanisms.

3 Granovetter (1978) warned of cases in which individuals appear to react
to one another when they are actually responding to a common, external
influence. In support, he provided this memorable quote from Weber ([1921]
1968, p. 23): “Thus, if at the beginning of a shower a number of people on
the street put up their umbrellas at the same time, this would not
ordinarily be a case of action mutually oriented to that of each other, but
rather of all reacting in the same way to the like need of protection from
the rain.”

4 In this article, we use the term “marketing efforts” to denote the range
of activities that a supplier engages in to further the sales of its
product. Apart from developing appealing products, marketing efforts
include impersonal marketing communication (advertising and direct mail),
personal selling (termed “detailing” in the pharmaceutical industry),
obtaining free publicity, and managing pricing and distribution (e.g.,
Bagozzi 1986; van Waterschoot and Van den Bulte 1992).

We present our argument in two steps. We first describe the medical 
community’s understanding of antibiotics and sources of drug information 
at the time the data were originally collected. From this description, we 
conclude that social contagion is unlikely to have been a key driver in 
physicians’ decision to adopt, but that the pharmaceutical companies’ 
marketing efforts may have played a considerable role. Next, we test this 
conjecture empirically by applying hazard models to the Medical Inno- 
vation data set, which we have supplemented with new data on adver- 
tising. Our results indicate that previous evidence of social contagion was 
spurious. Given the significance of Medical Innovation in the diffusion 
literature, our findings underscore the risk of confounds in research on 
social contagion in innovation diffusion. 


SITUATIONAL ANALYSIS OF TETRACYCLINE AND THE MEDICAL 
COMMUNITY 


Medical Innovation is a study of the adoption of tetracycline, a broad- 
spectrum antibiotic, by 125 physicians in four small cities in Illinois. At
the time that Lederle launched the first tetracycline-based product in 
November 1953, three other broad-spectrum antibiotics were already on 
the market. Lederle had introduced chlortetracycline in 1948, Parke-Davis 
had introduced chloramphenicol in 1949, and Pfizer had introduced
oxytetracycline in 1950. To better understand the situation physicians found
themselves in when deciding whether or not to adopt the new drug, we 
analyze the product’s characteristics, the way it was commercialized, and 
the kinds of sources of information and influence physicians in the 1950s
typically used. 


Product Characteristics 


The diffusion literature suggests that an innovation’s rate of adoption is 
affected by potential adopters’ perceptions of five critical characteristics: 
complexity, compatibility with existing values, trialability, observability 
of results, and relative advantage over alternatives (Rogers 1995). Tet- 
racycline had product characteristics typically associated with rapid 
diffusion. 

Low complexity —Tetracycline was chemically similar to two existing 
and successful antibiotics, as evidenced by their generic names. Though 
a new compound, tetracycline “was merely the newest in an already es- 
tablished family of drugs,” and an “undramatic pharmaceutical innova- 
tion” (Coleman et al. 1966, pp. 17, 36). 

Compatibility —Physicians were favorably disposed toward the phar- 
maceutical industry, its new products, and efforts to market them (Ben 
Gaffin 1959; Caplow and Raymond 1954). Enthusiasm was particularly 
strong for antibiotics (Peterson et al. 1956).* 

Trialability and observability of results —Broad-spectrum antibiotics 
were generally used in the treatment of acute, rather than chronic, 
conditions. Because of the short time between treatment and outcome, a 
physician could easily and quickly determine drug efficacy in any 
particular case and adjust the therapy if necessary (Coleman et al. 1966, 
p. 17). 

Relative advantage. —Tetracycline produced fewer side effects than the 
other three broad-spectrum antibiotics (Pearson 1969). Tolerance and side 
effects had become a very important issue by the time tetracycline was 
launched. In the summer of 1952, the side effects of chloramphenicol, 


* Peterson et al. (1956) intensively studied 88 general practitioners in North Carolina,
each over a period of three to three-and-a-half days. They often observed the physicians 
immediately prepare an injection of penicillin upon learning that the patient had a 
fever. This decision was frequently reached before the patient had been examined. 
Two-thirds of the physicians gave antibiotics to all patients suffering from respiratory 
infections without attempting to determine whether the infection was viral or bacterial 
(antibiotics are ineffective against viral infections). Also, “it was apparent from
observation and statements from physicians that their practices in regard to medications
and therapy are influenced significantly by the information and products supplied by
the drug salesman” (p. 103).
marketed under the brand name Chloromycetin, received wide press coverage.
A June 28 editorial of the Journal of the American Medical Association
(JAMA) called doctors’ attention to reports on Chloromycetin’s
side effects, and on July 3 the American Medical Association issued a 
press release—“AMA Warns Doctors on Chloromycetin Therapy”—that 
received wide coverage in the popular press. Finally, the FDA even with- 
drew the drug’s certificate, organized its own field survey, and turned its 
reports over to the National Research Council for review. On August 14, 
the drug was allowed back on the market, although Parke-Davis was 
ordered to print prominently on its labels the dangers inherent in its use 
(Fortune 1953; Pearson 1969). As a result of these problems, Chloromy- 
cetin’s share of the broad-spectrum antibiotics market declined to 5% in 
October 1952, down from 38% four months earlier. In September 1953, 
two months before Lederle’s launch of tetracycline, Chloromycetin’s share 
was still at only 10% (Fortune 1953). Thus, tetracycline had a competitive 
advantage on a product dimension that was especially salient at the time 
of its launch. 

In sum, tetracycline’s characteristics present strong reasons to expect 
rapid diffusion. Also, there is little reason to expect social contagion to 
have been important. Since there was little ambiguity or perceived risk 
in prescribing tetracycline, information from previous adopters should not 
have affected physicians’ evaluation of the drug in a major way. Since 
tetracycline was merely the newest member of an already established 
family of drugs and an undramatic innovation, it is also questionable that 
adopting it would have markedly enhanced physicians’ status among their 
peers. 


Commercial Context 


An analysis of the potential adopter’s situation should also include a view 
of the supply side. In this section, we document characteristics that have 
been found to be associated with rapid diffusion (e.g., Bauer 1961; Hahn 
et al. 1994): the intensity of competition among suppliers, the reputation 
of suppliers among potential adopters, and the marketing efforts of 
suppliers. 

In contrast to other broad-spectrum antibiotics, tetracycline did not 
enjoy exclusive patent protection. After a tumultuous episode of litigation, 
the parties involved worked out a complex patent sharing and licensing 
agreement, giving Lederle, Pfizer, and Bristol the right to manufacture 
and sell the drug and allowing Squibb and Upjohn to sell the drug under 
a supply contract with Bristol (FTC 1958, pp. 245-57). These five firms 
accounted for more than half of all the antibiotics sold in the United 
States in 1950, and all had a good reputation within the medical
community (FTC 1958).

Lederle, the first company to launch a tetracycline formulation, de- 
ployed a very aggressive marketing program for its tetracycline brand 
Achromycin. Coleman et al. (1966, pp. 44, 181) mention the “blanket 
exposure of all doctors to the detail man [i.e., pharmaceutical represen- 
tative visiting physicians].” Lederle’s direct mail budget for tetracycline 
permitted 105 mailings, an average of two per week, to every physician 
in the United States during its first year of commercialization. Medical 
journal advertising for the first 12 months consisted of 26 insertions in 
JAMA and monthly insertions in the highly circulated Modern Medicine 
and Medical Economics, as well as in all state journals, 116 county jour- 
nals, and most specialty journals (FTC 1958). Tetracycline also received 
wide positive coverage in the professional media (Ben Gaffin and Associates
[1956] 1961b). As a detail man remarked, “Lederle was interested
in bombarding physicians with the Achromycin name and we did just 
that and got the name across. We swamped them with Achromycin” (FTC 
1958, p. 130). Pfizer was much less aggressive in pushing tetracycline, 
fearing that strongly promoting its own brand of tetracycline, Tetracyn, 
would undercut its sales of oxytetracycline. Only in January 1955, possibly 
alarmed by the tremendous success of Lederle’s Achromycin, did Pfizer 
start to market Tetracyn more aggressively (Mines 1978). We have no
detailed information on how aggressively the other three players marketed
their own tetracycline brands. They did not face the same fear of product
cannibalization as Pfizer did, and they appear to have been more ag- 
gressive than Pfizer, though they lacked the resources to be as aggressive 
as Lederle (Pearson 1969). 

Tetracycline was not only extensively promoted, but also aggressively 
priced. Although the product was superior to other broad-spectrum an- 
tibiotics, its price to consumers was the same as that of the three other 
types of broad-spectrum antibiotics (FTC 1958). To the extent that physicians
took price into consideration in their prescription behavior, tetracycline
pricing would have favored rapid adoption.

In sum, tetracycline was marketed by a small group of companies en- 
joying a solid reputation. The first company to enter the market, Lederle, 
deployed a very intensive marketing campaign. The product also enjoyed 
a large amount of free publicity. Although superior in therapeutic effects, 
the product did not carry a price premium. Such a market environment 
is conducive to rapid initial diffusion (Bauer 1961; Hahn et al. 1994), 
irrespective of social influence among physicians. 

Exposed to such intensive marketing efforts, physicians did not need 
word of mouth from their colleagues to become aware of the product’s 
existence and purported benefits. Coleman et al. (1966, pp. 13-14) argued 
that the physicians’ problem was not too little but too much information.
Social contagion, then, need not have operated by creating awareness but 
by physicians’ turning to their colleagues as a way to cope with infor- 
mation overload. Physicians who already had adopted may then have led 
others to evaluate the new drug positively. However, such a simplifying 
cognitive strategy is necessary only if actors experience ambiguity or un- 
certainty, a condition that appears to have been unlikely. While there was 
indeed a deluge of information, it all pointed in the same direction: in 
favor of adopting tetracycline. 


Physicians’ Sources of Information 


A number of studies provide insight into the relative importance of the 
physicians’ various sources of information about new drugs around the 
time of the tetracycline study. Many of these studies were conducted in 
the Midwest or in relatively small cities and can thus be expected to be 
representative of the four Illinois towns studied in Medical Innovation. 

A 1952 survey of midwestern physicians reported that they found detail 
men, direct mail, journal articles, and journal advertising to be much 
more important sources of information than their colleagues (Caplow and 
Raymond 1954). Menzel and Katz (1955) conducted the pilot study for 
Medical Innovation in a New England city of comparable size to the four 
cities in the main study. They also found that physicians rated colleagues 
as less important than detail men, articles in journals, and direct mail. A 
study of the diffusion of Lederle’s Achromycin brand in another small 
midwestern city found similar results (Ben Gaffin and Associates 1961b),
as did Ferber and Wales (1958), using a sample of Chicago physicians, 
and Winick (1961) in a study of an ethical drug introduced in 1957. 
National scale studies by the National Opinion Research Center (Hawkins 
1959), Ben Gaffin and Associates (1959), and Harris (1966) reported similar 
findings as well. All these studies indicate that physicians in the 1950s
typically did not report peer influence as an important information source
for new pharmaceuticals but noted commercial communication efforts 
and medical journals to be more important and more valuable. 


Conclusion from the Situational Analysis 

Overall, tetracycline’s product characteristics, the way it was marketed, 
and the sources of information physicians typically used for adoption 
decisions do not make a case for strong contagion effects. Table 1, reconstructed
from original reports on the Medical Innovation study, shows
that physicians considered colleagues to be a source of information and 
influence, but not a very important one. 
TABLE 1
DOCTORS’ REPORTED SOURCES WHEN ADOPTING TETRACYCLINE

                          % PHYSICIANS CREDITING    % PHYSICIANS MENTIONING SOURCE
                          SOURCE WITH*              OF INFORMATION AS†
                          Original     Most
SOURCE OF INFLUENCE       Influence    Influence    First    Intermediate    Final
Detail men                57           38           52       27              5
Journal articles          7            23           6        21              21
Direct mail               18           8            22       16              14
Drug house periodicals    4            5            3        11              21
Colleagues                7            20           10       15              28
Meetings                  3            ...          3        4               8
All other media           4            6            3        7               3

[...] and third column is 87; for those in the middle the base is 131.


One might raise the following questions: If advertising and detailing 
were indeed as important as our situational analysis suggests, would Cole- 
man et al. not have taken these factors into account? Does the fact that 
they did not include these factors in their analysis not reduce the face 
validity of our own conclusions? We do not believe so. In appendix A, 
we document that the way Medical Innovation came about may have 
led its authors away from looking into the effects of detailing and journal 
advertising. Appendix A also presents a reminiscence by Coleman indi- 
cating that he and his fellow authors may not have been very familiar 
with the institutional details of their research site. 


STATISTICAL ANALYSIS INCORPORATING MARKETING EFFORT 
Our description of the medical community’s understanding of tetracycline 
and of physicians’ sources of information indicates that marketing efforts 
by the drug manufacturers, and especially Lederle, may have been key 
drivers of the diffusion process. Earlier analyses, however, have ignored 
this factor, and their results may therefore have been based on a confound. 
We investigate this possibility empirically below. 


Substantive Assumptions 
We assume that to adopt, physicians must both be aware of the innovation 
and evaluate it positively. Using insights from social network threshold 
modeling, we assume that awareness and utility thresholds may vary as
a function of physician characteristics (Erikson 1998; Granovetter 1978;
Hedström 1994; Valente 1996). Awareness can be driven by marketing
efforts (such as sales calls by detail men and advertising), free publicity 
in medical journals, and exposure to peers who have adopted previously 
and with whom one shares information on medical practice. Marketing 
efforts, free publicity, and social contagion can also affect evaluation. 

We assume that social contagion stems from exposure to alters who 
have already adopted and that it operates over personal relationships. 
This social influence may take the form of information transfer, making 
one aware of the existence of the drug or improving one’s evaluation of 
the drug’s therapeutic merits. Social influence may also consist of social 
normative pressure: to maintain attitudinal balance, one may feel com- 
pelled to adopt once many of one’s direct peers have done so. Our data 
will not allow us to distinguish between these two “social cohesion” pro- 
cesses, but we will distinguish them from social influence operating as 
competitive mimicry of structurally equivalent colleagues who a physician 
feels compelled to imitate for fear of losing status in the community (cf. 
Burt 1987). Our inability to operationalize uniquely each social contagion 
mechanism is not important, as our purpose is not to distinguish between 
alternative theories positing each type of process, but rather to distinguish 
between social contagion and contextual effects, specifically marketing 
efforts measured through advertising volume. 


Model Specification 


We use discrete-time hazard modeling to relate explanatory variables 
to adoption behavior (Allison 1982). Operationalizing social network 
exposure through lagged endogenous autocorrelation terms (cf. Hedström
1994; Strang 1991), we represent the hazard of adoption of physician $i$ at
time $t$ as


$$\Pr(y_{it} = 1 \mid y_{i,t-1} = 0) = F\Big(\alpha' x_{it} + \beta \sum_j w_{ij}\, y_{j,t-1}\Big) \tag{1}$$


where $y_{it} = 1$ if $i$ has adopted by time $t$, and $y_{it} = 0$ otherwise; $F$ is a
cumulative distribution function; $x_{it}$ is a vector of variables affecting $i$'s
decision to adopt, irrespective of any influence from colleagues (the vector
includes an intercept, a summer dummy, physician characteristics, and
two advertising variables, which are described below); $w_{ij}$ is the social
exposure of physician $i$ to physician $j$ (described below); and $\alpha$ and $\beta$ are a vector
of parameters and a parameter to be estimated, respectively. Discrete-time
hazard models are appropriate here because several adoptions occur
in each observation period. Van den Bulte and Lilien (1999) show that 
these statistical models also have the theoretically attractive property that 
they can be derived from both random utility theory and random social
network threshold models. 
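
To make equation (1) concrete, the following minimal sketch evaluates the adoption hazard for one physician in one period, taking the logistic function as one possible choice for F. The function names and all numerical values are hypothetical and serve only to illustrate the calculation; they are not estimates from the Medical Innovation data.

```python
import math

def logistic(z):
    """Logistic CDF, one possible choice for F in eq. (1)."""
    return 1.0 / (1.0 + math.exp(-z))

def adoption_hazard(alpha, x_it, beta, w_i, y_prev):
    """Hazard that physician i adopts in period t, given no earlier adoption.

    alpha  : coefficients on the non-network covariates
    x_it   : covariate values for physician i at time t (first entry 1.0 = intercept)
    beta   : social contagion coefficient
    w_i    : social weights w_ij that i places on each physician j
    y_prev : adoption indicators y_{j,t-1} of each physician j (0 or 1)
    """
    baseline = sum(a * x for a, x in zip(alpha, x_it))
    exposure = sum(w * y for w, y in zip(w_i, y_prev))
    return logistic(baseline + beta * exposure)

# Hypothetical numbers, chosen only to illustrate the calculation.
alpha = [-2.0, 0.5]      # intercept and a marketing-stock coefficient
x_it = [1.0, 1.2]        # intercept term and marketing stock in period t
w_i = [0.5, 0.5, 0.0]    # i weights two colleagues equally, ignores the third
y_prev = [1, 0, 1]       # colleagues 1 and 3 had adopted by t-1

h_with = adoption_hazard(alpha, x_it, 1.0, w_i, y_prev)     # beta = 1
h_without = adoption_hazard(alpha, x_it, 0.0, w_i, y_prev)  # beta = 0
print(round(h_with, 3), round(h_without, 3))
```

Setting beta to zero removes the network term, so the gap between the two hazards isolates the contribution of social exposure under these assumed values.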


Data 


Coleman et al. (1966) provide a detailed description of the population, 
the sample, and data collection procedures. Burt (1986) placed the portion 
of the original data set that we use in the public domain. Since the data 
are publicly available, we limit our discussion to the variables we used 
or constructed for our own analysis. 

Physician characteristics.—We included five covariates to account for 
heterogeneity in physicians’ tendency to adopt early. Professional age mea- 
sures (on a 1-6 scale) how long ago the physician graduated from medical 
school. We included both a linear and a quadratic term to account for a 
possible inverse U-shaped relationship between professional age and in- 
clination toward adoption: compared to midcareer physicians, older phy- 
sicians may be more conservative and very inexperienced physicians more 
averse to risk. We mean-centered age to reduce collinearity. We used the 
number of journals a physician receives or subscribes to as a measure of 
media exposure. Journals included both newsletters sent by pharmaceu- 
tical companies and scientific and professional publications. We used the 
logarithm to reflect decreasing returns to scale. We expected physicians 
having a chief or honorary position in their hospital, captured as a dummy 
variable, to be less involved in actual medical practice than were active 
or regular staff, and hence more likely to adopt later. We also included 
an attitudinal measure, scientific orientation, coded as “1” if the physician 
agreed with the statement that it is more important for a physician to 
“keep himself informed of new scientific developments [than to] devote 
more time to his patients,” and “0” otherwise. 
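
The covariate construction described above can be sketched as follows. The sketch assumes every physician receives at least one journal (so the logarithm is defined), and the six records are hypothetical. It also shows why mean-centering helps: on a symmetric 1-6 scale, the correlation between age and its square is high before centering and vanishes after it.

```python
import math

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def build_covariates(age, journals, chief, science):
    """Mean-center age, add its square, and log-transform the journal count."""
    mean_age = sum(age) / len(age)
    rows = []
    for a, jn, c, s in zip(age, journals, chief, science):
        centered = a - mean_age
        rows.append({
            "age": centered,
            "age_sq": centered * centered,
            "log_journals": math.log(jn),  # assumes jn > 0
            "chief": c,                    # 1 = chief or honorary position
            "science": s,                  # 1 = scientific orientation
        })
    return rows

# Hypothetical records for six physicians.
age = [1, 2, 3, 4, 5, 6]
journals = [2, 4, 3, 8, 5, 6]
chief = [0, 0, 1, 0, 1, 0]
science = [1, 0, 1, 1, 0, 0]

rows = build_covariates(age, journals, chief, science)
raw_corr = pearson(age, [a * a for a in age])
centered_corr = pearson([r["age"] for r in rows], [r["age_sq"] for r in rows])
print(round(raw_corr, 3), round(centered_corr, 3))
```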

We also estimated models including the number of nominations a phy- 
sician received as advisor or as discussant as measures of status. Although 
sociometric status figures prominently in the analysis by Coleman and his 
associates, it did not contribute significantly to model fit, nor did it change 
the coefficients of the contagion variables once we controlled for the num- 
ber of journals received. Burt (1987), Marsden and Podolny (1990), and 
Strang and Tuma (1993) reported similar findings. It appears that opinion 
leaders adopted early as a result of their cosmopolitan perspective and 
media habits rather than due to pressure to maintain their status among 
their colleagues. We report results only for models excluding sociometric 
status variables. 

Seasonal effects —We included a seasonal dummy variable for the sum- 
mer months of July and August. We expected fewer adoptions of a new 
antibiotic in these two months because the weather is warmer and schools 
are closed, limiting the spread of contagious diseases and in turn the
demand for antibiotics (Cliff et al. 1981). 

Advertising volume —The Medical Innovation data do not contain in- 
formation on the amount of marketing effort targeted toward the phy- 
sicians whose prescriptions were tracked. We use the number of adver- 
tising pages in three leading advertising outlets, Modern Medicine, 
Medical Economics, and GP, as our measure of marketing effort. These 
three publications were preferred by pharmaceutical advertisers and were 
widely read by physicians in the 1950s (Ben Gaffin and Associates [1953]
1961a, 1961b). Our attempts to collect data on the number of ads ap- 
pearing in JAMA were unsuccessful, as librarians removed the advertising 
supplement before binding the issues for storage. 

We distinguish between the marketing efforts by the first entrant, Led- 
erle, and those of the later entrants. We do so for two reasons. First, the 
first entrant’s marketing efforts are often more effective than those of 
later entrants when the latter do not offer an important therapeutic ad- 
vantage (Bond and Lean 1977; Hurwitz and Caves 1988; Shankar, Car- 
penter, and Krishnamurthi 1998). Second, Lederle had a very large sales 
force and was strongly committed to building a dominant position ag- 
gressively while other companies were less well endowed and less 
aggressive. 

We matched the number of advertisements in each issue to the four- 
week sampling periods in the data set prepared by Burt (1986). Because 
the data are monthly observations and because previous research in the 
pharmaceutical industry documents the presence of sizable spillover ef- 
fects over time (Berndt et al. 1997; Montgomery and Silk 1972; Rangas- 
wamy and Krishnamurthi 1991), we expected marketing communication 
effects to span multiple periods. We therefore constructed measures of 
depreciation-adjusted stock of marketing effort (Berndt et al. 1997; Kalish 
and Lilien 1986; Rizzo 1999). Let $m_t$ be the amount of advertising in
month $t$ (in hundreds of pages), and let $\delta$ be the monthly decay rate
($0 < \delta < 1$). The stock of marketing effort $M_t$ is then defined as


$$M_t = m_t + (1-\delta) M_{t-1} = \sum_{\tau=0}^{t} (1-\delta)^{t-\tau} m_\tau. \tag{2}$$


We constructed one such variable for Lederle and one for all other competitors
combined. We assumed that the decay parameter $\delta$ (to be estimated)
was equal across companies.
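
The stock in equation (2) can be computed either recursively or as a weighted sum, and the two should agree. The sketch below checks this on a hypothetical advertising series, assuming the stock is zero before the first period; the page counts are illustrative and are not the advertising data used in the analysis.

```python
def marketing_stock(m, delta):
    """Stock via the recursion M_t = m_t + (1 - delta) * M_{t-1}."""
    stock, prev = [], 0.0  # assumes no stock before the first period
    for m_t in m:
        prev = m_t + (1.0 - delta) * prev
        stock.append(prev)
    return stock

def marketing_stock_sum(m, delta):
    """Same stock via the weighted sum on the right-hand side of eq. (2)."""
    return [
        sum((1.0 - delta) ** (t - tau) * m[tau] for tau in range(t + 1))
        for t in range(len(m))
    ]

pages = [0.10, 0.20, 0.20, 0.05]  # hypothetical, in hundreds of pages
rec = marketing_stock(pages, 0.25)
closed = marketing_stock_sum(pages, 0.25)
print(rec)
```

With a decay rate of 0.25, advertising from three months earlier still carries a weight of 0.75 cubed, or about 0.42, which is why the stock keeps rising for a while even when monthly advertising is flat.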

We have not been able to locate data on the amount of detailing effort 
by various companies marketing tetracycline, but we do not believe this 
to be a problem. Detailing effort and journal advertising are so highly 
correlated in pharmaceutical markets that either variable can be used to 
represent overall marketing effort (Berndt et al. 1997; Gatignon, Weitz,
and Bansal 1990; Lilien, Rao, and Kalish 1981; Rangaswamy and Krish- 
namurthi 1991; Rizzo 1999). 

We do not include interaction effects between marketing effort and the 
number of journals received. While such interactions would normally 
provide a sharper test of advertising effects, they would be unlikely to 
do so in this case because the journal subscription variable does not 
discriminate between exposure to commercial and scientific content. Nor 
does it distinguish scientific journals, Lederle’s newsletter, and competi- 
tors’ newsletters from one another. We see no reason why a physician 
with above-average exposure to the scientific information in medical jour- 
nals and company-specific newsletters would have been differentially sen- 
sitive to advertising or visits by detail men. Nor do we see a reason why 
the effectiveness of Lederle’s advertising would have been boosted among 
physicians who received many newsletters or were exposed to many jour- 
nal advertisements, not only from Lederle but also from its competitors 
having a stake in promoting other broad-spectrum antibiotics. A similar 
argument applies for the later entrants. Another reason not to expect a 
significant interaction between journal subscriptions and later entrants’ 
marketing efforts is that the latter’s effectiveness is likely to be low overall 
(Bond and Lean 1977; Hurwitz and Caves 1988; Shankar et al. 1998), 
resulting in a floor effect precluding the detection of significant interaction 
effects.*

Contagion variables. We define the social influence that physician $i$
is subject to at time $t$ as a function of whether other physicians ($j$) have
adopted previously (indicated by $y_{j,t-1}$) and how important each physician
$j$ is to $i$ (indicated by the social weight $w_{ij}$). The extent of social network
exposure $i$ is experiencing can then be expressed as a lagged network
autocorrelation variable $\sum_j w_{ij} y_{j,t-1}$ (e.g., Marsden and Podolny 1990;
Strang 1991). The actual social contagion, that is, influence on adoption
behavior, is then $\beta \sum_j w_{ij} y_{j,t-1}$, where $\beta$ is a parameter to be estimated.
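
As an illustration of the lagged network autocorrelation term, the sketch below computes exposure for a hypothetical four-physician network. Row-normalizing the nomination matrix is an assumption made here for simplicity and need not match the weighting actually used in the analysis. The last lines show the special case of perfect social mixing, in which every weight equals 1/N and exposure reduces to the fraction of prior adopters.

```python
def row_normalize(A):
    """Turn a 0/1 nomination matrix into weights w_ij that sum to 1 per row.

    Rows without any nomination stay all-zero.
    """
    W = []
    for row in A:
        total = sum(row)
        W.append([a / total if total else 0.0 for a in row])
    return W

def network_exposure(W, y_prev):
    """Exposure of each physician i: sum over j of w_ij * y_{j,t-1}."""
    return [sum(w * y for w, y in zip(row, y_prev)) for row in W]

# Hypothetical nominations: A[i][j] = 1 if i names j as a contact.
A = [
    [0, 1, 1, 0],
    [1, 0, 0, 0],
    [0, 1, 0, 1],
    [0, 0, 0, 0],
]
y_prev = [1, 0, 0, 1]  # physicians 1 and 4 had adopted by t-1

exposure = network_exposure(row_normalize(A), y_prev)

# Perfect social mixing: every weight equals 1/N.
N = len(y_prev)
mixing = network_exposure([[1.0 / N] * N for _ in range(N)], y_prev)
print(exposure, mixing)
```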

Coleman et al. (1966) had 228 physicians interviewed. Each physician 
was asked to name up to three other physicians with whom he discussed 
medical practice, and up to three physicians from whom he sought advice 
about medical practice. However, Coleman et al. had prescription data 
collected only for general practitioners (N = 125) and not for specialists.
There are two approaches to this missing data problem. Burt (1987) and 


* To check these a priori reservations, we performed analyses including interaction 
terms between number of journals received and marketing stock of both Lederle and 
the other entrants. As expected, the interaction terms were not significant (P > 0.10).
The point estimates and significance test of all the other parameters were strikingly 
similar to those reported below in this article. Detailed results of these additional 
analyses are not reported here but are available on request from the authors. 
Marsden and Podolny (1990) assumed that the adoption of tetracycline
by specialists affected generalists’ decision to adopt; consequently these 
authors imputed adoption dates for specialists (though they only analyzed 
the generalists’ adoptions). In contrast, Coleman et al. (1966) and Strang 
and Tuma (1993) assumed that generalists did not take specialists’ adop- 
tion behavior into account; these authors thus do not consider the latter’s 
missing adoption data when constructing the social influence variables. 
We used both approaches, as Strang and Tuma (1993) suggested that 
different assumptions about the effects of specialists on generalists may 
have caused different reanalyses to find evidence of social contagion or 
not. 

We constructed two types of exposure variables, each assuming a different
influence mechanism represented by the $w_{ij}$ weights. The direct ties
weights reflect whether $i$ nominated $j$ as an interaction partner for advice
or discussions, such that $i$ might have gained information or experienced
social normative pressure from $j$. We also constructed weights of structural
equivalence indicating whether $i$ might mimic $j$ out of fear of losing out
in the competition for status. We operationalized structural equivalence
as the proportion of exact matches between two physicians’ sets of relationships
with third parties: thus, the more their portfolios of relationships
overlap, the higher the weight they give to one another. Appendix B details
how we constructed the network exposure variables $\sum_j w_{ij} y_{j,t-1}$.
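
One way to read "proportion of exact matches" is sketched below. It counts a third party as a match only when both physicians have the same outgoing and the same incoming tie to that party. This reading is an assumption made for illustration; the construction in appendix B may differ, and the network shown is hypothetical.

```python
def structural_equivalence(A, i, j):
    """Share of third parties k with whom i and j have identical relations.

    A[i][k] = 1 if i nominates k. A third party counts as a match only if
    both the outgoing ties (i->k, j->k) and the incoming ties (k->i, k->j)
    agree. This matching rule is an illustrative assumption.
    """
    third_parties = [k for k in range(len(A)) if k != i and k != j]
    matches = sum(
        1
        for k in third_parties
        if A[i][k] == A[j][k] and A[k][i] == A[k][j]
    )
    return matches / len(third_parties)

# Hypothetical nominations among four physicians.
A = [
    [0, 1, 1, 0],
    [1, 0, 1, 0],
    [0, 0, 0, 1],
    [0, 0, 1, 0],
]

se_01 = structural_equivalence(A, 0, 1)  # same ties to physicians 2 and 3
se_02 = structural_equivalence(A, 0, 2)  # no shared pattern of ties
print(se_01, se_02)
```

Physicians 0 and 1 relate to everyone else in the same way and so receive the maximum weight, even though the measure ignores whether they are tied to each other.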

We also used the network exposure variables constructed by Burt 
(1986): one captures word of mouth operating over direct ties and one 
captures competition for status between structurally equivalent physi- 
cians. Burt incorporated specialists' imputed adoption data in his exposure 
variables. He also used different operationalizations of the influence 
weights than we did. Because of imputation problems, Burt could not 
compute exposure to structurally equivalent colleagues for seven physi- 
cians. For one of these seven, Burt was also unable to compute a measure 
of social cohesion influence. For these few physicians, we substituted our 
measures of influence through structural equivalence and direct ties for 
Burt's missing values. 

After constructing the variables, we deleted four physicians, due to 
missing covariates. The data set for estimation contains 17 monthly ob- 
servations for 121 individuals, 105 of whom had adopted by the last 
observation period. Table 2 presents descriptive statistics for the data, 
after excluding postadoption observations irrelevant to explaining adoption.
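
The exclusion of postadoption observations corresponds to the usual person-period layout for discrete-time hazard models. A minimal sketch, assuming each physician is described by the period of adoption (or None if no adoption was observed), is shown here; the function name and the two example physicians are hypothetical.

```python
def person_period_rows(adoption_period, n_periods):
    """Expand physicians into (physician, period, adopted) rows.

    adoption_period[i] is the 1-based period in which physician i adopted,
    or None if i had not adopted by the last period. Rows after the
    adoption period are dropped because the physician is no longer at risk.
    """
    rows = []
    for i, a in enumerate(adoption_period):
        last = n_periods if a is None else a
        for t in range(1, last + 1):
            rows.append((i, t, 1 if a == t else 0))
    return rows

# Hypothetical example: physician 0 adopts in period 2, physician 1 never does.
rows = person_period_rows([2, None], n_periods=3)
print(rows)
```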

Figure 1 graphs the cumulative proportion of physicians having
adopted in each period ($= \sum_j w_{ij} y_{jt}$ assuming perfect social mixing, i.e.,
$w_{ij} = 1/N$ for all $i$ and $j$ where $N$ is the number of actors) as well as the
marketing stock $M_t$ of Lederle and the other entrants. The graph shows
that the cumulative fraction of adopters, a rough proxy for average network
exposure, closely tracks Lederle’s marketing stock, hence suggesting
the threat of a confound. Figure 2 graphs the empirical hazard rate, that
is, the number of physicians adopting in period $t$ divided by the number
of physicians not having adopted prior to period $t$, together with the two
marketing stock variables. The hazard rate increases with Lederle’s marketing
stock between periods 2 and 7. As Lederle’s marketing stock flattens,
the hazard begins to behave more erratically. It even drops in periods
9 and 10 (July-August 1954), possibly due to the summer effect. The
hazard rate picks up again between periods 10 and 15, which coincides
with increased marketing efforts by later entrants, but the increase is
irregular and not sustained after period 15. Figure 2 suggests that marketing
efforts, especially by Lederle, may have been associated with
adoption.

[Figure 1: line plot over periods 1-18 showing the proportion of physicians having adopted, the marketing stock of Lederle (x3), and the marketing stock of later entrants.]

FIG. 1. Adoptions and marketing stock. Marketing stock is defined as the depreciation-adjusted cumulative advertising volume (see eq. [2]).
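
The empirical hazard rate defined above (adopters in period t divided by physicians who had not adopted before t) can be computed as in the following sketch. The adoption periods are hypothetical and the function name is chosen for illustration only.

```python
def empirical_hazard(adoption_period, n_periods):
    """Empirical hazard per period: adopters in t / not yet adopted before t.

    adoption_period[i] is the 1-based adoption period of physician i,
    or None if no adoption was observed.
    """
    hazards = []
    for t in range(1, n_periods + 1):
        at_risk = sum(1 for a in adoption_period if a is None or a >= t)
        adopters = sum(1 for a in adoption_period if a == t)
        hazards.append(adopters / at_risk if at_risk else float("nan"))
    return hazards

# Hypothetical adoption periods for six physicians.
periods = [1, 2, 2, 3, None, None]
hazard = empirical_hazard(periods, n_periods=3)
print(hazard)
```

Because the denominator shrinks as physicians adopt, the hazard can rise even when the number of new adopters per period is constant, which is why it is plotted separately from the cumulative proportion.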


Estimation and Specification Tests 


We used both logit and probit specifications and estimated our models 
using maximum likelihood, with one exception. We estimated the marketing
effort decay parameter $\delta$ using a grid search (cf. Berndt et al. 1997)
in a model that did not feature social network exposure variables. A value
of $\delta = 0.25$ led to the highest model likelihood. Model fit was not very
sensitive to changes in the range between 0.15 and 0.30. In subsequent
analyses of models featuring both marketing effort and social network
exposure, we kept $\delta$ fixed at 0.25.
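
The grid search over the decay parameter can be sketched as a profile likelihood: for each candidate value, rebuild the marketing stock, fit the hazard model, and keep the value with the highest maximized likelihood. The version below is a deliberately simplified sketch, not the specification estimated in this article. It assumes a single marketing stock, an intercept, data aggregated by period, and hypothetical counts, and it fits the logit by Newton-Raphson with step halving.

```python
import math

def stock(m, delta):
    """Depreciation-adjusted stock M_t = m_t + (1 - delta) * M_{t-1}."""
    out, prev = [], 0.0  # assumes no stock before the first period
    for m_t in m:
        prev = m_t + (1.0 - delta) * prev
        out.append(prev)
    return out

def loglik(a, b, M, at_risk, adopt):
    """Grouped logit log-likelihood with hazard p_t = logistic(a + b * M_t)."""
    ll = 0.0
    for M_t, r, d in zip(M, at_risk, adopt):
        z = a + b * M_t
        ll -= d * math.log1p(math.exp(-z)) + (r - d) * math.log1p(math.exp(z))
    return ll

def fit_logit(M, at_risk, adopt, iters=50):
    """Maximize loglik over (a, b) by Newton-Raphson with step halving."""
    a = b = 0.0
    for _ in range(iters):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for M_t, r, d in zip(M, at_risk, adopt):
            p = 1.0 / (1.0 + math.exp(-(a + b * M_t)))
            resid, wt = d - r * p, r * p * (1.0 - p)
            g0 += resid
            g1 += resid * M_t
            h00 += wt
            h01 += wt * M_t
            h11 += wt * M_t * M_t
        det = h00 * h11 - h01 * h01
        da = (h11 * g0 - h01 * g1) / det  # Newton direction, intercept
        db = (h00 * g1 - h01 * g0) / det  # Newton direction, slope
        step, base = 1.0, loglik(a, b, M, at_risk, adopt)
        while step > 1e-8 and loglik(a + step * da, b + step * db,
                                     M, at_risk, adopt) < base:
            step /= 2.0
        a, b = a + step * da, b + step * db
        if abs(step * da) + abs(step * db) < 1e-10:
            break
    return a, b, loglik(a, b, M, at_risk, adopt)

# Hypothetical period-level data: advertising, physicians at risk, adopters.
ads = [0.3, 0.3, 0.2, 0.2, 0.1, 0.1, 0.0, 0.0]
at_risk = [100, 90, 78, 66, 56, 49, 44, 41]
adopt = [10, 12, 12, 10, 7, 5, 3, 2]

deltas = [k / 20 for k in range(1, 20)]  # grid 0.05, 0.10, ..., 0.95
profile = {d: fit_logit(stock(ads, d), at_risk, adopt)[2] for d in deltas}
best_delta = max(profile, key=profile.get)
print(best_delta, round(profile[best_delta], 3))
```

Inspecting how flat the profile is around its maximum gives the same kind of sensitivity check reported above for the range between 0.15 and 0.30.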

Fig. 2.—Empirical hazard rate and marketing stock, by period (1–18). Series shown:
empirical hazard; marketing stock of Lederle (×0.75); marketing stock of later
entrants (×0.2). The broken lines around the empirical hazard rate indicate the SE
interval (±1). We computed SEs using a logit hazard model. The interval is not
symmetric because the logit model is linear in the log odds of the hazard rate,
not in the hazard rate itself.

We checked for unobserved heterogeneity in both probit and logit models.
In the probit models, we estimated a normal mixture while allowing
the base hazard to vary freely every three months (cf. Han and Hausman
1990). In the logit models, we used the score tests developed by Hamerle
(1990) and Commenges et al. (1994). None of these tests suggested the
presence of significant unobserved heterogeneity (P > 0.10). We present
the results for the logit specification only and omit the test statistics for
unobserved heterogeneity.
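
The note to figure 2 states that the SE interval around the hazard is asymmetric because the model is linear in the log odds. A short sketch of that back-transformation follows; the hazard value and the standard error are hypothetical and the function names are illustrative.

```python
import math


def logit(p):
    """Log odds of a probability strictly between 0 and 1."""
    return math.log(p / (1.0 - p))


def inv_logit(z):
    """Map log odds back to a probability."""
    return 1.0 / (1.0 + math.exp(-z))


def hazard_interval(hazard, se_logit, width=1.0):
    """Back-transform logit(hazard) +/- width * SE to the hazard scale."""
    center = logit(hazard)
    return (inv_logit(center - width * se_logit),
            inv_logit(center + width * se_logit))


# Hypothetical values: a hazard of 0.10 with an SE of 0.5 on the logit scale.
low, high = hazard_interval(0.10, 0.5)
print(0.10 - low, high - 0.10)  # distance below versus distance above
```

The interval is symmetric on the logit scale, but for hazards below 0.5 the upper bound lies farther from the estimate than the lower bound once both are mapped back to the hazard scale.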


Results 


Table 3 reports the results for each of the four social network exposure 
variables: two for exposure through direct ties (both our own and Burt’s 
measure) and two for exposure through structural equivalence (again our 
own and Burt's measure). The first four columns (1a–4a) report the logit
coefficients in models without marketing effort. Social contagion is
significant in all four cases. Exponentiating the social contagion coefficients
reported in table 3 indicates that the odds of adoption by someone with
maximum exposure were two to three times the odds of adoption by someone
without any social network exposure. Also, the coefficients of all
physician characteristics except for age have the expected sign and are
significant at 90% confidence or higher. Age does not have the expected
positive sign, suggesting that younger physicians did not delay adoption.
The summer dummy variable has the expected negative sign, but is either
not significant or only marginally significant.
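
To illustrate the exponentiation step: because network exposure is bounded between zero and one, the odds ratio comparing maximum exposure with no exposure is the exponentiated coefficient. The coefficient values below are hypothetical, chosen only because they bracket the "two to three times" range, and are not taken from table 3.

```python
import math


def odds_ratio(beta, exposure_high=1.0, exposure_low=0.0):
    """Odds ratio implied by a logit coefficient for a change in exposure."""
    return math.exp(beta * (exposure_high - exposure_low))


# Hypothetical logit coefficients (not the values reported in table 3).
for beta in (0.7, 1.1):
    print(beta, round(odds_ratio(beta), 2))
```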

The next four columns (1b–4b) report the results for models incorporating
marketing efforts. Adding the marketing variables improves model
fit substantially in all four cases. The coefficients of the physician
characteristics barely change, but the social contagion effects are now all
insignificant. As expected from our situational analysis, marketing efforts
by Lederle affect adoption more than peer influence or marketing efforts
by later entrants, neither of which shows a significant effect. Finally, a
model with marketing effort but without social network exposure fits
about as well as models with both types of variables (col. 5). Overall,
the results indicate that Lederle’s marketing efforts, not social contagion,
were the dominant driver increasing physicians’ hazard of adoption over
time.


Discussion of the Statistical Analysis 


Though we believe that our description of the physicians’ situation at the 
time the data were collected is accurate and that our statistical analysis 
supports its predictions, the latter involves both modeling compromises 
and data restrictions. First, we limited our analysis to interpersonal in- 
fluence associated with others’ past behavior and did not consider others’ 
contemporaneous or anticipated future actions. By ignoring contempo- 
raneous actions, we fail to capture possible joint decision making among 
physicians. By ignoring how physicians may have anticipated adoptions 
of their peers and how this in turn may have led them to adopt preemp- 
tively to maintain their status within their community, we may not have 
captured competitive contagion fully. Even though our descriptive anal- 
ysis suggests contemporaneous social contagion and prospective behavior 
were unlikely to be important for tetracycline, we did not assess these 
effects in the statistical analysis. There are both statistical (Besag 1975) 
and theoretical problems (Coleman 1990) that need to be resolved before 
considering empirically analyzing the effects of contemporaneous and an- 
ticipated actions in complex, nonrandom network structures.’ 

Second, our analysis is constrained by the lack of richness in the data 


"Strang and Tuma (1993) do model contemporaneous contagion. Their estimation 
procedure, however, does not account for the interdependence among observations 
explicitly present once one allows for contemporaneous contagion. Traditional likeli- 
hood functions for binary dependent variable models, which include discrete-time 
hazard models used in the present study, do not account for this endogeneity. Including 
a contemporaneous network exposure variable, $\sum_j w_{ij} y_{jt}$, would lead to invalid results
(Besag 1975). 
about what gets communicated through the network. We assume that
social interaction informs potential adopters only about others’ adoption, 
not their expected or achieved utility, attitudes, or other evaluations. This 
is defensible when actors do not discern internal states or outcomes of 
others. Sometimes, however, outcomes are actually communicated by 
adopters (e.g., satisfaction) or can be observed (e.g., market share gains 
or increased fundraising by organizations that implement a new tech- 
nology). Researchers having such data available can modify the model 
quite easily by substituting the relevant variable (say, $q_{j,t-1}$) for the $y_{j,t-1}$
indicator and computing network exposure as $\sum_j w_{ij} q_{j,t-1}$.

Third, our hazard models do not distinguish between two important 
stages in the adoption process: awareness followed by evaluation that is 
itself conditional upon awareness (Rogers 1995). Such separation may help 
gain a better understanding of the differential effects of advertising and 
social contagion, as the former is believed to operate mainly early in the 
decision process and the latter mainly in later stages (Rogers 1995). Mod- 
eling the effect of marketing efforts and social contagion without distin- 
guishing between awareness and evaluation might produce misleading 
results when marketing efforts are quite important in creating awareness 
and when social contagion is moderately—though still sizably—important 
in persuading actors to adopt the innovation. When both explanatory 
variables are forced into a single-stage model, the weaker social contagion 
effect may be washed out by the marketing effort, erroneously suggesting 
that social contagion was not at work.

These three caveats mean that both our analysis and those that preceded 
it may not have been able to identify very subtle social contagion processes 
affecting the diffusion of tetracycline. Yet, the caveats apply to our own 
analysis as well as earlier ones and hence do not affect the main impli- 
cation of our statistical analysis: prior evidence of social contagion in the 
Medical Innovation data was based on confounding the contextual effect 
of Lederle’s aggressive marketing efforts with social contagion. 

Our results about the absence of network effects once one controls for 
advertising clearly contradict the received view of strong network effects 
in Medical Innovation (Rogers 1995). At the same time, they also explain 
the “weak” results obtained by Marsden and Podolny (1990) and Strang 
and Tuma (1993). Marsden and Podolny estimated a Cox proportional 
hazard model, which is very similar to a discrete-time logit hazard model 
with a dummy variable for each time period. These dummies capture all 
cross-temporal variation in the mean adoption hazard and leave only 
variance within particular time periods to be explained by network ex- 
posure. Strang and Tuma incorporate lagged penetration as a covariate, 
besides lagged network terms. Lagged penetration assumes that any physician
interacts with any other physician (i.e., a constant $w_{ij}$ for all $i$ and
$j$), ignoring network structure but capturing the cross-temporal variation
in average network exposure. Similarly, our marketing variables vary over 
time but not across physicians. Hence, all three studies show that differ- 
ences in adoption across physicians within any particular time period are 
not statistically significantly associated with differences in lagged social 
network exposure. Our study, however, is the only one to provide an 
explanation for this finding grounded in a detailed situational analysis. 
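
The equivalence between lagged penetration and network exposure under constant weights can be checked directly. The sketch below assumes the constant weight is 1/n for every pair, including the diagonal, so that each row sums to one; the adoption indicators are hypothetical.

```python
def network_exposure(weights, adopted_prev):
    """Exposure of each actor i: sum over j of w_ij * y_j at t-1."""
    return [sum(w * y for w, y in zip(row, adopted_prev)) for row in weights]


def constant_weights(n):
    """Assumed form of 'a constant w_ij for all i and j': every entry is 1/n."""
    return [[1.0 / n] * n for _ in range(n)]


# Hypothetical adoption indicators at t-1 for five physicians.
adopted_prev = [1, 1, 0, 0, 0]
exposure = network_exposure(constant_weights(5), adopted_prev)
penetration = sum(adopted_prev) / len(adopted_prev)
print(exposure, penetration)
```

Every physician receives the same exposure value, equal to lagged penetration, so such a covariate varies over time but not across physicians within a period, as do the marketing variables.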


CONCLUSION 


Based on a situational analysis of the medical community’s understanding 
of tetracycline in the mid-1950s and on a new statistical analysis, we 
conclude that Medical Innovation (Coleman et al. 1966) and several sub- 
sequent studies that analyze the diffusion of tetracycline have confounded 
social contagion with marketing effects. Our analysis of the drug’s
characteristics, the way it was marketed, and the typical sources of information
within the medical community suggests that it is unlikely that social
contagion processes were important in physicians’ decision to adopt, but that
aggressive marketing efforts by Lederle may have had a significant im- 
pact. A new statistical analysis of the Medical Innovation data set, sup- 
plemented with newly collected data on advertising volume (a measure 
of marketing effort), shows that contagion effects disappear once we con- 
trol for marketing effort. Hence, we conclude that the data do not doc- 
ument that diffusion was driven by contagion operating over social net- 
works and that earlier analyses confounded social contagion with the 
effect of marketing effort. 

The danger of confounding common contextual effects with social con- 
tagion has been repeatedly discussed in the literature. Yet, given the prom- 
inence of the original Medical Innovation study and subsequent reana- 
lyses of the data (esp. Burt 1987) in the diffusion and social network 
literatures, our results are particularly noteworthy: they underscore, more 
compellingly than general methodological admonitions, that the danger 
of confounds when studying the role of social contagion in innovation 
diffusion must not be taken lightly. 


APPENDIX A 


Medical Innovation’s Silence Does Not Mean Marketing Efforts Were 
Irrelevant 


The authors of Medical Innovation did not pay much attention to mar- 
keting factors in their analysis. This oversight does not reduce the face 
validity of the conclusions of our situational analysis: Medical Innovation’s
genesis may have led its authors away from looking into the effects
of detailing and journal advertising, and the authors may not have been 
very familiar with all the institutional details of their research site. 

The genesis of the original study. Medical Innovation started off as 
an advertising effectiveness study for Pfizer, one of the sponsors of Paul 
Lazarsfeld’s Bureau of Applied Social Research at Columbia University. 
As reported by a former affiliate to the bureau, Pfizer “wanted to find out 
whether or not it should continue to advertise a new drug in the Journal 
of the American Medical Association” (Glock 1979, p. 27). Typical of the 
scholarly entrepreneurship with which Lazarsfeld funded his institute, 
this rather humdrum marketing question was converted into a sociological 
study of scholarly interest showing very few surface traces of its mercantile
origins (Rogers 1994). It is important to note that, in those days,
Pfizer did not place ads in the regular advertising section in JAMA, but 
had its own newsletter, Spectrum (that contained both ads and articles), 
inserted in each issue of JAMA. This, we believe, explains why the study 
includes multiple questions about specific newsletters and about JAMA 
but none about regular journal advertising or other medical journals 
mentioned by name (the questionnaire is reprinted in Coleman et al. 1966, 
pp. 195-205). The genesis of Medical Innovation as a study on the ef- 
fectiveness of drug house newsletters redefined into a diffusion study also 
explains the rather small amount of attention given to detailing as a source 
of influence. 

Familiarity with the research setting.—Consider the following 1993
reminiscence by James Coleman of the Medical Innovation project: 


I never saw the communities. It [Medical Innovation] was one of those 
research projects that happens while you are busy with more important 
projects. I designed the research with Herb [Menzel] and Elihu [Katz]. Herb 
and Elihu reviewed the literature on medical innovation. A team of inter- 
viewers came out to Illinois from the Bureau to talk to the doctors, and 
Sidney Spivik searched the prescription records. The questionnaires went 
back to Columbia to be keypunched, a set of cards were sent to me for 
analysis, and the research report was published in Sociometry [1957]. (As
cited in Burt 1997; square brackets added by Burt.)


This recollection indicates that neither Coleman nor his two coauthors 
had firsthand knowledge of medical practice in the four Illinois com- 
munities. Hence, the fact that Medical Innovation does not emphasize 
the role of marketing efforts in the diffusion of tetracycline cannot be 
used to infer that those efforts were unimportant. 

APPENDIX B


Procedures Used to Create Social Influence Weights 


Our analysis uses both discussion and advice relationships. Using the 
network data of all 228 physicians, we constructed the social weight ma- 
trices for each of the four cities separately in a series of steps. 

Step 1.—First, we created adjacency matrices with element $a_{ij} = 1$ if
$i$ mentions $j$; 0 otherwise. We created two such adjacency matrices for
each city: one for discussion ties and one for advice ties. 

Step 2.—Since being discussion partners is a naturally reciprocal re- 
lationship, we symmetrized the discussion adjacency matrix (Alba and 
Kadushin 1976). 

Step 3.—We constructed a pooled adjacency matrix by adding the sym- 
metrized discussion matrix and the advice matrix, treating discussion and 
advice as indicators of a common underlying variable “interacting with.” 
We also performed analyses (not reported here) keeping discussion and 
advice separate. This separation did not affect the results. 

Step 4.—We constructed two different weight matrices to account for 
various network contagion mechanisms. Direct tie matrices are identical 
to the adjacency matrices. We computed structural equivalence weights 
as the proportion of exact matches between two physicians’ set of rela- 
tionships with third parties. A valid match required that the physicians 
had at least one common third party, which implies that actors without 
any common third party did not put any weight on each other’s actions. 

Step 5.—We deleted all rows and columns referring to physicians who
were not among the 125 included in the prescription sample. 

Step 6.—We set all diagonal elements to zero and normalized all rows such
that (1) $w_{ii} = 0$, and (2) $\sum_j w_{ij} = 1$ if and only if $w_{ij} \neq 0$ for some $j$, and
$\sum_j w_{ij} = 0$ otherwise. This row normalization implies that physicians are
sensitive to the proportion rather than the number of relevant others who 
have adopted, and it ensures that each network exposure variable is 
bounded between zero and one. 

After performing these six steps, actor $i$’s social network exposure at
time $t$ can be computed as $\sum_j w_{ij} y_{j,t-1}$.
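
The six steps can be sketched in Python as follows. This is an illustration under stated assumptions rather than the procedure actually used: symmetrizing takes a tie as present if either party names the other, the structural equivalence function implements only one possible reading of "proportion of exact matches" (based on outgoing ties to third parties), and all function names and toy nominations are hypothetical.

```python
def adjacency(n, nominations):
    """Step 1: a[i][j] = 1 if i mentions j, else 0."""
    a = [[0] * n for _ in range(n)]
    for i, j in nominations:
        a[i][j] = 1
    return a


def symmetrize(a):
    """Step 2 (assumed rule): a tie exists if either party mentions the other."""
    n = len(a)
    return [[max(a[i][j], a[j][i]) for j in range(n)] for i in range(n)]


def pool(a, b):
    """Step 3: add two adjacency matrices element by element."""
    n = len(a)
    return [[a[i][j] + b[i][j] for j in range(n)] for i in range(n)]


def structural_equivalence(a):
    """Step 4 (one possible reading): share of third parties k to whom i and j
    relate identically; zero when i and j have no third party in common."""
    n = len(a)
    w = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i == j:
                continue
            others = [k for k in range(n) if k not in (i, j)]
            shared = any(a[i][k] and a[j][k] for k in others)
            if shared:
                matches = sum(1 for k in others if a[i][k] == a[j][k])
                w[i][j] = matches / len(others)
    return w


def restrict(w, keep):
    """Step 5: drop rows and columns of physicians outside the sample."""
    return [[w[i][j] for j in keep] for i in keep]


def row_normalize(w):
    """Step 6: zero the diagonal, then scale each nonzero row to sum to 1."""
    out = []
    for i, source_row in enumerate(w):
        row = [0.0 if i == j else float(v) for j, v in enumerate(source_row)]
        total = sum(row)
        out.append([v / total for v in row] if total > 0 else row)
    return out


def exposure(w, adopted_prev):
    """Network exposure of each actor i: sum over j of w_ij * y_j at t-1."""
    return [sum(wij * y for wij, y in zip(row, adopted_prev)) for row in w]


# Hypothetical toy network of four physicians; physician 3 is outside the
# prescription sample and is dropped in step 5.
discussion = symmetrize(adjacency(4, [(0, 1), (1, 2)]))
advice = adjacency(4, [(0, 2), (3, 0)])
pooled = pool(discussion, advice)
direct = row_normalize(restrict(pooled, [0, 1, 2]))
equivalence = structural_equivalence(pooled)
print(exposure(direct, [0, 1, 0]))
```

Row normalization is what bounds each exposure value between zero and one, so a physician with one adopting contact out of two has the same exposure as one with five out of ten.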

Stirrings in the Jug: Black Politics in the Post-Segregation Era. By
Adolph Reed, Jr. Minneapolis, Minn.: University of Minnesota Press, 1999.
Pp. xiii+303. $47.95 (cloth); $18.95 (paper).


Social Dominance: An Intergroup Theory of Social Hierarchy and Op- 
pression. By Jim Sidanius and Felicia Pratto. New York: Cambridge 
University Press, 1999. Pp. x+403. $49.95. 


Lawrence D. Bobo 
Harvard University 


Depending upon whom you consult, racial cleavages are chimerical 
boundaries of diminishing import in the United States or else they con- 
stitute durable fissures in the body politic. Adolph Reed’s provocative 
collection of essays in Stirrings in the Jug and Jim Sidanius and Felicia
Pratto’s Social Dominance both belong in the latter camp (though Si- 
danius and Pratto develop a far more pessimistic analysis than does Reed). 
In a time when almost any Pollyannaish treatment of race can be heralded 
as bold thinking by the mainstream press and by more than a few scholars, 
these two careful and sobering books demand serious consideration. 

The two books set very different scopes of analyses. Reed tackles a 
problem of more tractable scale: black politics in the United States in the 
postsegregation era. He delivers a telling and always engaging analysis. 
Sidanius and Pratto pursue the daunting goal of crafting a universal, 
cross-cultural theory of social hierarchies integrated across micro- and 
macrolevels of analysis. They deliver on a goodly portion but far from 
all of this grand ambition. 

Stirrings in the Jug pivots off four recurrent themes. First, Reed opens 
his collection of essays written between 1986 and 1999 with a lengthy 
critique of the tendency to view the black public and black politics as if 
they were unitary and expressed simply apprehended corporate interests. 
There are important lines of stratification and ideological and issue-specific
divisions within the black community that are too often glossed
over (intellectually and as a matter of pragmatic political action). However,
this assumption of a monolithic black community has been nurtured
and exploited both by politicians seeking office and by naive radicals 
seeking a claim to authenticity. 

This assumption of uniform and obvious corporate interests leads to 
Reed’s second major theme: that there is great tension, disconnect, and 
lack of accountability between black leadership and the communities for 
whom they claim to speak. With the success of black electoral politics, 
especially at the local level, many previously activist voices and organizations
were made a part of the race relations management apparatus of
the modern monopoly capital-dominated state. Black mayors, who ride 
to office on the hopes of the mass of working-class blacks, end up serving 
largely their downtown and suburban corporate development benefactors. 
The latter may open the doors of business to a highly select group of 
middle-class blacks (and often only after bitter contestation) but do noth- 
ing to improve the lives of most black citizens on whose needs and interests 
black officials’ very claim to legitimacy rests. 

This state of affairs arose, according to Reed’s third major theme, 
because of failures of vision and analysis on the left, particularly among 
the black left. As civil rights victories took place and more blacks were 
elected to office, the left failed to develop a pragmatic analysis of how 
this was transforming the political landscape. Ultimately the left was 
unable to develop a critical response once the new elite, insider-brokerage 
model gained ascendancy. The fourth and final major theme is that this 
rise consolidated the power of monopoly capital. Lacking a critical vision 
or a mobilized public, black elected officials had to compromise with the 
progrowth development coalitions that tend to dominate local politics 
(though according to Reed they compromise even more than political 
exigencies would require). The end result is a demobilizing appearance 
of significant empowerment but the underlying reality of an inability to 
deliver significant benefits to one’s constituents. Although Reed engages 
many other topics—the Atlanta mayoral regimes of Maynard Jackson and 
Andrew Young, the poisonous Reagan-Bush years, the Jackson candidacies
of 1984 and 1988, the rise of Democratic Leadership Council
dominance within the Democratic Party, scholarship on the “urban
underclass”—they mainly reflect the dynamics of these larger trends.

A problem with Reed’s analysis is that he sometimes attacks caricatured
versions of other scholars’ work. For example, he chastises Katherine 
Tate (From Protest to Politics: The New Black Voters in American Elec- 
tions [Harvard University Press, 1993]) and Michael Dawson (Behind the 
Mule: Race and Class in African American Politics [Princeton University 
Press, 1994]) for assuming a monolithic black community with transparent
corporate interests. This is a serious distortion of what both scholars do,
what they find, and what they each argue. In particular, the Dawsonian
approach is one designed to try to find different ideological strands and 
groupings within the black population. As he argues, a history of op- 
pression and an array of contemporary, heavily sedimentary inequalities, 
contribute to a clear, empirically documented sense of shared corporate 
identity and interest. Reed writes as if he is the only analyst who can use 
the phrase “black community” without succumbing to a disabling as- 
sumption of black political uniformity and untenable presumptions of 
authentic group voice. This sort of caricatured treatment of others’ work
is no less in evidence in Reed’s treatment of William Julius Wilson’s work 
on modern urban poverty. 



Part of the force of Reed’s argument, ironically, turns on an assumption 
that he does very little to validate: namely, that there are significant 
strands of radical thought and ferment—or at least a potential of more 
radical mobilization on some issues some of the time—within black com- 
munities. The idea is not so much false as it goes without direct sub- 
stantiation in the argument and evidence Reed amasses. Indeed, on this 
score, Tate and especially Dawson are much better sources of information. 
If there are serious grassroots efforts of a potentially radical bent in black 
communities, Reed takes few steps to identify them. Instead, he rigorously 
chronicles the meager gains, legions of compromises, and sometimes abys- 
mal failures of the elite, insider-brokerage approach adopted by so many 
black elected and appointed officials. 

These limitations notwithstanding, Stirrings in the Jug is a trenchant, 
sometimes bare-knuckled, but always thickly reasoned assessment of the 
transformation and demobilization of African-American politics in the 
post-civil-rights era. Anyone interested in a historically astute, grounded, 
yet critical, perspective on the stagnation of an organized and contentious 
progressive thrust to African-American politics will find Reed’s work 
timely and insightful. 

Sidanius and Pratto fashion a three-part project. First, they synthesize 
a range of theories in order to develop the social dominance model and 
establish their core variables and measures. This is the most original, 
tightly argued, convincing, and important part of the book. Sidanius and 
Pratto sensibly parse theories of prejudice, stereotyping, and group con- 
flict, as might be expected of social psychologists studying intergroup 
dynamics. Less expected, but no less central to their argument (and no 
less well executed), is their discussion of theorists of ideology such as 
Marx, Mosca, Michels, and Pareto. They posit that all human social sys- 
tems that generate any significant economic surplus exhibit three strati- 
fication systems based in age, gender, and what they term “arbitrary-set” 
groups. Arbitrary-set groups include social divisions based on race, eth- 
nicity, class, language, religion, nation, and so on, where there is a clear 
privileged or dominant group and a clear disadvantaged or subordinate 
group. The proximal factor in the maintenance of social hierarchies is the 
operation of social dominance orientation (SDO). SDO is defined as “a 
very general individual difference orientation expressing the value that 
people place on nonegalitarian and hierarchically structured relationships 
among social groups. It expresses general support for the domination of 
certain socially constructed groups over other socially constructed groups” 
(p. 61). They show very clearly that this construct is different from 
authoritarianism, conservatism, and other value orientations; that it re- 
lates as predicted to an array of other ideological perspectives; that it 
relates to gender (men scoring higher than women) and group status 
(dominants higher than subordinates) as expected; and that it relates 
strongly to such issue attitudes as opposition to affirmative action. 

In the second major component of their project, Sidanius and Pratto 
develop a detailed review of the evidence on contemporary discrimination
in the consumer, labor, and housing markets; in health care provision and 
outcomes; in educational institutions; and in the criminal justice system. 
This review, especially that concerning blacks in the U.S. criminal justice 
system, serves to support their claim that members of subordinate
arbitrary-set groups systematically receive a lesser share of valued social goods
and outcomes and a greater share of unwanted social outcomes. Most of 
this summary focuses on the case of blacks in the United States, but it 
also draws frequently on data from around the world. In addition, the 
summary provides much of the evidence for their claim that subordinate 
group males (i.e., black men, not black women) are disproportionately the
targets of oppression, a direct rejection of the widely accepted double
jeopardy hypothesis. In the third and surely most controversial component
of the project, they aim to show two things: (1) that subordinate
group members collaborate in their own oppression (via ideological in- 
corporation or false consciousness and via self-debilitating behaviors such 
as gang violence or weak academic commitment); and (2) that women are 
uniformly less dominance oriented than men as a direct reflection of highly 
debatable evolutionary reproduction and mate selection imperatives. 

There are three problems with the social dominance approach. First, 
a generic individual difference variable like SDO does not give us much 
leverage in understanding social change processes or variation in the level 
of oppression that occurs across significant social units (such as occurs 
between regions or nations). Thus, despite the unambiguous proof 
reviewed by Sidanius and Pratto that discrimination against African- 
Americans still occurs, they can say little about how or why the levels, 
consistency, or pervasiveness of that discrimination have changed. Second,
data from more than 30 samples (usually convenience samples of college 
students) involve a great range of measures, and various correlations and 
comparisons are made. At times, a reader becomes lost in the vast as- 
semblage of data. The project might have been made more immediately 
accessible and, indeed, convincing, if fewer samples and fewer measures, 
all selected on the basis of high theoretical clarity and centrality, had been 
the basis for the book. 

Third, the case for the “subordinate male target hypothesis” (SMTH) 
and against double jeopardy is overstated. Sidanius and Pratto’s own 
review of evidence shows that women are more likely to drop out of 
school and suffer from illiteracy in countries such as Algeria, Kenya, and 
India. More important, in the U.S. case, they sidestep the question of the 
absolute earnings advantage of black men or why black male/white female 
marriages would be historically more common than white male/black 
female unions if black men invariably occupy the most uniquely disad- 
vantaged position. Nor, for instance, do they consider recent evidence of 
especially severe and disproportionate discrimination against black 
women by customs officials. 

Yet, it should be stressed that Social Dominance is a work of potentially 
foundational importance for the study of intergroup attitudes, prejudice,
racism, and discrimination. Sidanius and Pratto shed new light on the 
ubiquitous phenomena of group-based social hierarchy and the social 
psychological and institutional mechanisms that maintain it. While not 
every joint and connection in the great edifice that Sidanius and Pratto 
attempt to erect is a seamless one, there is a great deal of value here for 
any serious student of social inequality. 


Bad Boys: Public Schools in the Making of Black Masculinity. By Ann 
Arnett Ferguson. Ann Arbor: University of Michigan Press, 2000. Pp. 
1i+256. $29.95. 


Howard Pinderhughes 
University of California, San Francisco 


“That one has a jail cell with his name on it.” So begins Ann Ferguson’s 
Bad Boys, with this statement by an elementary school staff person about 
a 10-year-old boy. Ferguson’s Bad Boys is an excellent, often poignant, 
examination of the role of schooling in the marginalization and criminalization
of African-American boys. Ferguson provides a provocative and
persuasive ethnographic account of the racialized and gendered institu- 
tional practices and treatment of African-American boys in an interme- 
diate school (grades 4-6) in a medium-sized city on the West Coast. The 
book is based on extensive participant observation and interviews in the 
school as well as with the families of the African-American boys who are 
the subject of the analysis. Ferguson spent the better part of a year fol- 
lowing two groups of African-American students through their academic 
year: the “schoolboys,” who were doing well academically, and the “troublemakers,”
who spent substantial portions of their school day in the
“punishment room” or in after-school detention in the “jailhouse.” Fer- 
guson’s analysis provides a vivid picture of how public schools place some 
young African-American boys on a track toward incarceration or control 
in the criminal justice system, the situation nationwide for nearly one- 
third of African-American males in their twenties. 

Ferguson takes on the problem of educational inequality characterized 
by a huge gap in academic achievement and the disproportionate aca- 
demic failure and incarceration of African-American males, fundamental 
issues at the center of the current crisis in urban education. Bad Boys 
provides an account and a sophisticated analysis of the experiences and 
perspectives of the boys themselves. Ferguson’s analysis builds on Bour- 
dieu’s concept of symbolic violence through punishment practices that 
enforce cultural hegemony and Foucault’s conception of discipline as a 
central element of social control embedded in dynamics of power that are 
reinforced through the production of social identities of “good” and “bad,” 
“gifted” and “troubled.” She describes the use of rules to enforce normalized
behavior based on dominant cultural norms and to isolate and
demonize alternative cultural expressions. Ferguson succeeds in providing 
an intersectional analysis of how race, gender, and class dynamics combine 
in the institutional practices and treatment of the African-American boys 
she follows. The boys are labeled and marginalized because of the com- 
bined dominant construction of their blackness and their masculinity. 
Ferguson argues that the culture and institutional policies and practices 
of the school are instrumental in the creation, maintenance, and inter- 
nalization of the predominant societal images of black males, as either 
criminals or endangered species. 

There are two dimensions to Ferguson’s account and analysis of the 
experiences of the troublemakers. First, the institutional culture of the 
school is examined through its policies, practices, and treatment of the 
boys. Second, Ferguson analyzes the behavior, ideas, and understandings 
of the boys who are negotiating daily humiliation and marginalization. 
Ferguson’s analysis is a nuanced one that challenges recent theories of 
oppositional culture among black students. Ferguson describes the be- 
havior of many of these boys as defiant and oppositional. She explains 
their oppositional behavior as a construction of self through defiance and 
fighting. She argues that the powerful effects of academic and social mar- 
ginalization create the oppositional behavior and identity. It is here that 
her comparison of the troublemakers and the schoolboys is most useful. 
The schoolboys are described as adopting a multicultural, nonracial iden- 
tity that is less threatening and that conforms to the dominant culture of 
the school. The troublemakers adopt a raced perspective that uses race 
as a tool to feel a sense of individual power in a context where they are 
powerless. In the school context, blackness is equated, by the institution 
and the boys themselves, with trouble and the disruption of order. The 
embracing of this behavior and identity on the part of the boys is analyzed 
not simply as oppositional behavior but resistance to the oppressive forces 
of institutional racism that, in their minds, subject them to humiliation, 
marginalization, and differential treatment because of their race. 

If there is one criticism I have of the book it is that we hear more about
the boys’ lives from their parents, teachers, and school staff than we do
from the boys themselves. It would be useful and important to understand 
exactly how an 11-year-old constructs and expresses his view of his ra- 
cialized, marginalized situation. Ferguson’s analysis is very well organized 
and persuasive but would be strengthened by a stronger voice from the 
boys. Yet even without a strong voice from the boys, Ferguson’s Bad 
Boys is an engaging and important book that should be required reading 
for scholars and students studying education, race relations, criminal jus- 
tice, social reproduction, or child psychology. Few contemporary authors 
have supplied such a cogent analysis of the effects of public schools on 
African-American boys and how we prepare them to take their place in 
the jail cells of America. 

Hair Matters: Beauty, Power, and Black Women’s Consciousness. By Ingrid
Banks. New York: New York University Press, 2000. Pp. x+210.
$55.00 (cloth); $17.50 (paper). 

In her book, Hair Matters: Beauty, Power, and Black Women’s Consciousness,
Ingrid Banks offers a powerful analysis of the cultural reso-
nance of hair in contemporary U.S. society. Beginning with the claim that 
“hair is a means by which one can understand broader cultural issues” 
(p. 4), Banks convincingly argues throughout the book that if cultural 
theorists are interested in racial politics and identity in a broad sense and 
if they are committed to understanding what race means in lived com- 
munities, then understanding why hair matters to black women and girls 
is essential. Hair is not simply a matter of style; according to Banks, it 
is, more fundamentally, a cultural discourse on black women, identity, 
and power relations in the United States. 

To trace how hair operates as a rich symbol for identity, power, and 
subjectivity for black women, Banks adopts an ethnographic approach, 
interviewing 61 black girls and women (43 individual interviews and five 
focus group sessions). The questions she poses are deceptively simple (pp. 
18-19): Why does hair matter among African-American women? Is hair 
associated with power in any way? And, do African-American women 
have a choice or voice about the way they wear their hair? What emerges 
from her interviews is a fascinating ethnographic account of black women 
of varied age and income, speaking about hair and its relation to identity, 
gender politics, racial solidarity, generational differences, and image cul- 
ture. Among other things, this account makes it unmistakably clear that 
“hair shapes black women’s ideas about [not only] race, gender, class,
sexuality, [and] beauty, [but also] power” (p. 3). However, although Banks 
makes gestures in the direction of linking these ideas to understandings 
of broader social and cultural dynamics, she does not quite engage this 
relationship in a more substantive fashion. 

Banks insists that black women’s experiences must be regarded as 
compelling data in contemporary projects that aim to “figure out” the 
complex cultural politics of hair. The ethnographic data in Hair Matters 
challenges many conventional assumptions, among them the notion that 
black women who straighten or relax their hair are engaging in a kind 
of self-hatred by “trying to be white.” Although a quick glance at the 
American mass media landscape suggests that the idealized subject of 
desire is most often imagined in the form of a white woman—complete 
with long, flowing, and often blond hair—the politics of identity are con- 
siderably more complicated. Many of the women Banks interviewed, for 
example, gave evidence that the politics of hair—and identity more gen- 
erally—are only insignificantly about wanting to be or to look white. As 
one of the interviewees says, “I’ve known women who have permed hair
who I think are more at peace with themselves and grounded in their 
history and black people. And then there are people with locks who treat 
you worse than white people would treat you” (p. 68). Banks makes a 
significant contribution to the otherwise thin body of literature on the 
relationship of black women to mediated visions of ideal femininity. In- 
deed, when the relationship of African-American women to beauty culture 
is consistently read through the lens of not being “good enough,” the 
complexities of navigating both racial and gender identity are reduced to 
a simple dynamic of imitation. Banks’s ethnography confronts this kind 
of reduction and offers a much more nuanced glimpse at identity 
formation. 

Interestingly, Banks’s privileging of ethnographic data is the book’s 
strength as well as weakness. In one frame, the experience of black women 
is undertheorized, and Banks does not successfully link this experience 
with a broader conversation on race in American culture. There are a 
number of different ways in which she could have produced this link; for 
example, the economy of hair and, more generally, of beauty culture could 
have been analyzed as a specific component of racial identity formation. 
And although she briefly describes the history of African-American 
women and their relationship to hair, there is not a substantial view of the
history of “hair politics” in this book. Moreover, there is a missed oppor- 
tunity more generally to theorize the social construction of women’s ex- 
perience itself, a dilemma all ethnographers face. Quite certainly, women 
who have been positioned outside of dominant beauty norms and in the 
specific communities Banks discusses will have a fuller understanding of 
the complexities of the discourses examined, but nevertheless, most readers
are left to fill in some of the blanks. Indeed, it is a fascinating conversation,
but the reader is a kind of eavesdropper here and is left to create
the framework for the story. 

However, this does not detract from the important contributions Hair 
Matters makes for understanding the multidimensional relationship 
African-American women have with black beauty culture, white beauty 
culture, and female empowerment. Banks forcefully demonstrates that 
black girls and women “use hair as a medium to understand complex 
identity politics that intersect along the lines of race, gender, class, sex- 
uality, power, and beauty” (p. 148). Banks’s ethnography demonstrates 
that issues of style and personal identity are not incidental or peripheral 
to a person’s sense of the political. Rather, those issues of personal 
identity—of hair—“matter” in critical and important ways. 

The Color of Freedom: Race and Contemporary American Liberalism. By
David Carroll Cochran. Albany: State University of New York Press, 
1999. Pp. ix+207. $17.95 (paper). 


Charles W. Mills 

Blacks have historically had a somewhat ambivalent relationship with
liberalism. Too often, liberty, equality, and fraternity have turned out to 
be marked “for whites only.” But as the dominant ideology of the modern 
period, liberalism’s promise of moral and juridical egalitarianism has 
continued to attract those seeking an end to the system of ascriptive 
hierarchy known as race. Though some black theorists have been pushed 
in more radical directions, whether pan-Africanist or Marxist, most still 
hope that, despite all the disappointments, liberalism can be reconstructed 
to be an effective tool against white domination. 

David Cochran’s book needs to be situated as part of this project. He 
sees liberalism as “the most important source of [our] normative under- 
standings,” and thus as still the best choice for black political activists (p. 
2). However, liberalism now needs to be rethought. 

His first three chapters review the strengths and weaknesses of “color- 
blind liberalism,” which has been the dominant public philosophy of race 
since World War II. Color-blind liberalism is marked by “its individualism, 
its focus on rights, its emphasis on fair procedures, its dedication to equal- 
ity of opportunity, and its commitment to the principle of nondiscrimi- 
nation” (p. 18). The civil rights struggle of the 1950s and 1960s was 
thus able to frame its demands in terms of “closing the gap” between 
these ideals and the actual practices of discrimination against African- 
Americans. Supported by the federal judiciary and political elites, civil 
rights leaders were able to win dramatic victories in ending formal dis- 
crimination and in changing white public attitudes (pp. 28-33). 

But after the mid-1960s, the movement began to fragment. With the 
defeat (at least in public discourse) of old-fashioned bigotry, the debate 
now shifted to the conflict between supporters of broad and narrow read- 
ings of the color-blind paradigm, liberals and conservatives. And this 
battle has, of course, basically been won by the right. Discrimination is 
narrowly construed, and remedial measures (busing, affirmative action, 
etc.) are widely opposed by the white population. Cochran’s third chapter, 
“The Limits of Color-Blind Liberalism,” reviews the frustrations of the 
present situation. While legal and formal equality have been achieved, 
“an unjust racial status quo” remains, manifested in lower black incomes, 
net worth, and life expectancies, along with higher poverty and unemployment
rates; a startling 50% of the prison population is now black
(pp. 46-53). 

The left’s failure to make its broader interpretation of discrimination 
plausible is, of course, linked in part to the national and global victory 
of conservatism. But it also has conceptual roots. Cochran argues that
the categories of color-blind liberalism, in focusing on the state and on 
formal political and legal institutions, obfuscate the significance of civil 
society in shaping blacks’ lives. Informal and subtle processes of discrimination
in education, housing, loan applications, and the job market are
deeply embedded in the structure of everyday life. The individualism of 
color-blind liberalism makes it insensitive to the pervasive realities of 
culture and group membership, and of general white-skin privilege, in 
determining how people are perceived. 

Chapters 4 and 5 set out Cochran’s positive thesis. Freedom is “at
the heart of liberalism” (p. 67), but it is usually construed in purely negative 
terms, as the removal of “external barriers to choice.” Autonomy, on the 
other hand, has the connotations of “self-rule” and, as such, is sensitive 
to both external and internal impediments to self-realization. Autonomy 
is thus the key normative lever for Cochran. 

Cochran draws on the classic writings of John Stuart Mill, as well as 
contemporary philosophers Joseph Raz, Charles Taylor, and Will Kym- 
licka, to argue for an ideal of autonomy that recognizes the role of culture 
and civil society in enabling or hindering an autonomous life. To the 
extent that black and white Americans constitute distinct cultural groups 
and de facto segregation persists, it makes sense to refer to a black civil 
society. Black civil society represents “an alternative cultural, social, and 
economic space,” which has been crucial in providing resources for the 
achievement of black autonomy (p. 110). Cochran lists the many ways in 
which blacks have been empowered by this alternative space and also 
analyzes some of the disabling forces that currently threaten black au- 
tonomy. Finally, in chapter 6, he summarizes what he sees as the most 
promising public policy approaches, arising out of the book’s expanded 
liberal understanding of race, for revitalizing black civil society and thus 
enhancing black autonomy. 

Cochran’s book succeeds admirably in combining abstract political the- 
ory with socioeconomic facts. He makes a very strong case, in fact one 
of the best I have seen, for the continuing relevance to the black struggle 
of a liberalism reconstructed in its normative emphases and its factual 
presumptions through taking race and white domination seriously. Of 
course, as Rogers Smith’s recent Civic Ideals (Yale University Press, 1997) 
has definitively documented, U.S. political culture has “multiple 
traditions,” and white supremacy is also very much a “source of our 
normative understandings.” So the ideals Cochran identifies as levers of 
change are themselves going to be shaped and inflected by this rival, 
exclusionary normative system. But this problem will be faced by any 
moral challenge to the racial order, liberal or not. To the extent that 
liberals, for internal theoretical reasons, have been ceding political ground 
to the right on race, Cochran provides a powerful defense for the liberal 
legitimacy of continuing state intervention in the cause of racial equality. 

The Cultural Territories of Race: Black and White Boundaries. Edited by

Just as Joseph Goebbels reputedly reached for his revolver whenever he
heard the word culture, some social scientists seem to become apoplectic
whenever they hear the word conjoined with race. At least since the 
brouhaha over Daniel Patrick Moynihan’s eponymous report on the black 
family, social scientists have risked controversy by uttering the two words 
in one breath, often facing incendiary charges of conservatism, if not 
reactionary racism, to boot. Certainly, conservative intellectuals, such as 
Thomas Sowell, Dinesh D’Souza, and Charles Murray, have been brazen 
about attributing the conditions of poor African-Americans to their cul- 
ture—that hazy constellation denoting attitudes, beliefs, meanings, and 
values—and thereby rendering the term disreputable for the high minded. 
In fear of blaming the victims, well-intentioned academics have been loath 
to analyze African-American culture, save perhaps for cultural studies 
aficionados of gangsta rap and other putative expressions of counter- 
hegemonic art (see Michael Dawson’s essay in the volume under review). 

In The Cultural Territories of Race, Michèle Lamont and her colleagues
have brought culture back into the study of race, or, more accurately, of 
African-Americans. They seek not only to recuperate the term from the 
clutch of conservative intellectuals, but also to demonstrate its salience 
in understanding race relations, racial identity, and racism. They hope to 
go beyond empty formalism and reactionary rhetoric by, in the words of 
the editor, “gaining theoretical leverage from a close ‘corps à corps’ with
their data.” Although this collection does not achieve the rare distinction
of being more than the sum of its parts, it does contain some excellent 
essays that make it required reading for serious students of race relations. 
Elijah Anderson opens with a fascinating ethnographic foray into the 
world of the black executive. Reworking Erving Goffman’s classic work 
on the spoiled identity, he reconstructs the racial tension and negotiation 
between white and black executives. Mary Waters depicts West Indians 
as providing “the comfort factor” for whites by providing “a black face 
for whites to look into without seeing the sorry history of American race 
relations mirrored back.” She also draws a useful distinction between 
structural and interpersonal racism and judiciously places the West Indian 
“success story” in the context of U.S. race relations. The editor deftly 
presents the moral discourses of black and white workers, showing the 
contours of their moral boundaries and their ideas about their self-worth, 
status, and success. Katherine Newman and Catherine Ellis delineate the 
treacherous world of low-wage service workers caught between the honor 
of working and the stigma of low status. Jane Mansbridge practices the 
oft-repeated mantra of “race, gender, and class” by analyzing the origins
and trajectory of feminisms. By emphasizing the actual diversity of ac- 
tivists and their agenda, she vividly reconstructs the cleavages in what 
has become reified as the feminist movement. Jennifer Hochschild’s su- 
perb analysis of affirmative action as culture war demonstrates the nu- 
ances and ambivalence of a broad middle ground that advocates “extra 
help” for African-Americans but does not condone “reverse discrimina- 
tion.” In so doing, she criticizes social scientists and political advocates 
who clamor for the polar dichotomy of “for” and “against” affirmative 
action. 

Needless to say, there are many more fine essays here, but even my 
cursory overview should suggest the diversity not only in subject matters, 
but also in methods and perspectives. Quite clearly, bringing culture back 
in is in and of itself inadequate to resolve many of the thorny theoretical 
and political puzzles of U.S. race relations. Are family and school im- 
portant for young African-Americans, as Alford Young suggests, or is the 
workplace more significant, as Newman and Ellis imply? Is racism rel- 
atively robust and intransigent, as Waters suggests, or is it waning and 
ambiguous, as Hochschild implies? Can we usefully aggregate black work- 
ers, as Lamont does, or should we disaggregate them, as Amy Binder 
seems to imply? I am not suggesting that all these questions should gen- 
erate either/or answers. Contexts, analytical foci, and historical changes 
all matter. Consider in this regard Pamela Walters’s interesting piece on 
educational aspirations among northern urban immigrants, poor southern 
whites, and southern blacks around the turn of the 20th century. In spite 
of some significant differences, Walters underscores the commonality 
among the three groups in valorizing education as a path toward upward 
mobility. If poor southern blacks in the early 20th century had high hopes
for educational achievement, then what caused its seeming collapse in 
their poor descendants in northern cities by the late 20th century, as 
documented by several contributors? Surely, culture is neither unitary nor 
unchanging, and cultural analysis would presumably benefit from ex- 
plicitly incorporating historical and structural factors. 

The book is more of a Pandora’s box of disparate theoretical views 
and political commitments than it is a neat box of unified and uniform
voices. Having read so many superb essays, however, it would be churlish 
indeed not to welcome it as a magnificent contribution to the study of 
African-Americans in particular and U.S. race relations in general. 

The goal of this book is to examine the concept of legal equality in a
multiracial society from a philosophical and legal standpoint. It brings 
into sharp relief the tensions created by, on the one hand, the libertarian 
ideals at the heart of Lockean liberalism, invoked in today’s political 
rhetoric in the service of protecting individual freedoms, and by, on the 
other hand, the egalitarian ideals most famously pronounced in the Dec- 
laration of Independence, invoked today in the service of overcoming 
inequalities between groups. It focuses specifically on the tensions thereby 
created between the rights of the individual and the rights of groups and 
provides a comprehensive look at such conflicts in such domains as self- 
governance for indigenous people, the rights of immigrants, affirmative 
action, racial redistricting, and multicultural schooling. 

The analysis proceeds from the perspective of “critical social theory,” 
which “situates concrete social dilemmas within the framework of con- 
temporary views of justice and their prevalent legal applications. Here 
social and political ideals are located within an historical narrative in- 
corporating economics, politics, social science, and cultural anthropology” 
(p. 5). Through logical analysis of the arguments made by competing sides 
of these issues, and assessment of the factual basis for those arguments, 
it arrives at philosophical resolutions of many of these conflicts and pro- 
vides appropriate policy recommendations. 

The fundamental premise is that a group with a history of being the 
victim of dominance and oppression has certain group-based rights that 
are justified by the need to rectify past or continuing injustice. The analysis 
is said to depart from that of the current Supreme Court, in that its 
treatment of most cases of “systemic injustice” as natural outcomes of a 
neutral market underestimates the influence of government actions over 
market distributions. 

In a brief review, it is not possible to summarize the lengthy analyses 
of each of these complex issues. One theme that recurs through the book, 
however, is that African-Americans may have a special claim to group- 
based rights because of their historical oppression and domination. Their 
claims as a racial group have a legitimacy that, say, white affinity does 
not: “When a government has legally abetted in the enslavement, 
exploitation, and oppression of certain of its citizens, it is morally obligated 
to redress the harm done to them” (p. 198). The harms to African-Americans
are more grievous than those to other groups, and so their
claims to compensation are stronger as well. 

Other groups have substantial claims to group-based rights, however. 
Indigenous peoples are caught between premodern traditions, such as
patriarchy and communitarian economic forms, and modern liberal rights, 
such as gender equality or possessive individualism. Some of the latter 
are compatible with indigenous culture and need to accompany self- 
government, but some are not. Those who seek to immigrate to another 
country may face restrictions on entry, but the author concludes that 
developed nations have the right to restrict immigration only for their 
own security, subsistence, and political self-preservation. He urges more 
open borders in North America and suggests that illegal immigration is 
just the price we pay for global injustice. 

The book will serve as an excellent source of descriptive history on 
these issues, especially on the development of the relevant law. The anal- 
ysis frequently provides penetrating insights into logical inconsistencies 
or blurred distinctions in the arguments made by each side, though most 
often in arguments made by conservatives. 

Nevertheless, there are limits to the value of logical and philosophical 
analysis if one is seeking resolution of these issues at a normative level. 
The realities of politics are largely ignored in this analysis, and those 
realities in the end are based on the realities of public preferences. Those 
preferences may not be particularly responsive to logical argument, but 
may be based on passionately held values and prejudices or real interests. 
Even the “facts” that are adduced to try to resolve a conflict are, in the 
real world, often sharply contested politically and seem to be assessed 
from the normative perspective of a critical social theorist. As a result, 
many of the policy recommendations seem dependent on particular in- 
terpretations of an often ambiguous and usually contested reality, and 
indeed many seem foreordained by the author’s normative priors. For 
example, in taking up the tension between academic meritocracy and the 
persistent underrepresentation of minorities and women in the profes- 
sorate, he argues that both should be hired, even within disciplines, in 
proportion to their numbers in the general population. This conclusion 
seems to me to ignore the powerful political resistance against such a 
principle as well as the practical obstacles posed by massive inequalities 
in “the pipeline.” 

To say this should not in any way diminish the fine accomplishments 
of the work. This study makes a rigorously analytic case for the conditions 
under which the need for a more egalitarian society demands that group 
rights trump the Lockean liberalism and possessive individualism that 
Americans take so for granted. 

Poststructuralist analyses of social policy have not been as influential in
the United States as in Europe; Gail Lewis’s excellent contribution to this 
field indicates just what we have been missing. Lewis’s analysis of racial 
formation in social welfare policy and practice provides a textured and 
thoughtful analysis with important implications. Lewis clearly shows how 
deeply “race” has influenced the framing of British social welfare policy, 
its implementation, and its impact on the recruitment and experiences of 
black and Asian social workers. 

The book analyzes social welfare policy and practice from a variety of 
perspectives. Lewis begins by explicating her theoretical framework, pri- 
marily drawn from Foucault’s notion of “governmentality.” She argues 
that the government has created the black and Asian populations in Brit- 
ain as racialized subjects who must be assimilated to “British” norms. 
Social work was “the set of agencies and professions to be brought into 
the project of assimilating reconstituted black subjects to a redefined 
political agenda” (p. 42). Her aim is to show how black and Asian women 
have been integrated into professional social work in response to the 
racialization of these groups, while also denoting the complex terrain these 
workers must navigate as racialized and gendered subjects. 

Lewis explores racial formation through a number of empirical chap- 
ters, beginning at the national level and ending at the level of the ex- 
periences of social workers. She analyzes a variety of statutes to investigate 
how “race” was constituted in national government policy. Next, she ex- 
plores how two local authorities implemented national policy initiatives 
to address issues of “cultural sensitivity,” particularly through developing 
policies to recruit black and Asian social workers. She then analyzes the 
profession of social work and social work theory on “difference.” Finally, 
Lewis presents accounts from social workers that show how these workers 
experience and negotiate “race” and gender in their daily lives. 

Lewis particularly excels when she discusses the “now you see it, now 
you don’t” quality of discourse about “race” in Britain: “At times, explicit 
connection is made between ‘race’ and the reasons for, and the anticipated 
outcomes of, the policies. At other times such visible connections to ‘race’ 
are elided by referring instead to conditions of social malaise and dep- 
rivation. ... In a context where issues of ‘race’ and racism are cast as 
epiphenomenal, or at most the property of a few ‘bad apples’ and not 
endemic in British society, recourse to class or spatial structure is a means 
of avoiding the embeddedness of ‘race’ and racism in the British social 
formation” (pp. 55-56).

Lewis demonstrates that “race” and racism have indeed been deeply 
embedded in policymaking and social work practice. In analyzing the
accounts of social workers, Lewis draws the reader’s attention to how 
these workers have been constructed as “raced” beings, which provides 
employment opportunities and, at the same time, limits their autonomy 
as professionals. 

There are important strengths in Lewis’s analysis. By connecting na- 
tional policy, local policy, social work theory, and the practice of social 
work, Lewis constructs a historically contextualized and unusually rich 
explication of racial formation. Lewis skillfully weaves a variety of 
theoretical arguments (from psychoanalytic to postcolonial theory) into 
her discussion of her cases, while providing painstaking analyses of the 
policy framework and the accounts offered by social workers. Her close 
reading of these cases provides both a fascinating window into the use- 
fulness of poststructural analyses and a tremendously convincing descrip- 
tion of the processes of racial formation. 

While the book is meant to explore race and gender in social welfare, 
race has clearly been positioned more centrally. Lewis’s analysis is weaker 
in displaying how gender ideologies have centrally shaped policies, social 
work practice, and the experiences of social workers. The analyses might 
have addressed more effectively how race and class ideologies intersect. 
While class has been used as a means of evading “racial” issues, class and 
race often do work together in important ways. However, this insight has 
been generally left unexplored. 

Additionally, the book would have been stronger with an analysis of 
how racialization differs across groups. The sole South Asian social worker 
account presented appeared to differ from the African and African- 
Caribbean accounts but did not allow Lewis to generalize in greater depth. 
Interviewing greater numbers of social workers to explore differences 
between African, African-Caribbean, and South Asian workers might 
have led to even richer and more perceptive analyses of the complexities 
of racialization. 

Perhaps it is quibbling to require more of Lewis when she has provided 
such a rich and important analysis. Lewis brings together a wide array 
of theoretical perspectives and empirical evidence to make a very strong 
and convincing argument. This book deserves a wide readership: it is 
strikingly original in its design and analysis, and it sheds important light 
on social welfare policies and racial formation. 

Wenfang Tang and William L. Parish. Cambridge: Cambridge University
Press, 2000. Pp. xii+388. $64.95 (cloth); $24.95 (paper). 


Thomas B. Gold 
University of California, Berkeley 


This book is a milestone in the sociological study of contemporary China. 
In 1984, William Parish and Martin K. Whyte published Urban Life in 
Contemporary China (University of Chicago Press), a comprehensive 
study that made use of the limited research access foreign social scientists 
had to China only a few years after that country’s reform program began, 
but still relied in the main on official documents and interviews with 
Chinese émigrés to Hong Kong. Now Parish has partnered with a Chinese- 
born, American-educated political scientist to present a penetrating analy- 
sis of several high-quality surveys conducted by Chinese social scientists, 
supplemented with numerous extended research stints in the country. This 
has enabled them not only to write a detailed study of a number of key 
aspects of contemporary urban Chinese life, but also to cast it in a com- 
parative framework with data from former socialist societies of Eastern 
Europe and with Taiwan, a modernized Chinese society that in many 
respects, they argue, offers a glimpse of China’s future. 

The authors examine China’s transition from one type of social contract 
(socialist) to another (market). Each chapter assesses the consequences for 
individual lives of this shifting social contract. The socialist contract prom- 
ised basic living standards as well as numerous other benefits, such as 
full employment and pensions, though at the cost of political freedom. 
The market social contract emphasizes individualism and opportunity, 
doing away with the generalized dependency characteristic of socialism. 

Tang and Parish examine survey data to understand the effects 
of the reforms on the lives of different groups of urban Chinese, their 
attitudes toward these changes, and the best theoretical approach for 
analyzing these. Topics include life chances as seen in access to education 
and jobs, economic rewards, labor-management relations, bureaucratic 
behavior, and political participation. Parish teams up with Sarah Busse 
and James Farrer to analyze the consequences of the reforms on gender 
and work and gender and family, respectively. Each of the book’s chapters 
carefully lays out the issues to be examined, the data used, theoretical 
debates involved, and the analysis and conclusions. The authors earn 
commendation for the exceptional clarity of the writing, explanations of 
figures and tables, and cross-references to related issues elsewhere in the 
book. 

Their overall conclusion is that while there is much evidence to support 
modernization theory’s claims of a homogenization of social life, including 
more proactive citizens inclined to participate in political life, at the same 
time (and this is brought out most forcefully through the extended 
comparison with the Taiwan case) the effects of socialism were so deep as 
to send the modernizing trends into unexpected directions. A path de- 
pendent approach is therefore more appropriate for studying the unique 
set of systematic consequences of reform. The Chinese experience is par- 
ticularly complex, as the “freezing” effect of socialism, in addition to so 
much instability (especially the 1966-76 Cultural Revolution), requires that 
a “geographically layered” approach be used to understand different con- 
sequences for various birth cohorts. The authors are also sensitive to 
regional and gender-based variations. 

The authors take the influential market transition school to task for its 
undertheorized and overly simplistic distinction between redistributors 
and producers. Using data from Doug Guthrie (Dragon in a Three-Piece 
Suit [Princeton University Press, 1999]), as well as from Taiwan, they 
show how market competition often causes the incomes of direct producers 
to fall below those of administrators. The competing bargaining school also 
comes in for criticism for many of the same reasons. 

While sure to become the standard work in the field, and a gold mine 
for comparative research, the rush of events in China means that most 
of these issues already require updating. For instance, the most recent 
nationwide survey used was conducted in early 1992, before Deng Xi- 
aoping’s call for a speed-up of reforms. This proclamation accelerated 
economic growth and the expansion of the private sector. China’s desire 
to enter the World Trade Organization spawned a new push for privatizing 
state enterprises, laying off workers, and dismantling the socialist safety 
net, with deleterious consequences for current and even retired workers. 
The dramatic spread of the Internet and modern telecommunications in 
private hands will have an impact on attitudes toward fairness and ex- 
pression of all kinds of discontent. The religious revival should be con- 
sidered in this context. 

Finally, while the survey data provide a rich source of information 
about changing structures and attitudes, the book lacks an ethnographic 
element. That would have given more life to the individuals being 
sampled. 


The Consumer Revolution in Urban China. Edited by Deborah S. Davis. 
Berkeley and Los Angeles: University of California Press, 2000. Pp. 
xiii+366. $55.00 (cloth); $22.00 (paper). 


Louisa Schein 
Rutgers University 

Since the 1978 institution of market reforms by Deng Xiaoping on the 
heels of China’s Great Proletarian Cultural Revolution, scholars of China, 
along with the global business community, have watched and waited with 
bated breath for the world’s biggest market to begin to behave like one. 
It was not until the 1990s that an explosion of consumption practices 
really began to transform Chinese urban social and economic life in a 
dramatic way. Davis’s collection is one of the first book-length studies of 
Chinese consumption to emerge out of Western sinological circles, a laud- 
able attempt to explore what a consumer transformation might mean for 
Chinese society. Deploying the motif of “revolution,” as signaled in the 
title, the book advances the notion that what is happening in consumption 
is of vast and perhaps subversive political potential, posing a measurable 
threat to the state’s monopoly on power. 

Davis develops the politics of consumption through an explicit social 
formulation: there is “an increasing reliance of urban residents on hori- 
zontal ties of friendship, kinship, or informal sociability that challenge 
the vertical relationship between subject-citizens and state agents” (p. 3). 
In addition to expanded social ties, Davis and many of her contributors 
stress the intensification of personal choice. The book is brimming with 
case studies (of McDonald’s, cigarettes, luxury housing, street food mar- 
kets, greeting cards, bowling alleys, wedding dresses, discos, and more) 
that portray newly liberated flaneurs free to roam the market in search 
of self-actualization. Despite gestures to the inequality that China’s mar- 
ketization has wrought, the overarching narrative is one that celebrates 
the possibilities inherent in the life of consumption. It bears noting from 
the outset, however, that the studies are restricted to cities that have 
populations of 2 million or more, especially Beijing, Shanghai, Nanjing, 
and Xiamen; hence, what is represented are the lives of China’s most elite 
consumers, many of whom, it could be argued, are less busy actualizing 
themselves than they are with demonstrating their status distinction from 
the rest of Chinese society. 

The 12 case studies, authored by sociologists, anthropologists, histori- 
ans, political scientists, business experts, and area specialists, are rich with 
detailed description; some reflect considerable ethnographic depth. Many 
of the chapters are amply illustrated with photographs, cartoons, maps, 
and reproductions of advertisements. Where available, quantitative data 
is offered as well. In addition to overviews of various quests for durable 
goods, the reader gets a sampling of the types of lifestyle pursuits that 
drive these neophyte consumers. Whether they seek oasis from urban 
clamor in luxury housing compounds (Fraser), wile away pleasurable 
hours at McDonald’s (Yan), call in for love/sex advice on telephone hot- 
lines (Erwin), dance in provocative self-display in discos (Farrer), or con- 
solidate business ties through bowling together (Wang), these urbanites 
are seeking something beyond the parameters of their lives under Maoism. 
Indeed, the book would convince us that much of what prompts consumer 
desires is a pointed repudiation of the austerity of the Maoist redistributive 
economy, where consumerism was stigmatized, choice was suppressed, 
and the state controlled the bulk of what was actually consumed. Much 
of the tone of the book extols the novelty of the current climate, implying 
that those most antimarket moments of the Maoist era constitute ground 
zero for the growth of consumerism in China. This argument is compelling, 
but is it the whole picture? Especially in the cities under question, 
consumption flourished before and even well into the post-1949 socialist 
period. That consumers may be harking back to lifestyles they recall from 
the first half of the century, that they may be recuperating consumerist 
skills they remember excelling at in the past, receives little consideration. 

A reticence to delve into the multiplicity of meanings attached to the 
practices of consumption characterizes much of the project. Implying that 
only the advent of global market forces saturating China’s socioeconomic 
order could have wrought such a revolution, many of the articles, par- 
adoxically, slight the symbolic freight of the foreign/modern. Referencing 
only China, they describe commodities as if their significance could be 
thus bounded, ignoring the tremendous cultural capital to be wielded in 
the display of the imported. Refreshing exceptions are found in Yan’s 
treatment of McDonald’s cosmopolitan allure and in Gillette’s interpre- 
tation of Hui Muslim women’s use of Western-style bridal gowns to dis- 
avow ethnic backwardness and demonstrate their mainstream modernity. 

Overall, the book is constrained by its overprivileging of the state- 
society antagonism. That such antagonism itself is overstated, or even 
irrelevant, is suggested by Wank’s textured discussion of the changing 
uses of premium cigarettes by entrepreneurs who continue to need to 
curry favor with officials. And that forms of domination may be found 
outside the state, in the disciplines of style, or in social norms purveyed 
in the so-called public sphere, is suggested by Erwin’s analysis of hotlines 
where callers are subjectified through intimate and ultimately pedagogical 
discussions. These implicit debates are what enliven the book, making it 
of interest to social scientists and to marketing analysts groping for what 
makes the Chinese consumer tick. 


Cadres and Corruption: The Organizational Involution of the Chinese 
Communist Party. By Xiaobo Lu. Stanford, Calif.: Stanford University 
Press, 2000. Pp. xviii+367. $55.00. 


Doug Guthrie 
New York University 


This book is a powerful analysis of one of the most important issues facing 
the Chinese Communist Party in the era of economic reforms. While it 
has become all too fashionable to talk about the problem of corruption 
in Chinese society, few accounts actually attempt to theorize the structural 
causes of the phenomenon. This book represents a serious and rigorous 
treatment of the problem, and it advances our understanding not only of 
corruption, but also of the political and economic changes transforming 
Chinese society. It will be of interest to sociologists, China scholars, and, 
more generally, anyone interested in the challenges faced by countries 
undergoing the transition from planned to market economies. Lu bases 
his arguments on interviews conducted from 1991 to 1996 with party 
cadres and nonparty members and on several textual sources that have 
been outlets for information on corruption over the period of economic 
reforms. 

The analysis of this book begins with a discussion of what corruption 
actually is. Rather than focusing on the high-profile cases that have given 
corruption so much notoriety in China, Lu is interested in what he calls 
“official deviance,” which he defines as “all forms of behavior that deviate 
from the prescribed norms of a regime, in which individuals or groups 
exploit the formal organization instead of working for it” (p. 12). He goes 
on to note that corruption comes in two primary forms—economic and 
noneconomic—and within each of these two categories are three addi- 
tional subtypes of corruption—graft (bribery), rent-seeking (resource ex- 
traction), and prebendalism (perquisites and benefits connected to a public 
office). 

With his definition of the phenomenon clearly laid out, Lu moves on 
to explain the conditions under which corruption in China has emerged. 
Proposing a theory he calls “organizational involution,” Lu argues that 
there is a fundamental contradiction inherent in the routinization of rev- 
olutionary movements. A revolutionary party such as the Chinese Com- 
munist Party is, by definition, one that comes to power through a wave 
of change. Once in power, the revolutionary party takes up the task of 
institutionalizing a modern bureaucracy, which, in theory, should bring 
about impersonal, rational offices and functions. A problem emerges, how- 
ever, when these offices are occupied by the same revolutionary person- 
alities who brought about the change, as these individuals are often more 
committed to the revolutionary goals and the process of change that 
brought them to power than they are to maintaining a strict organizational 
structure. Eventually, Lu argues, the prescribed organizational 
norms (which are the formal, routinized analogues of the revolutionary 
goals) become unglued from the value system of the party members who 
brought about change in the first place. These members continue to act 
through personalistic, informal modes, and organizational deviance be- 
comes inevitable. 

Lu’s analysis spans the half-century of communist party rule in China, 
taking us from the early years, when the revolutionary party set to the 
task of building a communist organizational bureaucracy, to the era of 
economic reforms. In the discussion of the early years, we find especially 
interesting memos and historical documents, which point directly to the 
routinization challenge that Lu places at the center of his analysis. Lu 
systematically takes us through the building of a communist party, which 
includes mundane issues like constructing a salary system and more fundamental 
challenges like party recruitment. The key moment for understanding 
the organizational involution of the party came toward the end 
of the first decade of communist rule, when Mao opted for continuous 
revolution rather than for a path of stable bureaucratic structure. Following 
the Anti-Rightist Campaign of 1957, when many of the party’s 
policies of the early years were questioned, Mao adopted an aggressive 
policy of revolutionary reform as a way of furthering his communist 
legacy. This period made it impossible for revolutionary goals to be rou- 
tinized in organizational norms and practices. 

One problem with the study has to do with the broad definition of 
corruption. Lu’s decision to step beyond the too-often reified categories 
of extreme and notorious corruption to look at everyday deviance tends 
oddly toward the very reification he is trying to avoid—only in this case 
we are reifying the mundane. Under the microscope of Lu’s definition of 
corruption, it is difficult to see what is not corruption. Including preben- 
dalism, for example, which essentially refers to the perquisites and benefits 
one receives from being in public office, is somewhat of a problem for 
Lu’s study, as it lays bare the extent to which he is focused on an 
ideal-typical notion of organizational norms. To include prebendalism in the 
definition of corruption means that any practice that does not exactly fit 
with prescribed organizational goals can be defined as corruption. Is it 
the case in modern bureaucracies anywhere that public officials refuse to 
take advantage of the benefits that have come to be attached to their 
office? If a practice like accepting benefits of public office becomes an 
accepted organizational norm, even if that norm violates our ideal-typical 
notions of the modern organizational form, does it make sense to think 
of it as deviant? Lu does acknowledge (in a footnote) that Weber himself 
did not see prebendalism as deviant or corrupt, though it was more clearly 
associated with traditional rather than modern societies in Weber’s ideal- 
typical schemata. Thus the argument appears to be based on a fairly 
extreme standard about what a revolutionary party—or any modern bu- 
reaucratic state—can effectively achieve in institutionalizing organiza- 
tional goals. 

That said, this book stands as a major contribution to the study of 
corruption in Chinese society. Lu’s analytical approach to the issue sets 
this book apart from other work done on this phenomenon, making it 
essential reading for those who wish to more fully understand China’s 
transition to capitalism. 


Uncivil Movements: The Armed Right Wing and Democracy in Latin 
America. By Leigh A. Payne. Baltimore: Johns Hopkins University Press, 
2000. Pp. xxx+297. $42.50. 


William C. Smith 
University of Miami 


Focusing on the armed right wing and “uncivil” movements in Argentina, 
Brazil, and Nicaragua, this provocative monograph challenges existing 
accounts of democratic transitions while simultaneously interrogating the 
literature on social movements. According to Payne, elite-centric and in- 
stitutionalist perspectives on transitions generally fail to understand that 
in posttransition contexts, strategies of moderation and institutionalization 
may enhance the power of antidemocratic actors. Similarly, the author 
points to weaknesses in approaches that privilege social movements and 
civil society, which miss the mark by implicitly assuming that democratic 
values are inherent in social movement mobilization. 

Correcting the errors in both approaches, chapters 1 and 2 of this book 
set forth an argument with three components. First, armed right-wing 
groups constitute genuine movements that deploy mobilization strategies 
resembling those of more conventional political actors (parties, grassroots 
organizations, interest groups, etc.). Second, the objectives of armed move- 
ments of the right resemble those of their “civil” counterparts: constructing 
collective identities and empowering constituencies that seek greater 
influence in society and the polity. Third, the term uncivil refers to traits 
unique to armed groups—they pursue exclusionary policies and use po- 
litical violence as a “deliberate strategy to eliminate, intimidate, and si- 
lence political adversaries in other movements and within the govern- 
ment.” This violence must be distinguished, the author insists, from that 
which may arise from “civil” movements’ efforts to expand citizenship 
and the scope of democratic rights, to defend themselves against aggres- 
sion, or to practice civil disobedience to challenge an unjust legal order. 
In contrast, the premeditated murders, kidnappings, and coup attempts 
carried out by uncivil movements are acts of violence intended to eliminate 
competition while perpetuating and expanding their own power and 
privileges. 

The empirical analysis in chapters 3, 4, and 5 employs a methodology 
of structured, focused comparison and process tracing to explain the evolution 
of three armed right-wing movements: the Carapintadas or 
“painted faces,” military rebels in Argentina who between 1987 and 1990 
staged four uprisings in a newly democratized regime; the Rural Demo- 
cratic Union (UDR), a right-wing landowners’ group in Brazil created in 
the mid-1980s and implicated in the murder of hundreds of leaders of 
rural social movements; and the Contra (short for contrarevolucionarios), 
founded in Nicaragua in the early 1980s by the remnants of the 
U.S.-supported National Guard and backed by disaffected revolutionaries and 
members of the peasantry and indigenous communities, who waged a 10- 
year civil war against the Sandinista government. 

Each of the armed movements followed different, path dependent, ev- 
olutionary trajectories that were shaped by the repertoire of discourse, 
strategies, and actions of previous authoritarian groups. They also adapted 
and innovated, according to the skill of their leaders and shifts in political 
opportunity structures. This contrapuntal process is traced in each case 
using a variety of conceptual categories—political frames, cultural cues, 
legitimating myths—to explain the successes and failures of mobilization 
strategies and the movements’ relationships with democratic governments. 
A variety of data sources, ranging from open-ended interviews, 
participant observation, and published accounts to newspapers and magazines, 
are used to describe the movements, whose evolution follows a 
spiraling pattern of institutionalization, latency, and disintegration. 

In Argentina, the Carapintadas initially achieved considerable success, 
particularly in their key goal of ending the prosecution of military officers 
for human rights violations and obliging elected presidents to enact broad 
amnesties. The small rebel movement transformed itself, albeit briefly, 
into the nation’s third most popular political party (MODIN) with rep- 
resentatives in congress, and municipal and provincial governments. Nev- 
ertheless, by the late 1990s the Carapintadas had become politically mar- 
ginalized. Similarly, following the transition to civilian rule in Brazil, the 
UDR unleashed a reign of terror in the Brazilian countryside and played 
a major role in drafting the 1988 constitution and subsequent legislation 
that erected significant legal barriers to agrarian reform and social justice 
for rural workers and landless peasants. Following these victories, how- 
ever, the UDR closed down most of its operations. In Nicaragua, the 
Contra war against the Sandinista government ended in a cease-fire. The 
Contras attempted to reinvent themselves as a political party but splin- 
tered into various factions and paramilitary groups after the 1990 elec- 
tions. In all three cases, the uncivil movements eventually disintegrated. 
The question remains: Did they fail? 

This book persuasively concludes that the demise of these groups does 
not necessarily support an optimistic interpretation. To the contrary, these 
Latin American cases lead Payne in chapter 6 to a troubling conclusion 
that she buttresses with comparative references to cases from other 
regions, including the U.S. militia movement, the National Front in 
France led by Jean-Marie Le Pen, and Meir Kahane’s movement in Israel. 
The author demonstrates that even when uncivil movements sow the 
seeds of their own destruction, they can have profound consequences for 
democratic or democratizing societies. In fact, disintegration may actually 
indicate their success rather than failure. As Payne notes, “even when 
uncivil movements disappear, they set the stage for future movements 
that model their political strategies and goals on the successes (and fail- 
ures) of their predecessors. The uncivil movements of today become part 
of the cultural stock upon which future uncivil movements build.” Beyond 
its valuable theoretical contributions and the richness of its empirical 
analysis, this book raises important normative and policy questions about 
the “pathologies of civil society” and the possible responses to the threat 
to democratic values, practices, and institutions represented by uncivil 
movements. 
Elites, Masses, and the Struggle for Democracy in Mexico: A Culturalist 
Approach. By Sara Schatz. Westport, Conn.: Praeger, 2000. Pp. xiii+134. 
$55.00. 


Alan Knight 
Oxford University 


Following the Mexican Revolution of 1910-20, the victorious factions 
crystallized in a catch-all civilian party—eventually the PRI—which, 
while tolerating opposition, effectively monopolized power for 60 years. 
When much of Latin America succumbed to military rule, Mexico retained 
a civilian, “semiauthoritarian” regime; when Latin America democratized, 
Mexico lagged behind, experiencing a “delayed transition” (p. 1), which 
finally culminated in the surprising defeat of the PRI and election of an 
opposition president, Vicente Fox, in July 2000. Schatz’s brief book ex- 
amines Mexico’s transition, plausibly arguing that much of the transition 
literature is inappropriate and suggesting comparisons with other civilian- 
authoritarian, “non-Leninist,” single-party regimes, such as Taiwan and 
Tanzania (pp. 104-10). By way of explanation, the author focuses on the 
role of the Supreme Court and judicial review (chap. 3) and claims to 
adopt a “culturalist approach” (pp. 1, 21). Along the way, she ventures 
numerous “predictions” (e.g., pp. 13-14, 29); hence the reader, aware of 
the remarkable events of 2000, turns the page in anticipation of some 
topical insight, some fleeting sign of political-science prescience. 

Alas, the “culturalist approach” tells us next to nothing. Schatz’s “pre- 
dictions” are, in fact, conventional generalizations derived from unoriginal 
polling surveys, which reveal, inter alia, “the continued legitimacy of the 
regime in a large sector of the population” (p. 29). “Legitimacy,” however, 
is not elucidated, nor is the presumed erosion of the PRI’s legitimacy 
explained. The schematic concluding “prediction” (p. 14) is that “continued 
legitimation of old single party into period of competitive elections sug- 
gests new type of paternalist democracy.” Obviously, Mexico’s voters had 
not read the script. In particular, the “sociocultural” bases of legitimacy 
are merely asserted, and Almond and Verba’s contentious classic, The 
Civic Culture, is not even mentioned. While the comparisons with other 
“non-Leninist” civilian parties are plausible, if brief, the reiterated re- 
buttals of a supposed Western European theoretical template, derived 
from Lipset and Rokkan (pp. 6, 17-8, 26), are superfluous, since few 
scholars would seek to impose that template (Lipset himself dissents; p. 
18). Meanwhile, in stressing the role of the judiciary—a promising line 
of analysis—the author presents an odd notion of the “Anglo-American 
separation of powers” (pp. 12, 16, 23), which confuses the separation of 
powers with the rule of law and imagines that Britain’s non-Constitution 
embodies the separation of powers (Lord Irvine has not read that script). 
The general question of the rule of law is certainly crucial in Mexico, and 
the specific question of the rule of electoral law was crucial in the momentous 
election of July 2000. This, however, had much less to do with 
the Supreme Court—Schatz’s focus—than with the Federal Electoral In- 
stitute (IFE), which scarcely gets a mention. Indeed, the author’s grasp 
of Mexican realities is shaky. Neither the proposed extension of the fran- 
chise to Mexican residents in the United States nor recent demands for 
indigenous autonomy have much to do with the long-term survival or 
decline of the PRI. Historians, too, will recoil from the notion of Carranza 
and Huerta as “postrevolutionary” presidents (p. 58), when both were 
involved in the thick of the revolutionary fray (later [p. 47], the period 
1900-34 is oddly located “before the revolution”). They will also be puzzled 
that 19th-century Mexico vacillated between constitutionalism and “des- 
potic absolutism” (pp. 22-23), since independent Mexico was never, either 
in theory or practice, either despotic or absolutist. The comparative grasp 
is also shaky; fascist Italy is termed a “modern authoritarian socialist 
regime” (p. 24); in 19th-century Western Europe, the franchise was slowly 
extended to include “slaves” (p. 10). 

The style of the book is wordy, leaden, and at times, ungrammatical. 
The poor are the “nonprivileged classes” (p. 13); “improvident focus” on 
military regimes seems to mean an inappropriate one (p. 4); and “political 
actors,” we are told, “take their cases to the Mexican president as a result 
of the postrevolutionary linkage of party and state historically” (p. 55). 
Slips and errors also abound: the state controls “airlines, steel and 
cooper” (for copper; p. 24); the 1997 “pact” in fact refers to 1977 (p. 27); 
and the 1988 officeholders are those of 1998 (p. 83). Proper names are 
frequently mangled: Loeza for Loaeza (pp. 8, 27, 123); Murrillo for Murillo 
(pp. 25, 124); Gadsen for Gadsden (p. 92); and Stephan for both Stephen 
and Stepan (pp. 7, 128). Worst of all, the PRI—the central collective 
actor—is misnamed, as is the PNR of the 1930s (pp. 1, 23, 132). What is 
particularly strange is the frequent verbatim repetition of entire sentences: 
65 words on pp. 21 and 29; 57 words on pp. 46 and 47; 53 words on pp. 
11, 63; and so on (e.g., pp. 64 and 83; 83 and 84; and pp. 12, 56, 77, where 
the same Latin tag is introduced, translated, and cited three separate times. 
Incidentally, the translations differ). It seems that the cut and paste po- 
tential of the word processor has been fully exploited and that the folks 
at Praeger, charging book buyers $.50 per page of text, cannot even afford 
a decent copy editor. 


Origins of Democratic Culture: Printing, Petitions and the Public Sphere 
in Early-Modern England. By David Zaret. Princeton, N.J.: Princeton 
University Press, 2000. Pp. xvi+291. 


Peter Burke 
University of Cambridge 


Nearly 40 years after its original publication in German in 1962, the classic 
study by Jürgen Habermas (The Structural Transformation of the Public 
Sphere [MIT Press, 1989]) continues to arouse controversy and to inspire 
fresh research. David Zaret is among those who believe that Habermas 
dated the rise of the public sphere too late. His argument for backdating 
it to the 17th century, first put forward in an article published in 1992, 
is developed further in Origins of Democratic Culture, which argues that 
a political public sphere emerged in the course of the English Revolution 
of the 1640s. 

Zaret’s argument is presented on two levels. On the more concrete and 
particular level, he begins by surveying the domain of what he calls 
“traditional communicative practice” in early modern England, summa- 
rizing and synthesizing recent work on oral and manuscript communi- 
cation. The author goes on to argue that “restrictive norms of secrecy and 
privilege” that “precluded a public sphere in politics” were disrupted by 
the rise of “the culture of print” in the course of the 17th century, em- 
phasizing the publication of pamphlets, petitions, reports of speeches in 
parliament, and not least, the proliferation of rival news sheets in the 
course of the Civil War. At the more general and theoretical level, Zaret 
presents his work as a mediation between, on the one hand, sociological studies of the rise 
of modernity and democracy, which are insufficiently nourished by history 
and are based on crude assumptions of progress, the rise of the bourgeoisie, 
and so on, and on the other hand, the “revisionism” of the professional 
historians of 17th-century England, who have no taste for theory, deny 
progress, and present the society in which the “English Revolution” oc- 
curred as essentially traditional, based on local bonds between patrons 
and clients. To resolve the conflict, Zaret emphasizes what he calls “the 
paradox of innovation”; in other words, the importance of unintended 
consequences in history, in this case the reshaping of communicative prac- 
tices following the spread of printing. 

Historically well informed, lucidly and persuasively written, and mak- 
ing a skillful synthesis of the general and the particular, Zaret’s book 
deserves to be widely read by historians and sociologists alike. The only 
serious criticism I would make of his basic argument about the public 
sphere is that it is too exclusively concerned with England and with the 
17th century. After all, printing with moveable type antedates the English 
Revolution by about two centuries. Print culture was already affecting 
the debate over the German Reformation in the 1520s, when a prolifer- 
ation of pamphlets and polemical images encouraged the participation of 
ordinary people in the controversies of the time, allowing us to speak of 
the rise of a temporary or “conjunctural” religious public sphere and facing 
the Catholic Church with what might be called the “conservative di- 
lemma.” To answer the critics encourages a debate that in the long run 
undermines traditional authority. However, not to answer them gives the 
impression that they are right. A vivid later illustration of the dilemma 
is the case of Sir Roger L'Estrange, Charles II's censor of the press, who 
was hostile to newspapers but ended up founding one himself. 

Similar temporary public spheres, political as well as religious, emerged 
in the late 16th century in France during the so-called “religious wars” 
and in the Netherlands in the course of their revolt against Spain, as they 
did once again in England both in the 1640s and in the so-called Exclusion 
Crisis of 1679-81. There was no linear progress but rather a zigzag move- 
ment in which later groups were often aware of earlier ones. What trans- 
formed a temporary or conjunctural public sphere into a permanent or 
structural one was the rise of the pamphlet and the newspaper as per- 
manent institutions, first in the Dutch Republic in the 17th century and 
then in Britain after the lapse of the Licensing Act in 1695. In other 
words, I believe that Zaret’s specific arguments about 17th-century Eng- 
land would benefit from being placed in a wider, comparative context. 
As for his bold synthesis between social theory and revisionist history, I 
can only applaud it. 


Fighting Words: Working-Class Formation, Collective Action, and Dis- 
course in Early Nineteenth-Century England. By Marc W. Steinberg. 
Ithaca, N.Y.: Cornell University Press, 1999. Pp. xviii+286. $45.00. 


Philip Harling 
University of Kentucky 


Steinberg effectively carries out the broadest task that he sets for himself 
in this perceptive and carefully researched book: to show the ways in 
which material and discursive processes work together in the shaping of 
consciousness. His subjects are two different groups of workers: the met- 
ropolitan silk weavers of Spitalfields and the cotton spinners of Ashton 
and Stalybridge in industrializing Lancashire. At first glance, these two 
groups had little in common. The silk weavers were quintessential do- 
mestic artisans, while the cotton spinners epitomized factory labor. But 
weavers and spinners faced many of the same pressures in the first three 
decades of the 19th century: the proliferation of sweated outwork, the 
erosion of trade controls and skill differentials, and the intensification of 
the labor process itself. They responded to these pressures in different 
ways, the weavers through an intensive and ultimately unsuccessful effort 
to preserve the protective Spitalfields Acts (repealed in 1824) and the 
spinners through union organization and direct industrial action. 
Nevertheless, the two groups shared a similar critique of the degra- 
dation of their trades and similar discursive strategies for attacking the 
tenets of political economy that provided the rationale for that degra- 
dation. Steinberg’s close examination of the “fighting words” that the 
weavers and spinners exchanged with their opponents finds him at his 
best. The masters of the textile trades spoke in a crude dialect of classical 
political economy, in which deregulation was divinely inspired, price com- 
petition was integral to the nation’s well-being, and job- and skill-shed- 
ding were short-term pains that were bound to yield long-term benefits 
all around. Skilled workers in both the silk and cotton trades offered a 
labor theory of value as a retort to this free-market logic. Workers were 
not mere commodities, they argued, but the source of all value in the 
productive process; it was the master’s moral obligation to give value in 
return for the value generated by his workers; and it was a ruinously 
competitive economic system that was ultimately to blame for the work- 
ers’ plight, as this system was bound to pit the interests of even the most 
benevolent manufacturer against those of his employees. 

Steinberg’s meticulous research shows that these arguments were in- 
tegral to workers’ perceptions of their material conditions. His logic is 
reasonable enough. Indeed, in his effort to seek a via media between 
materialism and poststructuralism, Steinberg has more allies than he 
seems to realize. Despite some of the excesses generated by the “linguistic 
turn” in scholarship over the last quarter-century, most workaday scholars 
are still firmly (if often silently) committed to the idea that material con- 
ditions and discursive practices work together in the shaping of social 
inequality. 

Most scholars are probably also as committed as Steinberg is himself 
to the idea of “class consciousness.” But the debate over when a class 
consciousness developed among British workers is likely to continue until 
the last trumpet sounds. Steinberg’s effort to revive E. P. Thompson’s 
argument for placing it in the first third of the 19th century is bound to 
meet stiff resistance, and for good reason. For what Steinberg so vividly 
describes here is not the emergence of a class consciousness, with the sense 
of united purpose that the term implies, but the desperate efforts of threat- 
ened skilled workers to preserve their privileges. Certainly the weavers 
and spinners were fighting free-market apologists and profit-maximizing 
masters. But they were also fighting a reserve industrial army of unskilled 
male and female workers—the burgeoning “dishonorable” section of their 
trades—that threatened to degrade their status. Throwing vitriol in the 
faces of strikebreakers, as cotton spinners were wont to do from time to 
time, was scarcely a gesture of class solidarity. It was, rather, one of the 
more violent expressions of the destructive power of an unfettered in- 
dustrial capitalism. It would be wrongheaded to fault the weavers and 
spinners for fighting to control access to their trades, to preserve their 
skills and their relatively decent wages, and to protect their patriarchal 
notions of honorable labor. Indeed, perhaps the greatest strength of 
Thompson’s great book, and a considerable strength of Steinberg’s pretty 
good book, is the dignity that they give to their subjects. But one can 
applaud the effort to save the weavers and spinners from what Thompson 
famously called “the enormous condescension of posterity” (The Making 
of the English Working Class [Victor Gollancz, 1963], p. 12) and still balk 
at the notion that these champions of an artisanal, hierarchical conscious- 
ness were the tribunes of working-class solidarity. 


Language Policy and Identity Politics in the United States. By Ronald 
Schmidt, Sr. Philadelphia: Temple University Press, 2000. Pp. xi+282. 
$65.50 (cloth); $21.95 (paper). 


James Crawford 
Silver Spring, Maryland 


Many who are unfamiliar with our “language wars” over the past two 
decades will find in this volume an accessible introduction to the subject. 
It is a clearly written account of the conflict over “bilingualism” that gets 
its facts straight, at least most of the time, a thoughtful critique of extreme 
claims on all sides, and a welcome search for practical solutions rather 
than ideological gratification. Because of this, it is disappointing that 
Schmidt's analysis of the U.S. language policy debate has so little to do 
with the actual course of events. 

What Schmidt calls identity politics—conflict “over the meanings and 
uses of group identity in the public life of the nation-state” (p. 47)—does 
enter into the equation. In countries like Canada and Switzerland, it has 
long dominated deliberations over language policy: To what extent should 
group distinctiveness be sanctioned and national “commonalities” be en- 
shrined? In the United States, however, the notion of group rights has 
rarely figured in our legal traditions. As a result, identity politics has been 
a secondary factor in our conflicts over language. Ethnic minorities have 
not often demanded “symbolic recognition” (p. 52) from the government 
in the form of entitlements to maintain their native tongues. Nor have 
majority-language speakers elevated English into a banner of American 
nationalism, except during times of war or nativism. More typically, our 
language policy debates involve questions of individual rights, for 
example, a student's right to equal educational opportunity, regardless 
of language barriers; a non-English-literate voter’s right of access to the 
ballot; a criminal defendant's right to a court interpreter, if necessary to 
assist in his or her own defense; or a worker’s right to freedom from 
discrimination on the basis of language. 

This is the terrain on which most of our language battles have been 
waged since the early 1980s. The impetus has been a movement to declare 
English the nation’s official language and, more importantly, to restrict 
the use of other languages in public contexts. Such laws have passed in 
23 states and the U.S. House of Representatives. The stated goal has been 
to defend English, “our common bond,” against the “divisive” forces of 
“bilingualism.” More recently, sponsors have championed English-only 
legislation as a way to “empower” immigrants “to achieve the American 
dream.” 


Taken at face value and out of social context, this rhetoric may well 
sound like linguistic nationalism: English as the bedrock of “American- 
ism.” No doubt the movement appeals to some voters on that basis. Its 
real impact, however, is to discriminate and disenfranchise. The English- 
only campaign is less an assertion of identity than a program of exclusion. 
Conversely, its opponents are less concerned with reinforcing language 
differences than with dismantling barriers to minority participation in the 
wider society (for further details, see James Crawford, At War with Di- 
versity: U.S. Language Policy in an Age of Anxiety [Multilingual Matters, 
2000]). 

All of this is well documented in Schmidt's early chapters, the book’s 
strongest, which trace recent language conflicts, describe their sociological 
basis, and explore their antecedents in U.S. history. In the second and 
third sections, “The Arguments” and “Critique and Reform,” the analysis 
becomes tenuous, as Schmidt tries to make the evidence fit a preconceived 
“typology of language policy approaches” (pp. 57-66). Ranging from ex- 
clusion to assimilation to pluralism to confederation, these categories are 
useful in comparing the responses of various countries to language di- 
versity. But imposed on the American experience, they tend to constrain 
rather than explain. 

As a result, Schmidt ends up criticizing pluralists who advocate policies 
of official bilingualism and the maintenance of minority language enclaves 
in the United States. His arguments would be well taken if such a “po- 
sition” existed in the public (i.e., noncampus) arena. It does not, except 
in one very special case: Native Americans’ efforts to save their languages 
from extinction. Similarly, Schmidt elaborates a reasonable assimilationist 
argument, based largely on worries about balkanization, then dissects its 
limitations. Yet in actuality, the English-only movement, an offshoot of 
the immigration restriction lobby, has done nothing to bring minorities 
into the mainstream. Inclusionist rhetoric aside, its stance toward im- 
migrants is best summed up by a longstanding refusal to support even 
modest government subsidies for adult English instruction. 

Having rejected various idealized positions as extremist, in conclusion, 
Schmidt advances a centrist solution. He proposes a language policy of 
“pluralistic integration” (p. 221) that would encourage minorities to ac- 
quire English without eradicating their native languages or violating their 
civil rights and would encourage English speakers to acquire the bilingual 
skills needed in a global economy. Once again, his arguments would be 
well suited to a seminar on theories of language policy. They also happen 
to coincide—in virtually all details—with English Plus, the real-world 
position advanced by opponents of the English-only movement since the 
1980s. 


Between Class and Market: Postwar Unionization in the Capitalist 
Democracies. By Bruce Western. Princeton, N.J.: Princeton University 
Press, 1997. Pp. xx+230. $45.00 (cloth); $17.95 (paper). 


Michael Wallace 
Indiana University 


In recent years, few social scientists have ventured to offer a compre- 
hensive explanation of the sources of union growth and decline among 
the advanced capitalist democracies. Fewer still have provided a rigorous 
sociological treatment of this topic that strays beyond the comforting 
confines of a single country. Bruce Western has transformed the terrain 
of this discussion and provided a new standard for future research on 
unionization with his sweeping comparative study of union growth and 
decline among the 18 OECD countries that comprise the major capitalist 
democracies of the post-World War II era: Australia, Austria, Belgium, 
Canada, Denmark, Finland, France, Germany, Ireland, Italy, Japan, the 
Netherlands, New Zealand, Norway, Sweden, Switzerland, the United 
Kingdom, and the United States. Western provides a masterful blend of 
qualitative and quantitative analysis, rich enough in historical detail, com- 
pelling in its comparative arrangements, with adequate attention to ex- 
ceptional cases that do not neatly fit the prevailing patterns, to craft an 
elegant and convincing argument about the institutional sources of di- 
versity in unionization trends between 1950 and 1989. 

Western frames his analysis around three overarching questions: What 
institutional forces account for the increasing diversity among the ad- 
vanced capitalist democracies in their levels of union density (union mem- 
bers divided by labor force) over the 40-year span? How do conventional 
or popular explanations for unionization fare in competition with the 
institutional account offered by Western? What forces explain the wide- 
spread decline in union density that took place in many countries in the 
1980s, and how do these forces impact the institutional explanation? 

Addressing the first question, Western shows that while these countries 
evinced relatively similar levels of union density in the early 1950s, by 
the end of the period, there were vast differences between high-density 
countries like Sweden, Finland, and Belgium and low-density countries 
like the United States, France, and the Netherlands. He attributes these 
differences in union strength to variations in three institutional features 
of the political economy: dominance by left-leaning, working-class (e.g., 
social democratic) political parties; centralization of labor market insti- 
tutions and collective bargaining; and union-sponsored programs of un- 
employment insurance known as “Ghent” programs, which allow workers 
to continue to be active union members if they become unemployed. These 
three features reduced the incentive of employers to resist unionization 
and provided a solidaristic class basis for union organization. Put differ- 
ently, these three features insulated workers from the vicissitudes of the 
market, which might otherwise retard union growth; for example, in coun- 
tries like the United States, where these features were largely absent, 
unionization failed to grow as fast or even declined. 

Much of the book is spent addressing the second question: Is the in- 
stitutional explanation superior to the leading alternative explanations? 
Western quickly dismisses cultural explanations with some clever com- 
parisons of culturally similar pairs of countries that followed different 
trajectories of unionization (i.e., Sweden and Norway; Belgium and the 
Netherlands; the United States and Canada). The most likely contender, 
business cycle theory, is brushed aside by showing that the impact of 
unemployment and inflation on union growth is largely influenced by the 
institutional variables. Finally, he dismisses “structural” arguments for 
unionization such as manufacturing decline and the gender and age com- 
position of the labor force by showing that countries characterized by 
high levels of labor market centralization and Ghent systems of 
unemployment showed much smaller structurally based differences, indicating 
that these countries were more successful at including workers on the 
fringes of the labor market (i.e., nonmanufacturing workers, the young, 
and women) in unions. 

Finally, Western asks whether the institutional theory stands up well 
in the 1980s, a decade that witnessed the decline of union density in many 
capitalist democracies. Here, Western claims that the pattern of deunion- 
ization is largely a function of globalization and the uncertainties it created 
for many national economies. In some countries, this led to an uncoupling 
of some of the features of labor market centralization and the loss of 
power by working-class political parties. This contributed to a resurgence 
of market forces and an erosion of the class-based foundations of union- 
ization, which was exacerbated in some cases, as in the United States 
under Reagan and the United Kingdom under Thatcher, by vigorous 
counteroffensives against unions. Even so, Western contends, countries 
with working-class party presence, high levels of labor market centrali- 
zation, and Ghent systems fared better during the 1980s than those with- 
out these features. His analysis shows that the increased prevalence of 
international trade and high unemployment tended to reduce levels of 
union density, but the fall was less precipitous for countries already high 
on institutional resources. 

Western’s analysis is provocative, but not without its faults. Some read- 
ers will prefer more historical richness than this type of analysis permits. 
Others might critique the crudeness of measurement of some of the key 
variables (e.g., union decline) or the pedantic style of the quantitative 
analysis. Still others will prefer a fuller treatment of factors such as var- 
iations in the national legislative contexts undergirding union develop- 
ment or various dimensions of globalization. These shortcomings shrink 
in importance in light of the strides Western has made in advancing our 
understanding of union growth and decline. His book will serve as a 
touchstone piece for all those who endeavor to advance this understanding 
further. 


“Designing Women”: Gender and the Architectural Profession. By Anne- 
marie Adams and Peta Tancred. Toronto: University of Toronto Press, 
2000. Pp. xiii+190. $50.00 (cloth); $19.95 (paper). 


Rosemary Crompton 
City University London 


As increasing numbers of women have entered paid employment, the 
range and level of their educational and occupational qualifications have 
improved. There has been a recent and rapid increase in the numbers of 
women in higher-level occupations. Many of these occupations (medicine, 
law, accountancy, etc.) have a recent history of direct and indirect exclu- 
sionary practices against women. These ranged from the restrictions or 
refusals of professional registration, which prevailed during the first half 
of the twentieth century, to the formal and informal gender quotas that 
operated in prestigious schools and academies in the second half. These 
operated in tandem with overtly sexist attitudes and behavior (such as 
the interleaving of medical teaching slides with soft pornography to “liven 
things up for the students”). Although a completely level playing field has 
by no means been achieved, equal opportunities legislation and reforms 
have made such practices the exception rather than the rule, and more 
and more women are going into the professions. 

This is a book about a single profession, architecture, in a single country, 
Canada. It is meticulous in its depth and detail and draws upon a range 
of sources and research methodologies including statistical sources (among 
them, Canadian census data as well as especially-commissioned data gath- 
ering from provincial architectural associations), archival material (mainly 
the journal of the Royal Architectural Institute of Canada [RAIC]), and 
interviews (with 27 female and 10 male deregistered architects). Anne- 
marie Adams is a professor of architecture, and Peta Tancred is a professor 
of sociology. 

The entry of women into previously male-dominated enclaves such as 
architecture raises a number of interesting questions for sociologists. How 
will women, as individuals, fare in these masculine environments, and 
will the professions themselves be transformed as a consequence of female 
entry? In Designing Women the topics of the occupation’s treatment of 
women and women’s impact on the occupation are treated as separate 
issues, whereas arguably, they are different sides of the same coin. Fur- 
thermore, Adams and Tancred suggest that the classic professions are 
themselves inherently “masculine.” This essentialist-leaning operational- 
ization of “gender” treats as “masculine” institutions (such as professions) 
that were molded to the contours of the “male breadwinner” model, in 
which men specialized in market work and women took (unpaid) re- 
sponsibility for care. It is true that such institutions were created and 
maintained largely by men, but to regard the outcome as inherently “mas- 
culine” falls into the trap of treating self (male) definitions as if they are 
“real.” 

For example, the interviews with the deregistered architects suggest 
that the majority of the women changed careers—usually into some less 
demanding, but architecture-related, option such as teaching—because 
they found it impossible to keep going under the combined pressures of 
a demanding family life and high-powered employment. Most had several 
children, and the partners of these women seemed to be largely absent 
as far as caring responsibilities were concerned. In contrast, the deregis- 
tered men were more likely to be seeking individual career and personal 
development. This is a not unexpected gender difference. However, what 
it demonstrates is that working patterns laid down in an era when it was 
assumed that caring would be done by a woman at home are difficult to 
achieve when individuals seek to combine employment and caring. For 
caring is gender coded, rather than “gendered” as such; there is nothing 
inherently “masculine” or “feminine” about care work (or the time it takes). 
It has been demonstrated that men who attempt to combine primary care 
responsibilities with a demanding career face similar difficulties. 

The distinctions between men and women rest not only on the way in 
which caring responsibilities are distributed. Adams and Tancred dem- 
onstrate, in their analysis of the representation of women in the RAIC 
Journal, how women’s professional profiles as architects were “domesti- 
cated” and women were presented as sexual objects, men as power holders. 
They are correct to suggest that the “gendered substructure” of the pro- 
fession will be modified as the gender balance within it changes. Their 
evidence for women’s professional (i.e., employment) “innovation,” how- 
ever, is drawn largely from their interviews with the deregistered archi- 
tects, who, as we have seen, were propelled into innovation largely by 
the pressures of their caring responsibilities. Adams and Tancred’s alter- 
native characterization of the “mommy track” as the “fusion track”—that 
is, a way of working that allows employment and family life to be com- 
bined—should not be allowed to obscure the fact that it remains the case 
that those choosing mommy (or daddy) occupational tracks will not, on 
the whole, achieve the highest positions within their chosen profession. 
This should be a matter of concern for both men and women. 


Temps: The Many Faces of the Changing Workplace. By Jackie Krasas 
Rogers. Ithaca, N.Y.: Cornell University Press, 2000. Pp. 197. $39.95 
(cloth); $15.95 (paper). 


Kirsten Dellinger 
University of Mississippi 


Women dominate the temporary workforce. Jackie Krasas Rogers’s book, 
Temps: The Many Faces of the Changing Workplace, attempts to explain 
why. This book is a comparative case study of temporary employment in 
clerical work and law. Using in-depth interviews with temporary workers 
(and some temporary agency employees) in both fields, and ethnographic 
observations from her own work as a clerical temp, Rogers finds that the 
experience of temp work varies for men and women. 

Being a “temp” is stigmatized in both clerical work and law. In both 
fields, it is considered “women’s work.” Women are supposedly drawn to 
these jobs because of their “flexibility,” which is often assumed to help 
them balance competing work and family demands. But among clerical 
workers, this assumption is inaccurate on two counts. First, clerical temp 
work itself is not flexible; in fact, it is the workers who have to accom- 
modate the inflexible schedules of the temporary agencies and the em- 
ployers. Second, Rogers finds that both male and female clerical temps 
are seeking (and would prefer) permanent employment in order to make 
ends meet. Rogers argues that the belief that women prefer flexible jobs 
naturalizes the low pay and lack of benefits in temporary employment. 

Among the lawyers Rogers interviewed, however, women were more 
likely to say that they sought temporary work to balance work and family. 
None of the men mentioned family responsibilities as a reason for their 
choice of temp work, but she is uncertain whether the stigma of talking 
about these matters in a highly male-dominated field led them to avoid 
the topic. Although doing temporary legal work does not lead to the 
benefits associated with partnership in a law firm, contract attorneys said 
that temp work granted them a higher degree of personal control over 
their work. The high pay per hour, relatively high status, and flexibility 
to work at home made temporary work desirable for men and women. 

Rogers does an excellent job of making visible the fact that the assumed 
ideal worker for temporary jobs among clerical workers and in law is 
raced and gendered. The deference needed in clerical work is associated 
with the ability to accommodate “the interests and desires of men” (p. 75) 
so that the preferred clerical worker turns out to be a white, heterosexual, 
middle-class woman. Although this has been noted in other research, 
Rogers offers particularly disturbing empirical evidence of discrimination 
against women of color. Some temporary agencies complied with com- 
panies’ coded and illegal demands for “no Marias” or “No Kims” (no 
Latinas, no Asians; p. 72), especially at the higher paying front-desk jobs. 
Male temps also face difficulties as clerical temps, as they are seen as 
“defective men” who cannot get “real” jobs. They are also often assumed 
to be gay. Rogers reports that temp work is seen as “beneath” white men 
but “natural” for every other group. She calls for more research on tem- 
porary employment to determine whether a glass escalator to jobs with 
greater autonomy and responsibility exists as supervisors try to push white 
male clerical temps up to positions more in line with societal notions of 
the male breadwinner. 

The qualitative case study approach is well suited to Rogers’s main 
argument that the experiences of workers are shaped by the structure of 
an occupation and larger systems of societal inequality. For example, in 
addition to her focus on gender, in two chapters of the book she provides 
a convincing analysis of how clerical temp workers are controlled by their 
employers and the temporary agencies at various levels (structural, in- 
terpersonal, and discursive) and how workers’ resistance to that control 
is shaped by the nature of their jobs. This is a much-needed addition to 
a literature on resistance and control that often occurs outside a specific 
discussion of workplace context. At her own admission, the data on the 
contract attorneys are not as extensive as the data on the clerical workers. 
Ethnographic observation in this context is missing. As her argument is 
based on the assumption that the structure of work shapes workers’ ex- 
perience of it and reaction to it, I found myself wishing that she had 
allowed us to see more of the interview material that captures workers’ 
daily experiences. The addition of this context could shore up Rogers’s 
less convincing attempt to specify whether the exploitative aspects of temp 
work stem from the structure of a given occupation or from the “tem- 
porary” nature of the job. 

Temps is a welcome addition to the comparative in-depth study of 
occupations and the workers in them. I recommend this book for social 
researchers interested in labor process; issues of control and resistance; 
and gender, race, and class inequality at work. The book provides a more 
complicated view of who loses and who benefits in temporary 
employment. 


Restructuring Gender Relations and Employment: The Decline of the Male 
Breadwinner. Edited by Rosemary Crompton. Oxford: Oxford University 
Press, 1999. Pp. xii+241. $78.00. 


Janet C. Gornick 
City University of New York 


Restructuring Gender Relations and Employment: The Decline of the Male 
Breadwinner presents a theoretical and empirical account of recent 
changes in gender relations, particularly in the restructuring of gendered 
divisions in paid and unpaid labor. The collected chapters share a common 
“gender systems approach,” which—according to editor Rosemary Cromp- 
ton—considers the totality of the relationships between men and women, 
“instantiated through the development of institutions such as educational 
systems, welfare states, employment regimes, family law, religious differ- 
ences, and national cultures” (p. 5). The essence of the gender-systems 
approach is twofold: first, it argues for complex causal explanations of 
gendered outcomes, avoiding economic determinism, and second, it rejects 
gender essentialism, understanding gender relations instead to be socially 
constructed. The contributions in the volume also share an empirical 
framework: the identification of similarities and differences across diverse 
European countries. 

The book has two sections. In the first section, four chapters extend 
the burgeoning literature on the role of national institutions and structures 
in the shaping of gender relations, especially the gendered nature of em- 
ployment. The common subject of these chapters is the erosion of the 
traditional “male-breadwinner/female-carer” model and its replacement 
by a range of alternatives that constitute varying degrees of modification. 
Jane Millar analyzes variation across 16 welfare states in the ways in 
which expectations about family obligations are encoded in law and policy. 
Anne-Lise Ellingsæter focuses on the Scandinavian countries to highlight 
the potential of the state to influence gender egalitarian divisions of labor. 
Ellingsæter argues that, in Norway, policy reforms embedded in a new 
“politics of fatherhood” are encouraging the development of a radically 
transformed gender system: a “worker/carer” society, in which women 
and men symmetrically share employment and caregiving. Birgit Pfau- 
Effinger applies the gender systems approach to explain cross-national 
variation, across Germany, the Netherlands, and Finland, in the mod- 
ernization of practices of motherhood. Finally, Susan Yeandle compares 
Germany, Italy, and the United Kingdom to explore how country-specific 
sectoral mixes, welfare state features, and family structures shape patterns 
of nonstandard employment and gendered divisions of labor. 

In the second section of the book, four additional chapters present 
results from the Gender Relations and Employment project, a five-country 
collaborative study directed by Crompton that assessed the interplay 
among national institutions, gender-role attitudes, domestic divisions of 
labor, and women’s employment in Europe. The study included five var- 
ied European states: one liberal (the United Kingdom), one corporatist 
(France), one social democratic (Norway), and two state-socialist (the 
Czech Republic and Russia). The Gender Relations project considered 
factors at multiple levels: macro- (national), meso- (occupational), and 
micro- (individual). A core microlevel component included conducting 
“biographical work-life interviews” with 154 women doctors and bankers 
across the five countries. This second section of the volume, in contrast 
to the first, emphasizes women’s agency, considering ways in which 
women’s choices—albeit constrained choices—shape their own trajecto- 
ries as well as long-term transformations of gender systems. 

In the initial chapter of this section, Crompton and Fiona Harris use 
national survey data (from the International Social Survey Programme 
[ISSP]), combined with their own interview data, to assess the relationship 
between gender-role attitudes and domestic divisions of labor. In a sub- 
sequent chapter, they interpret their interview results to explore the ori- 
gins, and diversity, of women’s work/family decisions, stressing the nature 
of women’s choices made against the backdrop of high-level occupations 
that impose differing constraints. In a largely theoretical contribution, 
Nicky Le Feuvre links distinct conceptions of gender to predictions about 
the processes and outcomes of occupational feminization. In a final col- 
laborative chapter, Crompton, Le Feuvre, and Gunn Elisabeth Birkelund 
focus on the feminization, and internal gender segregation, of the medical 
professions in their three home countries—the United Kingdom, France, 
and Norway. 

This volume has much to recommend it. First, its adoption of what 
Crompton calls “a position of theoretical pluralism” eschews both struc- 
tural and gender determinism; instead, the contributions paint a complex, 
but coherent, portrait of multiple explanatory factors. Second, the vol- 
ume’s emphasis on women’s work/family choices, in the face of constraint, 
provides a reconciliation between competing perspectives on gender in- 
equality in the labor market, that is, as the result of patriarchal exclusion, 
on the one hand, or the reflection of women’s heterogeneous preferences, 
on the other. Finally, several contributions argue that a gender egalitarian 
future depends upon some reallocation of men’s time—from market to 
home—a key insight that is taking hold in Europe but eluding too many 
American analysts. 

Perhaps the lone shortcoming of the volume is that the cross-national 
comparisons include a sometimes bewildering array of country combi- 
nations—for example, 16 welfare states, five, multiple sets of three, a 
pair—and too often idiosyncratic cases seem to be offered as exemplars 
of welfare state regime types. Nevertheless, the book makes a compelling 
contribution to theoretical and empirical scholarship on the gendered na- 
ture of welfare states, social rights, employment, and caregiving. 


Crime and Social Change in Middle England: Questions of Order in an 
English Town. By Evi Girling, Ian Loader, and Richard Sparks. New 
York: Routledge, 2000. Pp. xiv+211. $27.99. 


Robert J. Sampson 
University of Chicago 


Macclesfield, England, is apparently a bore. A pastoral English town of 
about 50,000 set in the Cheshire countryside 180 miles north of London 
and 15 miles south of Manchester, Macclesfield is remote, relatively pros- 
perous, orderly, and seemingly devoid of big city troubles. It is, in other 
words, an untroubled place—one even the locals concede is boring. What 
about such a place could inspire a study of crime and disorder? Girling, 
Loader, and Sparks turn the tables and argue that precisely because of 
its mundane reality, Macclesfield is a strategic site to study ordinary per- 
ceptions of crime and disorder in an era of rapid social change and glob- 
alization. They focus on “crime talk”—the ways that lay people express 
their fears, anxieties, and passions about crime. Although disorder and 
crime in Macclesfield are rare, at least by American standards, the amount 
of time spent on crime talk is immense. Why is this so? 

The research design is eclectic and multimethod, merging participant 
ethnography by two of the authors with census-like data on social change 
and crime, representations of crime by the local media, focus-group ses- 
sions with residents of varying social class backgrounds, in-depth bio- 
graphical interviews with a small number of longtime Macclesfield resi- 
dents, interviews and group discussions with criminal justice officials and 
other local leaders, and observations of public meetings. To provide a 
community-level comparison, the authors also present some data on Prest- 
bury, a nearby town even more affluent than Macclesfield. The early 
chapters give a detailed description of Macclesfield and its environs, per- 
haps overly so. The “common place” of crime is then explored by jux- 
taposing the official reality of crime as gleaned from a variety of statistics 
with everyday perceptions of crime and disorder. A seeming paradox 
begins to emerge, in that residents talk about Macclesfield as secure and 
fragile at the same time. Global and sinister outside forces are seen as 
penetrating the town, even though trouble is predictably concentrated in 
the town center and council estates, the vast majority of it minor and 
committed by locals. 

Not surprisingly, one of the major sources of perceived disorder is youth 
itself. “Disorder” is mostly teenage behavior that triggers the usual com- 
plaints—for example, rowdy males, drinking, drug use, congregating in 
public, and vandalism. For their part, the teenagers complain of boredom 
and say that adult concerns about disorder are overblown. They want 
some action and feel the town fails to provide it. Interestingly, the police 
tend to agree in large measure. They do not particularly like policing 
teenagers and view much of their effort as futile. The dilemma of how 
to provide order in Macclesfield is thus complex and full of nuances. The 
residents perceive the community as increasingly unable to control local 
youths and the police as abrogating their responsibilities. Yet the 
adult residents are ambivalent about punitive reactions and alternative 
measures of crime control such as video surveillance and private security. 
They also view the job of policing disorder as the domain of the State, 
while asserting “bad parents” as the root of teenage disorder. Concern for 
security, then, seems to be less about physical security against violence 
than about a subjective worry about youth gone awry. In the concluding 
chapter, the authors claim that criminological research treating fear of 
crime as a separate and discrete object of inquiry is “exhausted” (p. 170). 
They mean that the boundaries between disorder and other forms of 
anxieties and concerns, especially age-graded conflict, ultimately collapse. 
Crime and disorder in Macclesfield are largely mundane, part of the ritual 
of intergenerational conflict as played out in local public spaces. The 
concerns voiced by residents reflect larger societal concerns about youth, 
social change, and eroding mores. Precisely because of alleged extralocal 
and global origins (e.g., economic restructuring, youth culture), local au- 
thorities are seen as ill equipped to handle the problem. 

Although impressive on many counts, the book is hampered by a writing 
style heavily addicted to parenthetical statements (very annoying). I 
wished also for more corroborative data on many of the trends and pat- 
terns that dominate the narratives, especially structural and cultural 
changes manifested in local form. Overall, the analytical delivery does 
not live up to the potential of the multiple data sources collected. But in 
the end, there is much to be said for the authors’ choice of site, careful 
attention to the quotidian, and theoretical insight. Macclesfield is boring, 
to be sure, but so are most places. The encounters we read about in 
Macclesfield are telling and shed new light on a number of themes in 
globalization theory, urban studies, and criminology. The Macclesfields 
of America await similar inquiry. The literature is full of studies of big 
city crime, but little is known about the suburbs and towns wherein the 
majority of everyday folks now live. 


On the Origins of Human Emotions: A Sociological Inquiry in the Ev- 
olution of Human Affect. By Jonathan H. Turner. Stanford, Calif.: Stan- 
ford University Press, 2000. Pp. xvit+ 189. $49.50 (cloth); $17.95 (paper). 


Jack Barbalet 
University of Leicester 


On the Origins of Human Emotions is an important book. In it, Turner 
challenges many of sociology’s cherished conventions: that language is 
key to social formation, that culture underpins social processes, and that 
biology is unnecessary in sociological consideration. Against these is the 
forcefully presented contention that “every facet of human endeavour is 
emotional” (p. 119) and that “all emotions ultimately have a biological 
basis because they are built from body systems activating one or more 
primary emotions” (p. 126). 

In summary, Turner says that “language adds much to interaction, but 
by itself it is not particularly effective in generating solidarities . . . [it] 
supplements . . . those body cues that signal states of emotion and affect 
that are more primal and more essential to the flow of an interaction” 
(pp. 25-26). But it is not only the communicative aspect of emotion that 
is important in Turner’s account. He convincingly demonstrates that emo- 
tions underlie attunement of interpersonal responses, social sanctioning, 
moral coding, valuing and exchanging resources, and even rational de- 
cision making (pp. 43-63). Indeed, some of the best discussion in the book 
is in the thoughtful demonstration of the ways in which complex emotions 
are the basis of the range of necessary social processes. In particular, the 
account of variants of primary emotions and their elaborations (pp. 72-83) 
and the application of emotional configurations (pp. 126-53) should be 
read by anyone interested in the emotional underpinnings of social struc- 
ture and process. 

The book is concerned also with how human sociality came about, 
evolutionarily speaking. Apes typically incline to solitary not collective 
existence. While weak ties satisfied the survival needs of gibbons, orang- 
utans, gorillas, and chimpanzees living in arboreal and woodland habitats, 
selection for survival on the African savannah, the environment of early 
hominids, favored sociality and organization. Turner argues that the req- 
uisite capacities were achieved through the evolution of early hominid 
neuroanatomy, in which the development of the primate brain encouraged 
extension of the range of emotions that functioned to promote high levels 
of sociality. This aspect of the argument is less successful than the func- 
tional account of emotions themselves. As the author states, here the 
argument is speculative, a thought experiment (pp. 62, 145, 153). The 
evidence is poor, and the framework is undeveloped and simplistic. 

Two aspects of Turner’s case about the continuing role of emotion in 
social processes draw attention. One of these is the problem of what might 
be called social evolution. Turner develops some very sensible ideas about 
not just the role of particular emotions, such as pride, shame, guilt, sad- 
ness, and so on, in creating and sustaining social bonds, but also con- 
cerning differential intensities of emotions for optimal bond formation 
and reproduction (pp. 128-32). The problem, though, is an ahistoricity of 
approach that throws into clear relief the real difficulties associated with 
the application of principles of natural selection to explain not the function 
but the configuration of human emotions. Norbert Elias (The History of 
Manners [Blackwell, 1978]) and Peter Stearns (American Cool [New York 
University Press, 1994]), for instance, each demonstrate the need to tem- 
porize statements about the functionally optimal level of emotional in- 
tensity and relevant particular emotions for sociality, for the details of 
both of these factors change through broad historical development. 

Another problem is Turner’s claim that in spite of hominid neurological 
evolutionary elaboration, or perhaps because of it, arousal of socially 
effective human emotion requires ritual (pp. 54-57). This assertion is too 
sweeping and is not necessarily true. The complaint here is not that 
social practices are brought in to support a neuroanatomical structure, 
but that Turner does not show how ritual fits in at all. Turner’s source 
is the relevant work of Randall Collins. Collins originally developed his 
argument about interaction ritual chains through a discussion of Durk- 
heim. Indeed, Collins’s account of ritual and emotion assumes Durkhei- 
mian mechanical solidarity, leaving entirely open the question of ritual 
under conditions of organic solidarity. Turner’s appropriation of Collins 
for his own argument leaves us hanging between what he describes as 
“two minds” (p. 45): one reaches for social solidarity (through ritual), the 
other looks back to the solidarity of our ape ancestors. In fact, of course, 
there are not “two minds” in this sense; humans can only seek autonomy 
in the context of social relationships. Ritual has no necessary place in any 
of this. Kemper’s argument (A Social Interactional Theory of Emotions 
[Wiley, 1978]), that differential arrangements of power and status are 
productive of different emotions, leaves the question of ritual open for 
investigation and links biology with emotions directly, although Kemper’s 
biology is hormonal, not neurological. 

On the final page of the book, Turner says that sociology cannot advance 
further in theorizing human emotions without going to “knowledge and 
data from other fields” (p. 154). Presumably this is a reference to limitations 
of constructionist and culturalist accounts of emotion, which depreciate 
or deny the importance of physical processes in emotional experience. 
Turner joins a distinguished number of social science writers that have 
drawn on biology. These include not only Kemper, but also Paul Ekman, 
who has demonstrated the universality of expression of basic emotions, 
and Peter C. Reynolds (On the Evolution of Human Behaviour: The Ar- 
gument from Animals to Man [University of California Press, 1981]), who, 
like Turner, argues that human emotionality is an evolutionary devel- 
opment, not an evolutionary remnant or atavistic vestige. On the Origins 
of Human Emotions will not be the last word on human emotions or their 
evolution; its author did not intend it to be (p. 154). It is, though, a highly 
provocative statement, showing that we risk nothing of our sociological 
imagination by looking not only to emotions, but also to biological science 
in explaining human social processes. 


Religion as a Chain of Memory. By Daniéle Hervieu-Léger. Cambridge: 
Polity Press, 2000. Pp. x+204. 


Steve Bruce 
University of Aberdeen 


Beliefs about the fate of religion depend a great deal on how a person 
identifies religion. Though they need not, those who define religion sub- 
stantively tend to see modernity as secularizing. Those who define religion 
by its social functions suppose the need for religion to be stable and then 
spend a lot of time trying to prove that societies such as the United 
Kingdom or France are “really” religious. Hervieu-Léger is unusual in 
that she tries to find a middle way between both views. 

She first considers the various well-known problems with substantive 
and functional definitions before proposing her alternative: that religion 
be defined not by the substance of beliefs but by the mode of believing. 
“There is no religion without the authority of a tradition being invoked 
in support of the act of believing” (p. 76). 

What follows from this? Hervieu-Léger accepts most of the thrust of 
the classic secularization case: “The dominant religions [have been] mar- 
ginalized to a point where they are no more than picturesque and par- 
asitic” (p. 91). However, as the recent history of Eastern Europe shows, 
religion can still play a part in the production of meaning and in the 
expression of collective aspiration. It would be possible to combine a 
general secularization approach with the claim that religion still matters 
by identifying the circumstances that retard secularizing trends. Hervieu- 
Léger’s approach is slightly different (and that slight difference leads to 
a very different overall tone). She wants to argue that modernity itself 
creates demand for religion. Rationalization, individualism, personal au- 
tonomy, and “the uncertainty that flows from the dynamics of change has 
made the need stronger, as illustrated by the infinite diversity of the 
demand for meaning on the part of both individuals and groups, and the 
imagination-fed solutions arrived at” (p. 93). The problem for religion is 
that the living collective “chain of memory” is destroyed by modernity, 
but modernity creates a persisting demand for tradition and belonging 
that is served by small groups inventing traditions. 

The difficulty for this case is that which has always dogged function- 
alism: it is always possible that circumstances will prevent needs being 
met. As Hervieu-Léger recognizes, modernity may well have created a 
need for religion while at the same time preventing any particular one 
from becoming plausible for more than a handful of people. The problem 
can be seen in our fondness for community. It is not hard to argue that 
the world would be a better place if we could restore a sense of community. 
Almost everyone is in favor of it, but few of us are willing to forgo our 
individual autonomy. We want community on our terms, which, of course, 
defeats the exercise. Modernity may create needs, but needs may remain 
unmet. 

This is an interesting but frustrating book. It is not helped by the 
translation, which may be faithful to the original but too often constructs 
sentences in which abstractions are presented as acting agents. It may be 
a matter of Gallic style but the argument is presented as overcoming a 
series of obstacles to seeing religion in a particular way rather than as a 
series of reasons why the evidence should lead us in that direction. One 
is left with the sense that Hervieu-Léger wants to believe that religion 
remains a powerful force in modern Europe, despite her own evidence 
to the contrary. This may reflect my Anglo-Saxon prejudice against ethe- 
real theorizing, but I repeatedly found myself voicing two reservations 
about the project. First, I wondered about the effort put into the defi- 
nitional arguments. Of course, substantive definitions of religion have 
their drawbacks, but they seem perfectly adequate as a foundation for 
interesting studies of the real world. Hervieu-Léger’s alternative runs the 
risk of all functional definitions: it establishes by fiat what should be 
demonstrated empirically. The relative absence of evidence is my second 
frustration. In particular, I wanted to see the central empirical claim 
tested. If it is the case that modernity has a paradoxical effect on religion, 
both eroding the conditions for its reproduction and stimulating a need 
for it, then we should see the greatest demand for religion among the 
most secularized sections of the most secular societies. If not, there seems 
no reason to abandon the more conventional views of secularization. And 
just how many people are yearning for a sense of an invented tradition? 
In any event, this is an interesting book that deserves a wider audience, 
if only for its detailed consideration of the relative merits of competing 
definitions of religion. 


Religion and the New Immigrants: Continuities and Adaptations in 
Immigrant Congregations. By Helen Rose Ebaugh and Janet Saltzman 
Chafetz. Walnut Creek, Calif.: AltaMira Press, 2000. Pp. 492. $69.00. 


Peter Beyer 
University of Ottawa 


Responding in part to a gap in the extensive literature on immigrants 
and ethnicity in the contemporary United States, this volume focuses 
specifically on the religion of recent arrivals. Ebaugh and Chafetz make 
their basic hypothesis clear: religious institutions today, as in the past, 
play a major role in allowing new immigrants to make the transition to 
their new location. They make it possible to recreate and adapt the eth- 
nocultural ambiance of the countries of origin in a socially supportive and 
religiously secure environment. As such, these institutions permit a broad 
range of adaptive strategies. They also present characteristic challenges. 

Ebaugh and Chafetz consciously build on the relatively little research 
that has been done on this subject. Most of this literature is historical or 
concentrates on specific ethnic or religious groups and therefore does not 
permit the controlled comparison that is at the heart of this project. Among 
the more salient influences on this book are R. Stephen Warner and Judith 
Wittner’s Gatherings in Diaspora (Temple University Press, 1998), as con- 
cerns the comparative focus on immigrant religious groups, and Nancy 
Ammerman’s Congregations and Community (Rutgers University Press, 
1997), with respect to analysis of religious organizations within their local 
contexts. Ebaugh and Chafetz, however, advance research significantly 
by applying a multifaceted and standardized research protocol to a diverse 
set of groups within a delimited geographical area, namely Houston, 
Texas. On the basis of a broad examination of the immigrant religious 
situation in this metropolitan area, the authors chose representative sites 
such as Vietnamese Buddhist, Chinese Buddhist, Chinese Protestant, His- 
panic Catholic and Protestant, South Asian Muslim, South Indian Hindu, 
and mixed-ethnic congregations. 

The organization of the book corresponds to the structure and aims of 
the overall project. After two introductory chapters, 13 chapters under 
the principal authorship of the different site directors report the empirical 
findings under a consistent set of headings such as theology and ritual, 
organizational structure, ethnicity, the second generation, and the role of 
women. A series of thematic chapters by the principal authors then focuses 
on comparative interpretation. 

Among the many interesting observations that Ebaugh and Chafetz 
make are the following: organizations that represent religious majorities 
in their homelands but religious minorities in Houston, such as Muslim 
or Hindu groups, have a comparatively difficult time attracting fellow 
ethnics not already within the fold, but they are relatively more attractive 
to native-born Americans seeking an alternative. By contrast, organiza- 
tions that move from minority to majority status, such as Chinese or Latin 
American Protestant churches, experience the reverse. Correspondingly, 
the former groups have somewhat more difficulty retaining the American- 
raised and second generations. Responding to the different demands of 
these young people is, however, a general problem because the ethno- 
cultural ambiance that these organizations provide for the founding gen- 
eration is exactly what the children find off-putting. Another consistency 
is the dominance of congregational structure, including self-financing, 
heavy lay involvement, and sometimes lay leadership. One of the more 
consistent dilemmas concerns the issue of internal linguistic/ethnocultural 
differences. Accommodating these usually hurts the unity of the organi- 
zation, but failure to respond to them invites schism. Complicating matters 
is the fact that the religions themselves all put theological stress on a 
wider transcultural unity. Finally, although generally restricted to im- 
portant but traditionally patriarchal gender roles, women in these groups 
are in several cases seeking a greater say, but none of them is led by 
women. 

Overall, Ebaugh and Chafetz confirm tendencies found in other work 
in this area, above all the strong continuity between immigrant religious 
organizations of the past and the present, but also their great variety as 
concerns structure, history, composition, and degree of adaptation. The 
authors do, however, caution that it is probably too early to tell how much 
past patterns will be predictors for the future. The principal factors here 
are the intensification of transnational ties in comparison with the past 
and the possibility of continued significant influx of immigrants that may 
prevent generational and acculturation factors from becoming determi- 
native. In this regard, this research is only the first part of a larger en- 
deavor that will in its next phase engage in multisite research to gauge 
the role and importance of transnational factors. Perhaps in reporting that 
phase, the authors will be able to correct a weakness of this work: its 
ultimately too narrowly American view. Immigrant religions exist in many 
other countries. A comparison with these could shed much light on spe- 
cifically what is, if anything, American in these stories, and more generally 
on the complex relations between religious adaptation and sociocultural 
context. 

That small point aside, this book makes an important contribution to 
its field and should be required reading for anyone working or interested 
in the area. By its own admission, much work remains to be done. Even 
when this arrives, however, Ebaugh and Chafetz still will have supplied 
a significant component and will have helped demonstrate the indepen- 
dent role of religion in the immigrant experience. 


Bearing Witness: Readers, Writers, and the Novel in Nigeria. By Wendy 
Griswold. Princeton, N.J.: Princeton University Press, 2000. Pp. 
xxiv+340. $59.50 (cloth); $24.95 (paper). 


Vera Zolberg 
New School for Social Research 


A rare foray into a domain that had been terra incognita, this study of 
the literary life of a “new nation” in Africa by an American sociologist 
would for that reason alone be worthy of attention. But Wendy Griswold’s 
new book, Bearing Witness, a study of the Nigerian colonial and post- 
colonial literary world of the novel, would be worthy of concentrated 
reflection by social scientists and other scholars even if it were not so 
unusual. 

A number of African countries have given rise to artistic forms, of 
which an important literary oeuvre has come to Euro-American attention. 
Not the least of these is Nigeria, known for the internationally acclaimed 
writers Chinua Achebe and Nobel Prize winner Wole Soyinka. In ad- 
dition to these outstanding, sophisticated creations, Griswold examines 
the purely entertaining romance or action genres. Given her experience 
with the sociology of literature (Renaissance Revivals: City Comedy and 
Revenge Tragedy in the London Theatre, 1576-1980 [University of Chi- 
cago Press, 1986]), we might expect much of her new contribution. We 
are not disappointed. 

As in a majority of new African states constructed willy-nilly by Eu- 
ropean imperialism, local rivalries, regional, ethnic, and religious cleav- 
ages have provided the bases for contestation. In Nigeria, they brought 
about a bitter civil war and continuing tensions among the unevenly 
Westernized regions. Potential wealth has fomented corruption by many 
politicians while producing misery for ordinary people of the 250-400 
ethnic groups in a cobbled-together nation-state of between 85 and over 
100 million inhabitants. Nevertheless, as Wendy Griswold shows in her 
superb study of the world of the Nigerian novel, Nigerian writers and 
readers have not given up on the rationality and hope that they associate 
with modernity. 

Having succinctly provided the historical and social context, the char- 
acter of the genre in question, and her analytic approach in the first 
chapter, she carries out her program in two densely packed chapters. 
Chapter 2 lays out “The Nigerian Fiction Complex” (novels, writers, the 
business, the readers), and chapter 3 focuses on the thematic content of 
Nigerian novel genres (village and city, women and men, pen and sword, 
crime and politics). She analyzes the literary works of political writers 
and interviews many of them who, surprisingly, rarely expressed concern 
about censorship, in spite of the persecution, forced exile, imprisonment, 
and most poignantly, in the case of Ken Saro-Wiwa, execution. Finally, 
chapter 4, “Capturing the Past and Inventing the Future” is devoted to 
an account of the literary world of this “giant of Africa” and its limitations 
in a world in which electronic media are rapidly outpacing print. 

Griswold helps us understand how and why the aspiration to success 
by sales of intellectuals and integrated professional writers, publishers 
(English and local Nigerian), and critics is unrealistic. Beyond her analysis 
of the production of this culture, she considers themes that recur in many 
of the works: bride price and family pressure as sources of tension between 
tradition and modernity (pp. 39-40, 68); the dominance of patriarchy and 
a general belief that women must prove their worth through reproduction, 
even at the cost of accepting polygamous marriages; and the seeming 
inevitability of sexual harassment in school and work. These writers see 
themselves as “bearing witness” to the changes, modernization, and na- 
tional unity in spite of the cleavages that plague their country. Never- 
theless, there are puzzling gaps in the content of the novels on which she 
reports and, perhaps, in the interviews she conducted. There is no mention 
of specific problems such as female genital mutilation; the increasing 
spread of HIV-AIDS; and human rights issues, many of them centered 
on the environmental and human impact of the petroleum industry. Their 
lack of attention to these concrete issues would help to explain why so 
many of the novelists in the country itself are unconcerned with 
censorship. 

Regardless of what their books touch on, their impact is not helped by 
the fact that novelists, already highly unrepresentative of their nations’ 
people (only 15% are female, mostly well educated, southern, especially 
Igbo, and Christian; pp. 41-42), increasingly have little in common with 
their potential readers. Almost regardless of genre, Nigerian readers are 
a relatively nonexpanding proportion of the population because of de- 
clining access to elementary and secondary education as oil prices col- 
lapsed and governmental prestige spending for higher educational insti- 
tutions rose (p. 45). Those Nigerians who read are more likely to buy 
foreign books, while most buyers of Nigerian novels are foreigners (p. 81) 
seeking the exotic, which to local writers is merely everyday matter (p. 
84). Griswold’s vivid description of the impoverished book publishing 
and book retailing industry helps to explain why writers are obliged to 
hold down other jobs in order to support their writing habit. 

Griswold tackles fundamental questions concerning the nature of the 
novel as an African form, incorporating cultural structures of Nigerian 
narrative forms and Western literary conventions (pp. 14-16). She ana- 
lyzes the conflicting positions of universalists (who see African works as 
human documents) as opposed to traditionalists (supporters of such 
essentialist idea systems as négritude), or both of these in opposition to 
neo-Marxists who privilege the production base of the arts. For those deeply 
committed to any one of these positions, Griswold’s work will, because 
of its very equity, be unacceptable. That would be unfortunate, because 
they would be missing a rare exposition of these positions in a variety of 
orientations that have risen or declined in acceptance: modernization the- 
ory and hegemony theory, among others. Proponents of some of these 
approaches will be put off by Griswold’s adherence to what she terms 
“provisional, provincial positivism” (p. 299 n.36), but if they stop there it 
will be their loss. It is rare to find an American scholar so committed to 
and successful in melding the approaches of the social sciences and the 
humanities as sensitively as she does. 


Music in Everyday Life. By Tia DeNora. Cambridge: Cambridge Uni- 
versity Press, 2000. Pp. xiv+181. $54.95 (cloth); $19.95 (paper). 


Daniel Thomas Cook 
University of Illinois, Urbana-Champaign 


An important test of any idea or argument is not just how well it con- 
tributes to “the literature,” but also whether the author invites a different 
way of seeing and, in this case, a new way of hearing the world. On this 
measure, Tia DeNora's Music in Everyday Life successfully ushers the 
reader onto the soundstage of social encounters, opening up the acoustic- 
musical realm for both investigation and appreciation. Original in con- 
ception and based on years of research, this book offers interpretive and 
cultural sociologists a novel point of entrée into the study of embodied, 
lived meaning. 

Given the title; the book’s cover showing naturalistic, half-posed pho- 
tographs of people wearing earphones; and the opening vignette discussing 
the author’s encounter in the field, one reasonably expects a text based 
on thick descriptions of the uses and meanings of music in social context. 
Instead, she presents a series of separate ethnographic studies in an effort 
to “document some of the many uses to which music is and can be put, 
and to describe a range of strategies through which music is mobilized 
as a resource for producing the scenes, routines, assumptions and occasions 
that constitute ‘social life’” (p. xi). DeNora essentially builds something 
of a grounded theory of music in “everyday” life from slices of apparently 
unconnected lives. 

The first chapter serves as both a critique of approaches to music in 
the totalizing, “grand tradition” of Adorno and as a miniprimer 
demonstrating how music exists in daily life. Contra Adorno, and drawing upon 
Paul Willis, she offers a number of examples from her research (e.g., music 
on airplanes, as a form of therapy, in karaoke bars) to introduce the twin, 
key ideas that music is, at once, a force in its own right and a resource 
for personal agency. In chapter 2, DeNora proposes a theory of musical 
affect as practice, arguing that music cannot be “read” out of context as 
semioticians tend to do. She drives home the point repeatedly that music’s 
meanings are not immanent in lyrics, notes, or rhythms but arise from 
the actions of users. 

In the remaining chapters, DeNora builds her core understandings of 
“music’s power” from interviews and observations. Most elementary is 
how music figures in the construction of self. In chapter 3, she draws out 
how music is used as a form of self-programming to assist in relaxation, 
as an aid to memory, to set a mood for self and others, to help in con- 
centration, and as a way to create and sustain social identities. For in- 
stance, an interviewee relates how she motivates herself for an evening 
out on the town by the kind of music she plays as she readies herself for 
the evening (p. 54). Here, music helps prepare a self for an anticipated, 
future situation. 

DeNora rightly moves away from embracing the view of a wholly self- 
authoring self and toward the more slippery, but necessary, terrain of less- 
than-conscious motivations and effects regarding music. Her treatment 
of music and the body in chapter 4 is the keystone of this work where 
she introduces the idea of entrainment: “the alignment or integration of 
bodily features with some recurrent features in the environment” (pp. 
77-78). Musical entrainment identifies the ways in which musical em- 
bodiment is at work in such things as foot tapping, finger snapping, and 
head nodding, as well as in the way work songs encourage coordinated 
movement. DeNora further illustrates entrainment with her research on 
how aerobics class participants and instructors understand and use music 
as a device of social-bodily ordering. Both insightful and overdrawn, the 
aerobics class becomes the quintessential example of her theory (she re- 
turns to it many times) despite it being a highly planned and structured 
environment, absent from the “everyday life” of many. 

The social ordering theme continues in chapter 5 where she discusses 
the ways in which women negotiate the creation of an atmosphere for 
intimate encounters with their partners, illustrating how music is “im- 
plicated in what [actors] do and how it may structure what can be done” 
(p. 120). This point is revisited from a different angle in her interesting 
study of retail settings, where interviews with shoppers, store managers, 
and others draw attention to music’s role in the temporal framing of the 
act of shopping. 

The jumping from one setting and example to another, the almost 
exclusive focus on women’s experiences (almost no man is represented in 
her examples), and the lack of social diversity in class, age, and ethnicity 
among those observed or interviewed lead one to ask, whose everyday 
life is at issue? Nevertheless, DeNora clearly has a sympathetic eye and 
a sharp ear, which makes her effort to map out the place of music in 
social relations a useful point of departure. Her work, like the music she 
studies, is a resource. 


Fuzzy-Set Social Science. By Charles C. Ragin. Chicago: University of 
Chicago Press. Pp. xviii+352. $48.00 (cloth); $20.00 (paper). 


Edgar Kiser 
University of Washington 


Significant methodological innovations are very rare in the social sciences. 
Many scholars employ existing methods in useful ways, but few import 
new approaches from other fields. Charles Ragin does just that. For the 
past 15 years or so, Ragin has been developing methods for the analysis 
of complex causal structures in studies with midsized Ns (between about 
5 and 50 cases). In part because we lack appropriate methods to analyze 
data sets with midsized Ns, scholars tend to avoid this work in favor of 
either qualitative analyses of a few cases or quantitative work on a very 
large number of cases. In short, Ragin has been trying to develop methods 
that bridge the unfortunately large gap between qualitative case studies 
and quantitative statistical analyses. 

His method of qualitative comparative analysis (QCA; developed in The 
Comparative Method [University of California Press, 1987]) used Boolean 
algebra to discover necessary and sufficient conditions for producing 
particular outcomes. Although it has been useful in a fair amount of 
comparative-historical work, like many novel approaches, QCA has some 
serious flaws. The two most significant are the necessity to dichotomize 
all variables and the deterministic nature of the method (it is not based 
on probability theory and lacks any kind of significance test). In large 
part, his new book is an attempt to address these shortcomings while 
retaining the key features that made his approach unique. 

Ragin's new method is based on the theory of fuzzy sets. In set theory, 
variables are conceived of in terms of set membership, and relationships 
between variables are modeled as the intersections of sets. In Ragin’s 
earlier QCA, the sets were crisp, thus the variables were dichotomous 
(i.e., a country was either democratic or not, capitalist or not). The problem 
with using crisp sets is their procrustean quality, making it difficult to 
deal with cases that are somewhat democratic or somewhat capitalist. In 
quantitative statistical analysis, this issue is normally addressed by in- 
creasing the number of sets (moving to interval scale measurements), and 
miscategorizations are captured in the error term of the regression equa- 
tion. Since fuzzy sets are not based on probability theory, Ragin takes a 
different tack. Using detailed knowledge about the cases, the analyst 
determines whether cases are either fully in or fully out of the set or 
whether they fall somewhere in between. The latter’s relationship to the 
set is quantified precisely—for example, if full inclusion is “1” and exclu- 
sion is “0,” cases are depicted as .7 democratic or .86 capitalist. The use 
of fuzzy sets thus combines qualitative distinctions (cases that are fully 
in or fully out of sets) with quantitative distinctions indicating interme- 
diate degrees of membership. 
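
The scoring scheme described above can be made concrete with a small sketch. The case names and scores below are hypothetical, and the rule that membership in an intersection is the minimum of the component memberships is the conventional fuzzy-set definition, assumed here rather than taken from the review.

```python
# Hypothetical membership scores in [0, 1]: 1.0 = fully in, 0.0 = fully out.
democratic = {"Case A": 1.0, "Case B": 0.7, "Case C": 0.0}
capitalist = {"Case A": 0.86, "Case B": 1.0, "Case C": 0.3}


def intersection(set_a, set_b):
    """Membership in 'A and B', using the conventional minimum rule (an assumption)."""
    return {case: min(set_a[case], set_b[case]) for case in set_a}


def is_qualitative_anchor(score):
    """True when a case is fully in or fully out, i.e., a crisp-set value."""
    return score in (0.0, 1.0)


democratic_and_capitalist = intersection(democratic, capitalist)
for case, score in democratic_and_capitalist.items():
    print(case, score, "crisp" if is_qualitative_anchor(score) else "fuzzy")
```

A crisp-set analysis is the special case in which every score is a qualitative anchor, so the same function covers the earlier dichotomous approach.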

One of the virtues of his approach is that it allows for a tight relationship 
between verbal and quantitative expressions. The analyst can specify that 
“predominantly” capitalist refers to cases at .8 or above and “somewhat” 
capitalist refers to those between .6 and .79. A potential problem with 
the fuzzy-set approach is that the analyst is required to characterize set 
membership very precisely. First, this means that his method requires 
extremely precise data and thus will be difficult to apply to cases in which 
data are fragmentary or of poor quality. Second, it requires the analyst 
to know a great deal about each of the cases, enough to distinguish be- 
tween those that are .7 and those that are .8 democratic, for example. 
This is difficult enough to do in small N studies, but the problem is greatly 
magnified when detailed knowledge is required for 30-50 cases. It is 
clearly not easy to do fuzzy-set social science. 
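
The link between verbal and quantitative expressions mentioned in this paragraph amounts to a lookup from score to label. A minimal sketch follows, using only the two cutoffs quoted in the review; the function name is hypothetical, and scores outside those ranges are left unlabeled because the review gives no term for them.

```python
def verbal_label(score):
    """Translate a membership score into the verbal qualifier quoted in the review."""
    if not 0.0 <= score <= 1.0:
        raise ValueError("membership scores must lie between 0 and 1")
    if score >= 0.8:
        return "predominantly"
    if score >= 0.6:
        return "somewhat"
    return None  # no verbal label is given in the review for lower scores


for score in (0.86, 0.7, 0.3):
    print(score, verbal_label(score))
```

The sketch also shows where the burden falls: the function is trivial, but the analyst must justify why a given case is scored .7 rather than .8, since that choice changes its label.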

What are the costs and benefits of using the fuzzy-set approach instead 
of existing alternatives (such as logit regression) for studies with midsized 
Ns? For example, could we simply use interval instead of categorical 
measures to capture the “fuzzy” part and interaction terms to address 
conjunctural causation? Ragin's answer to the first is that interval mea- 
sures cannot adequately capture the qualitative part of fuzzy sets. For 
many theories, it is only important to know if something is in a set or 
not; quantitative variations among entities that are fully within the set 
do not matter. The question of whether regression models with interaction 
terms can be used to capture conjunctural causation seems to come down 
to an issue of degrees of freedom, but in fact, it rests primarily on Ragin’s 
ontological assumption that most causation is based on complex multiway 
interactions. If he is right about that, then he is also right that regression 
models will not be able to capture this type of causation without ex- 
hausting degrees of freedom. 
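
The degrees-of-freedom point can be illustrated with simple arithmetic. The sketch below assumes dichotomous causal conditions and a fully interacted (saturated) model, in which k conditions require 2^k parameters including the intercept; both assumptions are for illustration and are not stated in the review.

```python
def saturated_terms(k):
    """Parameters in a fully interacted model of k dichotomous conditions (intercept included)."""
    return 2 ** k


def max_conditions(n_cases):
    """Largest k whose saturated model leaves at least one residual degree of freedom."""
    k = 0
    while saturated_terms(k + 1) < n_cases:
        k += 1
    return k


for n_cases in (5, 20, 50):
    print(n_cases, "cases support at most", max_conditions(n_cases), "fully interacted conditions")
```

Under these assumptions, even the upper end of the midsized range (about 50 cases) supports only five fully interacted conditions, which is why the argument turns on how common complex multiway interactions really are.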

This book is ambitious and innovative and will thus be of interest to 
a wide range of methodologists and historical sociologists. However, in 
spite of the fact that Ragin is probably right about the limitations of 
existing methods for analyzing midsized Ns, the odds are against the 
widespread adoption of his fuzzy-set alternative. My guess is that most 
statistical methodologists will find it no more palatable than QCA (fuzzy- 
set theory is not popular among statisticians, to say the least) and that 
most historical sociologists will discover that they lack either the data or 
the expertise to use it. Like QCA, fuzzy-set social science will probably 
be useful in some settings, and it will certainly force all of us to think 
further about some unresolved methodological issues. 


American Journal of Sociology 


Institutional Change and Healthcare Organizations: From Professional 
Dominance to Managed Care. By W. Richard Scott, Martin Ruef, Peter 
J. Mendel, and Carol A. Caronna. Chicago: University of Chicago Press, 
2000. Pp. xxv+426. $58.00 (cloth); $25.00 (paper). 


Donald W. Light 
University of Medicine and Dentistry of New Jersey 


This much-anticipated book provides a rare analysis of the ways in which 
organizational sets within an organizational field affect each other’s per- 
formance as changes occurred over 50 years in their resource and insti- 
tutional environment. Because this analysis was done for several organ- 
izational sets within one region, and because it considered endogenous 
and exogenous changes over three historical periods, it contributes to the 
fields of community and organizational ecology, organizational demog- 
raphy, and profound institutional change. Its chapters reflect key socio- 
logical issues: ecological processes shaping organizational change, the ef- 
fects of resource environments on organizational dynamics, changing 
institutional environments and organizational legitimacy, forms of organ- 
izational integration, how field-level changes affect organizational pop- 
ulations, and the structuration processes of profound institutional change. 
This book, together with excellent references and appendixes, will provide 
for graduate students as well as faculty a fount of new research ideas and 
sociological insights that will endure far longer than the binding, which 
came apart after one gentle reading. 

Scott has long benefited from his early discovery that the structured 
complexities and rapid changes in health care provide a rich arena for 
sociological analysis. Particularly intriguing is the ecological destabiliza- 
tion documented in health care. “How was it that this stable, professionally 
dominated complex of institutionalized arrangements came apart?” (p. 
xvii). How did some actors lose their dominance and legitimacy, and how 
did new claimants acquire theirs? Thus the book is a tragedy as well as 
a sociological treatise, underplayed until its conclusion: “Governance 
structures have become much more fragmented. . . . The coherence of 
organizational boundaries has been greatly reduced. . . . Practitioners and 
patients alike are confused. . . . Consensus about institutional logics has 
been reduced” (p. 360). When this happens to an institutional field, one 
gets “disagreements and disputations over the priorities and goals of the 
sector and lack of agreement on the appropriate means to be employed 
in reaching them” (p. 359). 

To carry out this analysis, the research team assembled a unique con- 
stellation of data sets on several kinds of providers, purchasers, inter- 
mediaries (such as insurance companies and health plans), and govern- 
mental bodies. To examine organizational events, such as births, deaths, 
morbidities, and transformations, the researchers invented many new 
ways to measure theoretical variables and applied them across three 
historical periods: the era of professional dominance (1945-65); the era of 
federal involvement (1966-82) with the passage of Medicare and Medi- 
caid; and the era of managerial control and market mechanisms (1983 to 
the present), which began with the government's Prospective Payment 
System, deregulation, and a reliance on market forces. Interspersed in 
this largely quantitative analysis of events are case histories of particular 
organizations in each historical period. And framing the organizational 
changes is an important contribution to “profound institutional change”: 
new governance structures and mechanisms, discontinuous and new log- 
ics, new actors and new relations among actors, and blurred boundaries 
of both the population and the organizational field. 

While the researchers document the organizational changes and their 
interactions with major institutional change, they are not able to explain 
how and why the institutionally entrenched era of professional dominance 
fell apart and why the new era of fragmentation, disruption, and confusion 
has occurred. A field theory of countervailing powers would have helped 
to explain how the very dominance of the medical profession produced 
its own pathologies and provoked other major powers, such as the payers 
and government, to restructure the financing and terms of health care. 
This would, in turn, have reconceptualized the institutional dynamics of 
the three eras. For example, while the measures used of professional dom- 
inance declined, the organized profession, as well as investor-owned chains 
that predated 1965, made sure that Medicare had built into it a number 
of provisions that locked in professional prerogatives and risk-free cor- 
porate profits, so that at least the first half of the “era of federal involve- 
ment” was more accurately the “era of the federal feast.” This spurred 
countervailing reactions that shaped the current era. 

One concern raised by the book is the uncritical use of commercial and 
political inventions, such as “health maintenance organizations” and “in- 
tegrated health care systems,” which are designed as good ad copy. Small 
surprise, then, that “HMOs” exhibit “such internal diversity” that re- 
searchers can only measure certain dimensions, or that the term “inte- 
grated healthcare systems is virtually impossible to define” (p. 356). These 
commercial and political terms are then reified into second-order con- 
structions of first-order constructions of reality, such as organizational 
sets, populations, and fields, each with boundaries and linkages, framed 
by organizational ecology and community ecology, framed by institutional 
actors and environments. Although the authors explain these terms as 
lucidly as one can, a fundamental problem remains. Where do they leave 
either readers or researchers as one moves up to quasi organizations’ sets, 
boundaries, populations, linkages, and fields? Researchers, it would ap- 
pear, legitimate and reify these political and commercial enterprises, giving 
them further dignity as constituting “sets” and “fields.” But, then again, 
how would they avoid doing so and still carry out their research? 


A Watched Pot. By Michael Flaherty. New York: New York University 
Press, 1999. Pp. 159. $30.00. 


John P. Robinson 
University of Maryland 


Anyone who has been in a traffic accident probably remembers thinking, 
“this is not happening” as the crash unfolded in “slow motion.” Similarly, 
waiting for hours in a doctor’s office, during police detainments, or in 
other episodes of “nonboiling pots” leads to a perception of time in sus- 
pended animation. 

Michael Flaherty has clearly spent a good part of his life mulling over 
and systematically documenting these existential facets of everyday tem- 
poral life. He has further sought to put them into an elegant theoretical 
framework, building on the writings of Mead, Dewey, Goffman, and Den- 
zin among others. His observations and conclusions are conveniently laid 
out in this extended essay, consisting of six brief chapters and a meth- 
odological appendix. 

After a largely theoretical introductory chapter reviewing the experi- 
ential subtleties and puzzles involved, in chapter 2 Flaherty describes 
three basic paradoxes in our temporal experience: (1) perceived time passes 
slowly when the pace of activity is either abnormally high or abnormally 
low; (2) the same slow-moving interval of time in the present is perceived 
in retrospect as having passed more quickly; and (3) some busy intervals 
pass quickly while other equally busy intervals pass slowly. In each case, 
he finds folk theories and slogans either contradicting these hypotheses 
or unable to explain them (despite his use of the “watched pot” metaphor 
in the title). In his final paragraphs, he alerts the reader that the theory 
he will develop to account for these paradoxes is counterintuitive but 
based on “research and discovery, and not the reiteration of common 
sense” (p. 39). 

The third chapter on “protracted duration” provides an abundance of 
anecdotal quotes describing that state, as drawn from biographical ac- 
counts—both in 389 press and other published text quotes and from 316 
brief student essays. Interestingly, when these accounts are coded into six 
categories, the two sources provide roughly equal proportions of experi- 
ences falling in these categories. The main three categories—suffering/ 
intense emotion, violence/danger, and waiting/boredom—each contain a 
quarter to a third of all quotes, while the rest are described under altered 
states (9%), concentration/meditation (5%), shock/novelty (4%), and other 
(2%). Even though the waiting/boredom examples (usually classroom sit- 
uations among students) seem to stand out from the rest, Flaherty main- 
tains they slow the passage of time “for exactly the same reasons” (p. 63). 

These reasons are then detailed in the fourth chapter. The first is the 
extremity of the circumstances—either involving very high or very low 
levels of energy outlay in relation to normal “synchronous” activity. The 
second is the resulting increase in emotional concern about one’s 
understanding of the situation, the third heightened cognitive involvement, the 
fourth greater stimulus complexity (even if the situation itself is not com- 
plex), and fifth the increased “density” of experience. The crucial result 
is the author’s U-shaped curve on page 105 that predicts that perceived 
time slows in periods when activity is either too high or too low in relation 
to normal periods of being synchronous (in sync) with one’s normal levels 
of activity. 

The fifth chapter addresses the second major paradox described in 
chapter 2, namely that the further in the past an activity occurs, the more 
quickly it is perceived to have occurred. Here, Flaherty draws on ratings 
from his students when asked to reflect on life events that occurred “yes- 
terday,” in the last month or in the last years. Even though the pace of 
life should be the same in the three circumstances, he finds the further 
in the past they occurred, the more quickly his student respondents rated 
them as having happened. One exception is that, while subject to the 
same time pattern, middle-aged students rated time passage in all three 
conditions as faster than either younger or elderly students—a finding 
consistent with other research showing middle age to be the most hectic 
phase of Americans’ lives. 

In his final chapter, the author interprets his results in relation to the- 
oretical writings on the sociology of emotions and subjectivity concluding 
that “the stream of consciousness is a heavily managed flow,” in which 
habitual and problem-solving behaviors occur simultaneously rather than 
oscillating between the two extremes (as implied in the literature). One 
major implication for future research is to challenge the conclusion from 
anthropology that such time-dependent results might only hold for 
societies organized around time and the clock: “For now, . . . a glimmer of 
evidence . . . suggests the possibility of cross-cultural uniformity in tem- 
poral experience” (p. 151). Finally, he joins with Maslow, Csikszentmi- 
halyi, and other scholars who see behavior as more actively chosen by 
individuals rather than as oversocialized units acting as programmed by 
society. 

All these profound conclusions and speculations, then, essentially rest 
on bivariate results from two empirical studies with rather unrepresen- 
tative samples asked very few and very limited questions about their 
everyday lives. One can be impressed with the range of theorists reviewed 
by the author, but one might also wonder whether Mead or Dewey would 
have agreed with the implications and speculations he has drawn from 
their work. He tacks direct quotes from them on time and human energy 
issues, but perhaps other scholarly writings might make his case more 
persuasively. For example, several of the findings are reminiscent of 
Sherif’s assimilation-contrast model or of cognitive psychological studies 
of “telescoping” of time in self-reports of activity. 

However theoretically provocative, there is little more in the way of 
theoretical development in this book than outlined above, and the main 
ideas are repeated or restated in so many ways that readers may wonder 
whether they understood them in the first place. This illustrates what 
mischief the mind gets into in slow-moving encounters and why the qual- 
itative differences between ultra boring and exhilarating experiences tran- 
scend their similarity in terms of slow time passage. Even Flaherty notes 
how they refer to qualitatively different types of “slowness” experiences. 


The Evolution of Norms¹ 


Jonathan Bendor 
Stanford University 


Piotr Swistak 
University of Maryland, College Park 


Social norms that induce us to reward or punish people not for what 
they did to us but for what they did to other members of one’s 
group have long been thought of as sine qua non sociological and thus 
impossible to explain in terms of rational choice. This article shows 
how social norms can be deductively derived from principles of 
(boundedly) rational choice as mechanisms that are necessary to 
stabilize behaviors in a large class of evolutionary games. 


INTRODUCTION 


The question, Why are there norms? is perhaps one of the most funda- 
mental problems that the social sciences have ever tackled. Its significance 
turns on the causal significance of its subject matter: “No concept is 
invoked more often by social scientists in the explanation of human 
behavior than ‘norm’” (Gibbs 1968, p. 212).² 


¹ For comments on earlier drafts of this article we would like to thank Jenna Bednar, 
Phillip Bonacich, Charles Cameron, Joe Harrington, Robert K. Merton, Harvey 
Molotch, John Padgett, Scott Page, Janek Poleszczuk, David Snow, and seminar 
participants at the University of California, Berkeley, Boston University, University of 
California, Irvine, Jachranka, Stanford University, University of California, Los Angeles, 
and the University of Chicago. Swistak thanks the Russell Sage Foundation, where 
he was a visiting scholar while much of the work on this article was done. Bendor 
thanks the Center for Advanced Study in the Behavioral Sciences for its financial and 
intellectual support. Direct correspondence to Jonathan Bendor, Graduate School of 
Business, Stanford University, 518 Memorial Way, Stanford, California 94305-5015. 
E-mail: bendor_jonathan@gsb.stanford.edu 

² This quote was brought to our attention by Christine Horne (2001). A recent review 
of the literature in sociology, economics, and game theory on the emergence of norms 
is given in a series of papers by Horne (2001), Eggertsson (2001), and Voss (2001). 


The Problem 


In the most general sense a norm might be regarded as any rule of be- 
havior. Yet sweeping notions are rarely analytically useful. For this reason 
the concept of a norm has been typically restricted to a subset of behavioral 
rules. A norm, for instance, has often been considered to be in place “if 
any departure of real behavior from the norm is followed by some pun- 
ishment” (Homans 1950, p. 123; see also Blake and Davis 1964, p. 457). 
Indeed if violating a rule never triggers any kind of punishment, calling 
it a norm would violate a common sociological intuition.³ Thus, following 
Homans and Blake-Davis, we will understand norms as behavioral rules 
that are backed by sanctions. Hence when we ask, Why are there norms? 
it is norms in this sense that we have in mind. And as we shall later see, 
one of our results confirms the intuition that behavioral rules without 
sanctions are unimportant. 

A second aspect of norms that seems fundamental to the notion relates 
to the nature of sanctioning. If a norm is violated, the obligation to impose 
punishment need not be restricted to those who were hurt by the original 
transgression; it can be extended to third parties—people unaffected by 
the deviation but who are in a position to punish the deviant. We will 
refer to rules of behavior that include third-party sanctions as social 
norms. Note that norms that obligate third parties to impose sanctions 
can be considered quintessentially social: by imposing requirements on 
an entire community and not merely on the interested parties, they create 
a general code of conduct. Violations of a general code matter to everyone 
in a community; they are not merely private matters between two parties. 
And because they are general, potentially embroiling people in conflicts 
that were not of their own making, it is precisely social norms that have 
been thought to be sine qua non sociological and thus impossible to explain 
in terms of rational choice. (Why, after all, should A punish B for what 
B did to C?) As we shall prove below, social norms can not only be derived 
as rational forms of behavior but, more important, they turn out to be 
necessary to stabilize behavior in groups and institutions. Consequently, 
most of our results focus on explaining social norms: their emergence and 
their features. 


³ If violating a rule never triggers any kind of punishment, a rule is completely 
discretionary and has no binding force. Indeed, even a common understanding of the 
term “norm” implies the existence of an enforcement mechanism. For example, the 
first definition of “norm” in Webster’s Ninth Collegiate Dictionary is “a principle of 
right action binding upon the members of a group and serving to guide, control or 
regulate proper and acceptable behavior” (1989, p. 806). 


Problems with the Existing Explanations 


Given the fundamental importance of the problem of norms, it is not 
surprising that all of the social sciences have tried to resolve it. While the 
answers have crossed interdisciplinary boundaries, they have mainly 
fallen into two classes of explanations: one comes from the tradition of 
methodological individualism and economics, the other one draws on the 
structural-functional tradition of sociology. The first type of explanation 
assumes an individual to be a Homo economicus and explains actions in 
terms of individual choices and objectives. In the structural-functional 
tradition an individual is seen as Homo sociologicus and his or her be- 
havior is explained in terms of group influence (e.g., conformity pressures) 
and the function that this behavior serves a group or society. Both modes 
of explanation have been criticized for some inherent problems; both have 
left a number of important questions unanswered. Before we present our 
solution to the problem of norms and the underlying theory, it will be 
good to remind the reader why the existing explanations have been crit- 
icized. The ensuing list of problems will set a reference point for the 
construction that follows. It will also serve as a set of objectives or criteria 
that can be used to evaluate the explanation proposed in this article. 
Arguably, the most prominent answer to the problem of norms is that 
of functionalism and the homo sociologicus tradition: norms exist because 
they are functional for the group.’ As is well known, however, a variety 
of problems attend this answer. First, there are conceptual issues: in par- 
ticular, how are we to define the key concept, “functional”? Second, there 
are empirical problems: the strong functionalist thesis—all norms are func- 
tional for their collectivities—seems to founder empirically in the face of 
codes involving, for example, revenge and vendettas. Third, there are 
theoretical issues. As Stinchcombe (1968) and Elster (1983, 1989b) have 
argued, a complete functionalist argument must specify a selective pres- 
sure for the actions, rules, or structures that fulfill the functions. The 
selective pressure may be feedback loops (e.g., firms that do not maximize 
profits fail more often than those that do) or incentives (e.g., pressures to 
conform to roles or the internalization of values and norms) that directly 


“This is what Wilbert Moore has called “the ‘canonical’ view that any observed 
cultural form or pattern of behavior must fit the system—that is, must have a function” 
(1978, p. 328). By functionalism we mean the school of thought that flourished in 
sociology in the 1950s and 1960s under the influence of Talcott Parsons and Robert 
Merton. The essential aspects of this paradigm go back to the early functionalism of 
cultural anthropologists such as Bronislaw Malinowski and Alfred Radcliffe-Brown 
as well as the tradition of European sociology, in particular that of Emile Durkheim. 
induce decision makers to take the required actions.’ Without some such
specification the functionalist thesis becomes an illegitimate teleology. 
Fourth, structural-functional explanations of norms are generally static, 
a feature they share with most microeconomic and game-theoretic models 
of norms. The common problem with these theories is that while they 
can explain properties of equilibria, they cannot explain how a particular 
equilibrium has been arrived at. Thus these are static theories of inherently 
dynamic phenomena. 

The fifth and perhaps the most important problem concerning func- 
tionalism (and a good part of standard sociological theory as well) was 
characterized by Coleman as follows: “Much sociological theory takes 
norms as given and proceeds to examine individual behavior or the be- 
havior of social systems when norms exist. Yet to do this without raising 
at some point the question of why and how norms come into existence 
is to forsake the more important sociological problem in order to address 
the less important” (1990a, p. 241).

Coleman has seen the problem of deriving macro properties (properties 
of groups) from micro properties (properties of individuals) as the main 
unsolved problem of sociology and the program (Coleman 1986) for so- 
ciology’s future development. “The emergence of norms is in some respect 
a prototypical micro-to-macro transition, because the process must arise 
from individual actions yet a norm itself is a system-level property” (Co- 
leman 1990a, p. 244). For Coleman an effective theory of norms would 
begin with a set of assumptions about individuals in a group and conclude 
with a deductive derivation of group norms. 


The Proposed Solution 


In this article we show how evolutionary game theory can be used to 
explain an essential type of norms—social ones. In addition to bridging 
the micro-macro gap, this solution avoids, we believe, all the other prob- 
lems of standard functional explanations that were mentioned above. 
Further, because we employ evolutionary game theory rather than (Nash) 
equilibrium analysis, we avoid the problems of static analyses that have 
attended efforts to explain norms via classical (noncooperative) game 
theory. 

More specifically our answer to the question, Why are there norms? is 
an evolutionary one (Schotter 1981; Opp 1982; Sugden 1989; Bicchieri 


* Parsons and other structural-functionalists recognized that individuals had to be 
motivated, somehow, to implement societal functions. But their reliance on the
micromechanisms of conformity and internalization has been criticized for reflecting, as
Wrong put it (1961), an “oversocialized conception of man.” 
1993, chap. 6; Binmore and Samuelson 1994). As Sugden put it, norms
“have evolved because they are more successful at replicating themselves 
than other patterns [of behavior]” (1989, p. 97).° These replication
processes are realized via incentives that induce members of a group to favor
norms over other kinds of rules that do not encode norms. Since evolu- 
tionary game theory assumes individuals who, though boundedly rational, 
pursue norms that give them higher utilities, the main problem of a func- 
tional theory—How are functions realized?—is solved in a straightfor- 
ward manner: individuals will support a norm whenever it gives them 
higher utility to do so.” 

As noted, evolutionary game theory has the important substantive ad- 
vantage of being a dynamic theory of behavior (Schotter 1981). Evolu- 
tionary game theory (Maynard Smith and Price 1973; Maynard Smith 
1982) analyzes how populations of strategies change over time. The key 
assumption is that relatively successful strategies proliferate, while
relatively unsuccessful ones dwindle. This core postulate refers directly to
disequilibrium dynamics. Thus, unlike classical game theory, which has 
focused on behavior in equilibrium, evolutionary game theory can
inherently deal with disequilibrium phenomena.’ In other words,
evolutionary game theory can not only establish what is equilibrium behavior;
it can also help us understand the dynamics that led to that behavior and 
forces that keep it stable. These properties preadapt it naturally to ana- 
lyzing the emergence of norms. 

Most of our results focus on explaining the emergence of social norms 
and, in particular, rules of behavior that prescribe third-party sanctions 
(in addition, of course, to dyadic, second-party punishments). Specifically, 
we use evolutionary game theory to show that strategies that encode social 
norms are necessary to stabilize behaviors in all nontrivial games (theorem 
1); strategies lacking the social property of third-party enforcement cannot 
do the job. Moreover, for such strategies to be evolutionarily stable re- 
gardless of how quickly individuals adjust their behaviors, there must be 
an effective punishment for deviations from the norm (theorem 2). Stable 
strategies in such games must preserve the essence of their “social” nature 
throughout the game and throughout the relevant population (theorems 
3, 4, and 5). Further, we show that some social norms, that is, the strategies 


* Though our argument is close to Sugden’s (1989; see also Sugden 1986), he does not
focus on strategies that impose third-party sanctions. Few of the examples in his book 
are what we call “social strategies.” 

? For functionalist or efficiency-oriented analyses that take the microincentive problem 
seriously, see Ellickson (1991) and Calvert (1995). 

“It is therefore important not to let the name, “evolutionary game theory,” cause 
confusion: despite their semantic similarities, evolutionary game theory differs from
classical game theory in several significant ways. 
that encode them, are both evolutionarily stable and dysfunctional, in the
sense of being Pareto inefficient (theorem 6). We also show, however, that 
functional (Pareto-efficient) social norms are more stable than dysfunc- 
tional ones (theorems 7 and 8). Thus, although a group can indeed dwell 
in a bad equilibrium underpinned by counterproductive social norms, 
long-run dynamics tend to drift toward more efficient ones. And quali- 
tatively, there is a tendency for groups to drift toward an equilibrium 
regulated by social norms, instead of staying in equilibria where social 
norms have no part. 

Substantively our results apply to a very wide spectrum of phenomena. 
If we were to agree with Ullmann-Margalit (1977) that norms are em- 
pirically relevant in three types of situations—prisoner’s dilemma (PD) 
problems, coordination problems, and situations of inequality where the 
status quo benefits one party and hurts another—then most of our results 
cover all three types. Indeed, our earlier result (Bendor and Swistak 1998), 
as reinterpreted in this article (theorem 1), links stability with the necessity 
of social norms and covers literally all two-person games of any strategic
interest whatsoever. Because, however, it is useful to motivate and illus- 
trate the general argument via a specific example, we focus throughout 
on a particular game—the two-person PD, which is so well understood 
in the social sciences that little explication of the game is required here.? 
We hope that the example’s specificity will not mislead readers into under- 
estimating the generality of the analytical results that follow. 

All of our results are based on evolutionary game theory. This approach
diffused early and rapidly in the biological sciences and a bit later, though 
no less rapidly, in game theory, economics, and political science." Soci- 
ology remains much less influenced by the rational choice paradigm in 


’ In the one-shot PD, each player has two choices, to cooperate or defect. Payoffs to 
each player in the one-shot game are as follows: R (when both cooperate), P (when 
both defect), T (to the defecting player when his opponent cooperated), and S (to the 
cooperating player when his opponent defected). The payoffs are assumed to satisfy 
T>R>P>S: defection yields a higher payoff regardless of the opponent’s action, yet 
mutual defection is worse for both players than is mutual cooperation. Further, in 
analyses of the repeated game it is typically assumed that 2R > T + S: mutual
cooperation is better than alternately exploiting and being exploited.

10 See, e.g., Hines (1987), Axelrod and Dion (1988), and Vincent and Brown (1988) for 
review articles in the biological sciences. For review articles focused on economics and 
political science, see, e.g., Friedman (1991), Selten (1991), Mailath (1992), and
Samuelson (1993). There are also many books devoted in part or entirely to evolutionary
game theory. These include Axelrod (1984), van Damme (1987), Hofbauer and Sigmund 
(1988), Binmore (1992), Bomze and Potscher (1989), Cressman (1992), Fudenberg and 
Levine (1998), Samuelson (1998), Vega-Redondo (1996), Weibull (1995), and Young 
(1998). 
general and evolutionary game theory in particular. Because it may still
be relatively unfamiliar to many sociologists, we provide an introduction 
to game theory (classical and evolutionary). Next we lay out the central 
problem: the instability of dyadic strategies and what this implies about 
norms. Then, we show that stable nondyadic strategies do exist by in- 
troducing one strategy, conformity (or CNF), that uses an ancient social 
logic to stabilize itself. We then provide general results about the existence 
of stable strategies and their corresponding norms in all two-person games. 
In the following sections, we identify several essential properties of stable 
strategies and refute the strong functionalist thesis by showing that some 
stable norms are Pareto deficient. The functional thesis is partially re- 
deemed later, when we show that more efficient norms are more robust 
than less efficient ones. In our two concluding sections we sketch some 
extensions, identify circumstances in which norm-encoding strategies do 
not enjoy evolutionary advantages, and give some brief applications of 
the results. 


GAME THEORY AND EVOLUTIONARY ANALYSIS 


There are two standard ways to clarify an abstract, mathematical ar- 
gument. First, one could begin by constructing the simplest model and 
then extend it gradually to more complex ones. Second, instead of dis- 
cussing the problem in general terms, one could illustrate it via a widely 
known and well-understood example. Since many sociologists are rela- 
tively unfamiliar with the language and tools of game theory we will use 
both methods to make the presentation as clear as possible. 

Although evolutionary game theory is now a distinct mode of analysis, 
it grew out of classical, noncooperative game theory. Hence some very 
basic knowledge of the latter is useful for understanding the former. We 
begin, therefore, with some simple yet essential elements of noncooperative 
game theory, embedded in the substantive context of the “problem of 
cooperation” (as represented by the PD).” We then turn to evolutionary 
game theory. 


11 Rational choice still remains a rare mode of theorizing in sociology (Coleman and
Fararo 1992; Hechter and Kanazawa 1997; Voss and Abraham 1999) despite the
influential early history of methodological individualism and rational choice (Homans 1950,
1961; Blau 1964), the attention garnered by Coleman’s program (Coleman 1986, 1990a),
and a growing number of conspicuous contributions from younger sociologists (e.g.,
Hechter 1987; Heckathorn 1988, 1990; Macy 1993; Macy and Skvoretz 1998). For some
notable exceptions on the use of evolutionary game theory in sociology, see, e.g.,
Coleman (1990b), Kollock (1993), Lomborg (1996), and Raub and Weesie (1990).

12 This explanation is based in part on Bendor and Swistak (1996).
Some Basic Game Theory


The problem of cooperation and its simplest model.—To better under- 
stand the nature of the general problem of norms, we begin our discussion 
with the simplest possible game-theoretic model (a one-shot, two-by-two 
game) of one of the most important problems of the social sciences—the 
problem of cooperation. Informally speaking, the problem of cooperation 
concerns explaining when rational actors will stably cooperate as they 
interact in situations involving conflict. Arguably, social interactions are 
fundamentally based on such exchanges. Many informal interactions, in 
small groups or networks for instance, turn on an exchange of favors, 
and such exchanges exhibit an important type of tension: egoistically, it 
is always best to obtain favors without returning them (doing a favor is 
costly), and the worst is to give and not get; yet both parties are worse 
off when favors are withheld than they would have been had favors been 
exchanged. 

More specifically, consider two players, Row and Column. Row can 
either do Column a favor or refuse to do so; Column has similar options. 
Let’s assume that if Row obtains a favor without returning one, then 
Row gets his best outcome, with a utility of T = 5. Assume, moreover, 
that the second-best outcome (R = 3) results from the mutual exchange 
of favors. If favors are withheld, Row gets the third-best outcome 
(P = 1). Finally, the worst outcome for Row is when Row helps Column 
but gets nothing in return (S = 0). For simplicity, we can further assume 
that the game is symmetric so that Column has identical utilities. With 
T>R>P>S this is an instance and the simplest model of the famous 
PD game. Defection is a dominating strategy: it yields a higher payoff 
regardless of the other player’s action. This makes mutual defection (no 
favors exchanged) the only equilibrium in the one-shot version of this 
game. Rational actors are, hence, bound to end up defecting with each 
other. The dreary outcome is that in equilibrium players get a payoff of 
(P = 1, P = 1) which is worse, for both, than the payoff to mutual co- 
operation—when favors are exchanged—of (R = 3, R = 3). Defection is 
individually rational but collectively suboptimal. (See fig. 1 for the choices
and payoffs of the one-shot PD.) 
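
The one-shot game just described can be checked mechanically. The following is a minimal sketch in Python, using the illustrative payoffs from the text (T = 5, R = 3, P = 1, S = 0); the function and variable names are hypothetical and are introduced only for this illustration. It verifies that defection dominates cooperation and that mutual defection is the only pure-strategy equilibrium.

```python
# Payoffs from the text's example: T = 5, R = 3, P = 1, S = 0.
T, R, P, S = 5, 3, 1, 0

# PAYOFFS[(row_move, col_move)] -> (row_payoff, col_payoff).
# "C" = do the favor (cooperate), "D" = withhold it (defect).
PAYOFFS = {
    ("C", "C"): (R, R),
    ("C", "D"): (S, T),
    ("D", "C"): (T, S),
    ("D", "D"): (P, P),
}
MOVES = ("C", "D")


def row_payoff(row_move, col_move):
    return PAYOFFS[(row_move, col_move)][0]


def col_payoff(row_move, col_move):
    return PAYOFFS[(row_move, col_move)][1]


def defection_dominates():
    """True if D pays Row strictly more than C against every Column move."""
    return all(row_payoff("D", c) > row_payoff("C", c) for c in MOVES)


def nash_equilibria():
    """Pure-strategy profiles from which neither player gains by deviating alone."""
    result = []
    for r in MOVES:
        for c in MOVES:
            row_ok = all(row_payoff(r, c) >= row_payoff(alt, c) for alt in MOVES)
            col_ok = all(col_payoff(r, c) >= col_payoff(r, alt) for alt in MOVES)
            if row_ok and col_ok:
                result.append((r, c))
    return result


if __name__ == "__main__":
    print("Defection dominates:", defection_dominates())
    print("Pure-strategy equilibria:", nash_equilibria())
```

With these payoffs the only profile that survives the deviation check is (D, D), even though both players would prefer (C, C).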

A model with repeated interactions.—If the PD is played just once (and
players know this and are rational) then the game’s outcome is a Pareto- 
inferior set of payoffs; both players could have done better had they chosen 
to cooperate. Suppose, however, that individuals interact repeatedly, and 
they anticipate many future interactions with no definite termination date. 
(In a repeated version of the PD we will additionally assume that R >
(T + S)/2, which means that the collectively optimal outcome results from
mutual cooperation.) Formally, we can model this situation by assuming 
FIG. 1.—The prisoner’s dilemma game: T > R > P > S and R > (T + S)/2 (row player’s
payoff is listed first in each cell).


that both players play an infinitely repeated” one-shot game with the 
same payoff matrix and the same probability of continuing the game in 
every period.“ A (pure) strategy of player A in a game with B is defined 
as a complete plan for A; that is, a function which specifies A’s move in 
any period k depending on B’s moves toward A in periods 1 through
k − 1.” Let V(j,i) denote strategy j’s expected payoff when it plays strategy
i. The payoff to each strategy, V(j,i) and V(i,j), is then computed as the
expected value of a string of payoffs resulting from the play between two 
strategies. For instance, if both j and i are “always defect” (ALL- 
D)—defect unconditionally in all periods—the payoff to each of them is 
computed as the following infinite sum: 


$$V(\text{ALL-D}, \text{ALL-D}) = P + P\delta + P\delta^2 + P\delta^3 + \cdots = \frac{P}{1 - \delta}$$
In general we can interpret δ as an effect of discounting future payoffs
or a probability that the players will interact in the next period or a joint
effect of uncertainty and discounting. Since all three interpretations of δ
deal with the effects of the future, it is both natural and accurate to refer
to a game with sufficiently high δ as a game in which the future is
sufficiently important. We will use this less technical phrase throughout the
article. 
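
The role of δ can be made concrete numerically. The sketch below is illustrative only: the value δ = 0.9, the truncation after a fixed number of periods, and all function names are assumptions introduced here. It compares a truncated discounted sum for two ALL-D players with the closed form P/(1 − δ), and computes the expected number of periods when δ is read as a continuation probability.

```python
def discounted_sum(payoffs, delta):
    """Sum of payoffs[k] * delta**k over a finite stream of per-period payoffs."""
    return sum(p * delta**k for k, p in enumerate(payoffs))


def all_d_vs_all_d(P, delta, periods):
    """Truncated payoff to ALL-D against ALL-D: P in every period."""
    return discounted_sum([P] * periods, delta)


def closed_form(P, delta):
    """Limit of the geometric series P + P*delta + P*delta**2 + ..."""
    return P / (1 - delta)


def expected_duration(delta):
    """Expected number of periods when the game continues with probability delta."""
    return 1 / (1 - delta)


if __name__ == "__main__":
    P, delta = 1, 0.9  # P from the text's example; delta is an assumed value
    print(all_d_vs_all_d(P, delta, periods=2000))
    print(closed_form(P, delta))
    print(expected_duration(delta))
```

The truncated sum approaches the closed form as the number of periods grows, which is why an infinite series can stand in for a long but finite interaction of uncertain length.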

The reason we will focus on games for which the future is sufficiently 


13 See Rubinstein (1991) for why an infinite game can be used as a reasonable model
of finite interactions. 

14 Note that with this assumption—a constant probability of continuing the game—the
expected duration of the repeated game is finite and equals 1/(1 − δ).

15 A mixed strategy in a repeated game is defined as any probability distribution on a
set of pure strategies. 
important is simple—the problem of cooperation is theoretically
uninteresting in repeated games in which the future matters little. This is because
an equilibrium in an iterated PD (IPD) with low values of δ is the same
as the equilibrium in the one-shot PD—defection. The intuition behind 
the simple algebra of this result is simple too: if the future payoffs do not 
matter much, maximizing in a repeated game is really equivalent to
maximizing in a current period of the game. Hence for low δ, the ALL-D
strategy is the best response to all other strategies in the IPD. The
emergence and the stability of cooperation become meaningful issues only in
games where the future is sufficiently important. Hence from now on we 
will limit our analysis to this type of game. 

To see what kinds of strategies can be supported in equilibrium in 
games where the future is sufficiently important consider, for instance, a 
strategy of tit for tat (TFT): cooperate in the first period of the game and 
thereafter in any period k do what one’s partner did in k — 1. Consider 
two players, A and B. Suppose that player A is using TFT and player 
B, knowing A’s strategy, is trying to maximize against it. If the future is 
sufficiently important (i.e., δ is high enough), then—given the retaliatory
nature of TFT—it is best for B to cooperate with TFT in all periods. 
Defecting in any period would only lower the expected payoff.'* Hence, 
if B were contemplating playing TFT against A, he would have an in- 
centive to do so since using any other strategy would yield either a lower 
or, at best, the same expected payoff. If we now apply the same reasoning 
to the other player we see that neither player, given the other’s strategy, 
has an incentive to change his own. In game theory this means that the 
pair of TFT strategies in the IPD with sufficiently important future form 
a Nash equilibrium.” 

And so, the last paragraph provides two important observations. First, 
we have illustrated the notion of equilibrium in games—the so-called Nash 
equilibrium. And second, we have established that mutual cooperation 
can be sustained as a Nash equilibrium if the future is sufficiently im- 
portant. In contrast to the dreary outcome of the one-shot game, this is 
clearly an optimistic conclusion. 
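
How important the future must be can be illustrated with one particular deviation. The sketch below compares the payoff of TFT against TFT with the payoff of ALL-D against TFT, using the closed forms R/(1 − δ) and T + Pδ/(1 − δ). The function names are hypothetical, and the check covers only the ALL-D deviation; it is not a proof that the TFT pair is an equilibrium, since other deviations may require a higher δ.

```python
def v_tft_tft(R, delta):
    """Payoff when both players use TFT: mutual cooperation in every period."""
    return R / (1 - delta)


def v_alld_tft(T, P, delta):
    """Payoff to ALL-D against TFT: T once, then mutual defection forever."""
    return T + P * delta / (1 - delta)


def cooperation_beats_all_d(T, R, P, delta):
    """True if staying with TFT pays more than switching to ALL-D against TFT."""
    return v_tft_tft(R, delta) > v_alld_tft(T, P, delta)


def all_d_threshold(T, R, P):
    """Smallest delta above which TFT outscores ALL-D against TFT.

    R/(1 - d) > T + P*d/(1 - d)  is equivalent to  d > (T - R)/(T - P).
    """
    return (T - R) / (T - P)


if __name__ == "__main__":
    T, R, P = 5, 3, 1  # payoffs from the text's example
    print("threshold:", all_d_threshold(T, R, P))
    for delta in (0.4, 0.5, 0.6, 0.9):
        print(delta, cooperation_beats_all_d(T, R, P, delta))
```

For the example payoffs the threshold is 0.5: below it, grabbing T once and accepting mutual defection afterward is the better reply to TFT; above it, cooperating throughout pays more.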

The existence of a cooperative equilibrium does not mean, however, 


16 For instance, the strategy of ALL-D will score lower with TFT than would another
TFT. To see this, note that $V(\text{ALL-D}, \text{TFT}) = T + P\delta + P\delta^2 + P\delta^3 + \cdots = T + P\delta/(1 - \delta)$
and $V(\text{TFT}, \text{TFT}) = R + R\delta + R\delta^2 + R\delta^3 + \cdots = R/(1 - \delta)$. If we take, e.g., a
game with T = 5, R = 3, P = 1, S = 0 then V(TFT, TFT) > V(ALL-D, TFT) in all
games where δ > 0.5. In fact, any strategy that defects with TFT in any period of the
game will do worse than a strategy that cooperates with TFT in all periods. For the
proof that TFT is a best response (i.e., one that yields a maximal payoff) to TFT when
the future is sufficiently important, see, e.g., Axelrod (1984).

17 This term is used to honor John Nash, one of the main pioneers of game theory.
that a game will end with both parties cooperating. Whether it will or
not depends on what other equilibria are possible in the game. The prob- 
lem of identifying all equilibria in repeated games is the subject of a well- 
known result that goes by a cryptic name: the folk theorems. While folk 
theorems apply to any repeated game—not just the IPD—we focus below 
on their interpretation and implications for the game of IPD—the case 
of our running example. Also, to avoid getting mired in more technical 
aspects, we will consider one very specific interpretation of the IPD folk 
theorems. 

This interpretation concerns the “amount of cooperation” sustainable 
as a Nash equilibrium in the IPD. Suppose we measure the amount of 
cooperation in the IPD as the frequency of cooperative moves in the set 
of moves by both players. With this interpretation, the relevant folk the- 
orems can be stated briefly: any amount of cooperation can be sustained 
in equilibrium. Hence, a pair of strategies in equilibrium can support 
anything between 0% cooperation (e.g., a pair of ALL-D strategies) to 
100% cooperation (e.g., a pair of TFT strategies).’? Thus, an equilibrium 
may contain permanent cooperation, or permanent defection, or any in- 
termediate outcome between these two extremes. 

What eventuates as equilibrium behavior depends, of course, on what 
player A thinks player B is playing and what B thinks A’s strategy is. If 
A believes that B is playing ALL-D, A would have no incentive to do 
anything but to play ALL-D as well.” The same reasoning holds for B. 
Thus, if A thinks that B is playing ALL-D and B thinks that A is playing 
ALL-D, they will both keep defecting when playing each other. But this 
presents the following problem: What if A’s beliefs about B or B’s beliefs 
about A are wrong? Would their interaction correct this problem? Clearly 
not. As they both constantly defect with each other, the game’s actual 


18 According to Aumann (1987, p. 20) one particular version of the folk theorems has
been known since the late fifties. The name was invented because the theorem lacks 
clear authorship and game theorists knew about the result long before any published 
account of it appeared. Since the same effect of a proliferation of equilibria can be 
obtained under different assumptions, the plural form—folk theorems—is often used. 
By now there are many accounts of this result in the literature. The reader may want 
to consult any of the recent textbooks on game theory (e.g., Rasmusen [1989] provides 
a less technical account; Fudenberg and Tirole [1991] and Myerson [1991] present
well-formalized, more mathematical accounts).

19 Hence, using classical game theory it is impossible to rule out any outcome in the
IPD that gives each player his/her “security level,” i.e., the payoff that they can
unilaterally obtain for themselves. (In the IPD, a player can ensure that s/he gets at least
P/(1 − δ), simply by always defecting. Hence an outcome in which someone gets less
than this value cannot be an equilibrium.)

20 Note that playing ALL-D is the best response to always defect in all games, i.e., no
matter how important future payoffs are.
play gives them no opportunity to disconfirm the belief that their opponent
is playing ALL-D. 

Consider, for instance, the following eventuality: suppose that A plays 
a suspicious version of TFT (i.e., STFT): defect in the first period and 
thereafter play what the opponent played in the previous period. Assume 
also that A thinks that B is using ALL-D. Suppose now that B does the 
same: he plays STFT and thinks that A plays ALL-D. Then both A and 
B will defect in every period of the game, making it impossible to dis- 
confirm either player’s hypothesis about his partner’s strategy.”’ Conse- 
quently, these mistaken hypotheses will lock the players into the inefficient 
outcome of mutual defection—unless they have some additional infor- 
mation that could test their hypotheses about their opponent’s strategy. 
Such additional information can be provided by the social context of 
interactions. Indeed, if interactions take place in a small group, for ex- 
ample, players can often observe what happens between others. And even 
if they do not directly observe the relevant actions, they will frequently 
learn about them through gossip. Information about these other inter- 
actions may often reveal that a partner’s strategy is not at all what one 
might have thought it was. Thus a group may be a clearing house for 
information that allows individuals to learn and to adjust their behaviors 
in ways that would be impossible otherwise. This brings us, finally, to 
evolutionary game theory, which extends classical game theory by ana- 
lyzing how individual behaviors will evolve as people learn about the 
behaviors of others in the group. 


Evolutionary Game Theory 


A game in a group and the evolution of behaviors.—Since folk theorems
imply that it may be rational to play a great many strategies in the IPD, 
the issue of learning becomes fundamental in games with sufficiently 
important futures. A group supplies an important vehicle for learning 
about the strategies of others. Take, for instance, our example of two 
players, A and B, who kept defecting with each other while wrongly 
assuming that their opponent played an ALL-D strategy, whereas both 
A and B have, in fact, played a strategy of STFT. If A and B play against 
each other in this manner, they will never find out that their assumptions 
about the strategy of the other player are wrong; consequently, they will 
be unable to maximize against the opponent’s true strategy of STFT.” 


21 This means that the strategies of both players are rationalizable (Bernheim 1984;
Pearce 1984). 


22 Any strategy that generates mutual cooperation with suspicious TFT from period 2
on is (if the future is sufficiently important) a best response to that strategy.
This may not be so if there were another player in the ecology with whom
A or B were able to establish cooperation. Suppose, for instance, that 
there is a player C who plays a strategy of tit for two tats (cooperate in 
the first two periods of the game and then defect if and only if the opponent 
defected in the past two consecutive periods; hereafter TF2T). Given the 
definitions of STFT and TF2T it is easy to see that C will induce mutual 
cooperation with both A and B from period 2 on. However, cooperation 
with C should lead A and B to revise their beliefs about each other. 
Consequently, A and B may want to change their strategies in a way that 
would allow them to cooperate with each other and, hence, maximize 
their payoffs in the games they play in this group. For instance, they may 
switch to playing TF2T, given this strategy’s success in establishing co- 
operation with A and B. In general, having learned about others’ strat- 
egies, players may want to switch their own strategies to ones that were 
more successful. This example depicts the core idea of an evolutionary 
game. 
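
The A, B, and C example can be replayed move by move. The sketch below is a hypothetical illustration (the function names and the six-period horizon are assumptions): each strategy is written as a function of the opponent's past moves, following the verbal definitions of STFT, TF2T, and ALL-D given in the text.

```python
def stft(opponent_moves):
    """Suspicious tit for tat: defect first, then copy the opponent's last move."""
    return "D" if not opponent_moves else opponent_moves[-1]


def tf2t(opponent_moves):
    """Tit for two tats: defect only after two consecutive defections by the opponent."""
    return "D" if opponent_moves[-2:] == ["D", "D"] else "C"


def all_d(opponent_moves):
    """Always defect."""
    return "D"


def play(strategy_a, strategy_b, periods):
    """Return the move sequences of A and B as strings such as 'DCCC'."""
    moves_a, moves_b = [], []
    for _ in range(periods):
        a = strategy_a(moves_b)  # A reacts to B's past moves
        b = strategy_b(moves_a)  # B reacts to A's past moves
        moves_a.append(a)
        moves_b.append(b)
    return "".join(moves_a), "".join(moves_b)


if __name__ == "__main__":
    print("STFT vs STFT :", play(stft, stft, 6))
    print("STFT vs ALL-D:", play(stft, all_d, 6))
    print("STFT vs TF2T :", play(stft, tf2t, 6))
```

Against an STFT player, an STFT opponent and an ALL-D opponent produce the same string of defections, so play within the pair cannot separate the two hypotheses. Against TF2T, the STFT player defects once and cooperates from period 2 on, which is the kind of observation that could prompt A and B to revise their beliefs about each other.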

The evolutionary game.—As is common with important notions, the 
general idea of an evolutionary game is simple.” A game represents pair- 
wise interactions between individuals in a group. The standard evolu- 
tionary model (Maynard Smith 1982) assumes that the interactions are 
unstructured: that is, every individual has the same constant probability 
of encountering any other member of the group. 

Each pairwise interaction is modeled as a one-shot game with a specific 
payoff matrix. Since individuals interact repeatedly, each pairwise contest 
ends up being a repeated game between the two individuals. In the game- 
theoretic model of a pairwise repeated interaction, 6 represents the con- 
tinuation probability, that is, the probability that, having reached period 
t, the game will continue into period t+ 1. A standard simplifying as- 
sumption is that the payoff matrix of the stage game (i.e., the one-shot 
game) stays the same across all iterations. 

Once a game is specified, V(i,j) denotes strategy i’s expected payoff
when it plays strategy j. Strategy i’s overall score or fitness, denoted
V(i), is the sum of its pairwise payoffs. Consider, for instance, a group of
three actors playing TFT, TF2T, and ALL-D. Each actor plays the other 
two in the group. For instance, TFT scores V(TFT, TF2T) in a game 
with TF2T and V(TFT, ALL-D) in a game with ALL-D. Thus V(TFT), 
the total score of TFT in this group, equals V(TFT, TF2T) + 
V(TFT, ALL-D). 
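
The three-actor example can be computed directly. The following sketch makes several assumptions that are not in the text: δ = 0.9, payoffs T = 5, R = 3, P = 1, S = 0 from the earlier example, and an infinite discounted sum approximated by truncating after 500 periods. The function names are hypothetical.

```python
# Stage-game payoff to the first player, using the text's example values.
PAYOFF = {("C", "C"): 3, ("C", "D"): 0, ("D", "C"): 5, ("D", "D"): 1}


def tft(opponent_moves):
    """Tit for tat: cooperate first, then copy the opponent's last move."""
    return "C" if not opponent_moves else opponent_moves[-1]


def tf2t(opponent_moves):
    """Tit for two tats: defect only after two consecutive defections."""
    return "D" if opponent_moves[-2:] == ["D", "D"] else "C"


def all_d(opponent_moves):
    """Always defect."""
    return "D"


def pairwise_payoff(strategy_i, strategy_j, delta, periods=500):
    """Approximate V(i, j): i's discounted payoff against j (truncated sum)."""
    seen_by_i, seen_by_j = [], []  # each list holds the opponent's past moves
    total = 0.0
    for k in range(periods):
        move_i = strategy_i(seen_by_i)
        move_j = strategy_j(seen_by_j)
        total += PAYOFF[(move_i, move_j)] * delta**k
        seen_by_i.append(move_j)
        seen_by_j.append(move_i)
    return total


def fitness(name, group, delta):
    """V(i): the sum of i's pairwise payoffs against every other group member."""
    return sum(
        pairwise_payoff(group[name], group[other], delta)
        for other in group
        if other != name
    )


GROUP = {"TFT": tft, "TF2T": tf2t, "ALL-D": all_d}
DELTA = 0.9  # assumed continuation probability

if __name__ == "__main__":
    for name in GROUP:
        print(name, round(fitness(name, GROUP, DELTA), 3))
```

Under these assumptions V(TFT) = V(TFT, TF2T) + V(TFT, ALL-D) is about 30 + 9 = 39, while TF2T scores about 38.1 and ALL-D about 31.6. The ranking depends on δ and on the composition of the group, so it should not be read as a general result.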

Evolutionary agents.—Evolutionary change involves actors adjusting
their behaviors (strategies) over time using trial and error: they adopt 
strategies that seem to work and discard those that do not. The model is 


23 For a more detailed introduction to evolutionary analysis, see, e.g., Weibull (1995).
dynamic. While the setup is essentially game theoretic, actors are not
assumed to be the spooky homo ludens—“well-informed mathematical
prodigies capable of costlessly performing calculations of enormous com- 
plexity at the drop of a hat [who assume] that [their] fellows are prodigies 
like [themselves], [and continue] to hold [that] belief whatever evidence 
to the contrary [they] may observe” (Binmore and Samuelson 1994, pp. 
45-46). Individuals are rational agents but boundedly so—their goal is to 
maximize, but they are capable of making mistakes.” 

How, then, will these boundedly rational agents adjust their behaviors? 
The answer to this question turns, naturally, on what they can observe. 
If the only observable conditions of the group are the performance and 
the relative frequencies of the strategies used by the actors, then the 
V(i)'s and p_i's would be the only factors that could affect replication of
behaviors. In other words, a dynamic process that governs replication can
only be a function of strategies’ fitnesses and their frequencies. 

Evolutionary dynamics.—So far the evolutionary model describes what 
happens in a group within a block of time which corresponds to a life 
span of players’ strategies or, speaking informally, their norms. We will
refer to this timespan as a generation. Within a generation, players learn
about each others’ norms. Across generations, they adjust their behavior
given what they have learned about the group so far. The essence of 
evolutionary dynamics is simple: the more fit a strategy is in the current 
generation, the faster it increases. In other words, an evolutionary process 
is a dynamic that is increasing in fitness. We call this the fundamental 
evolutionary postulate; any dynamic with this property will be called an 
evolutionary dynamic. Note that this axiom is about how strategy
frequencies change over time; it is not an equilibrium condition. As we have
emphasized earlier, disequilibrium dynamics are central for evolutionary
game theory. 
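
One familiar member of this class is a replicator-style update, in which each strategy's frequency is rescaled by its fitness relative to the group average. The sketch below is only an illustration of a dynamic that is increasing in fitness; the fundamental postulate does not single out this rule, and the function name, strategy labels, and numbers are hypothetical.

```python
from typing import Dict

def replicator_step(freqs: Dict[str, float],
                    fitness: Dict[str, float]) -> Dict[str, float]:
    """One generation of a discrete replicator-style update.

    Each frequency is multiplied by the strategy's fitness divided by the
    average fitness in the group, so fitter strategies grow faster.
    Assumes every fitness value is positive.
    """
    mean_fitness = sum(freqs[s] * fitness[s] for s in freqs)
    return {s: freqs[s] * fitness[s] / mean_fitness for s in freqs}

# Hypothetical two-strategy group in which A is currently fitter than B.
freqs = {"A": 0.5, "B": 0.5}
fitness = {"A": 2.0, "B": 1.0}
next_freqs = replicator_step(freqs, fitness)
```

Here the frequency of A rises from 1/2 to 2/3 in one generation, while the frequencies still sum to one. Any other rule under which higher fitness means faster growth would qualify equally well as an evolutionary dynamic.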

The mechanisms driving change of behaviors vary with the domain of 
application. In biology it is typically assumed that the mechanism is ge- 
netic (Dawkins 1989). In the social sciences, evolutionary game theory 
postulates that behavioral processes, such as learning through, for ex- 
ample, imitation or socialization (Axelrod 1984; Gale, Binmore, and Sa- 
muelson 1995; Boyd and Richerson 1985; Cabrales 1993), are the driving 


% For a detailed discussion of the connection between the ideas of bounded rationality 
(Simon 1957; March and Simon 1958) and evolutionary game theory, see Mailath (1992).
forces. Thus individuals learn to discard behaviors (strategies) that yield
low payoffs and switch to strategies with high payoffs.” 
Equilibria.—The “dual” of dynamics is stability, for the standard notion 
of evolutionary stability is tightly connected to a dynamical analysis. Un- 
der what conditions should we consider a strategy to be evolutionarily 
stable? Consider a group where all individuals play the same strategy; 
let’s call it a “native” strategy. Because everyone is using the same strategy, 
everyone gets the same payoff; hence the ecology is in equilibrium. Now 
perturb it slightly by allowing a few new behaviors to invade the group. 
(These new strategies are often called “mutants,” reflecting the biological 
roots of evolutionary game theory.) An evolutionarily stable strategy is 
one that, once it is sufficiently common in a group, can resist this kind 
of small perturbation or invasion. More precisely, strategy i is
evolutionarily stable if there exists an ε* > 0 such that for all ε < ε* the population
playing the native strategy i can resist any ε-size invasion of mutants.
Intuitively, “resist an invasion” may be understood in two ways: 
stronger, if after the invasion the invaders decrease in frequency under 
the evolutionary dynamic, and weaker, if they do not increase. Since in 
iterated games the best the native strategy can do is prevent the mutant 
from spreading, weak stability is the only type of stability attainable in 
repeated interactions (Selten 1983; van Damme 1987). More specifically, 
we will call a strategy weakly stable (Bomze and van Damme 1992) if it 


5 It is sometimes said that assuming that players know each others’ strategies is
unrealistic and hence conclusions based on this assumption are of dubious value. We
would like to briefly address this important concern. The root of the problem turns
on the assumption which allows any theoretically possible strategies into the analysis. 
Clearly, some strategies are much too complex to be considered as reasonable models 
of human behavior. This, however, is not true of all strategies; many are exceedingly 
simple. TFT, e.g., is defined by two very simple properties, reciprocity and retaliation, 
and if a player is using TFT it is quite realistic to assume that her partner knows
what she is doing. And in general, if strategies are simple there is nothing unrealistic
in assuming that players know each others’ strategies. Indeed, the common knowledge
of norms is an essential part of a group’s culture. Thus, it is natural to think about a 
group in which it is common knowledge that its members are, for instance, “recip- 
rocating” and “retaliatory” even if one of these properties remains latent (e.g., players 
who always cooperate with each other would never retaliate). That is, it is quite natural 
in such cases to assume that players may know what to expect of others even if they 
have never interacted before. Hence if we confine our attention to a set of cognitively 
feasible, simple strategies we find nothing unrealistic about the assumption that players 
know each others’ strategies. Moreover, it is important to point out that our analysis 
will produce equilibria that can be supported by such “simple” strategies. Thus allowing 
all theoretically possible strategies into the analysis ultimately has no bearing on the 
empirical soundness of the result. Another way of dealing with this problem is to 
assume that players do not use strategies but rather simple behavioral rules of thumb 
which are not “complete plans of the game.” We develop such a theory elsewhere 
(Bendor and Swistak 2000). 
does not decrease in frequency in any group (with a finite number of
strategies) where its frequency is sufficiently high.** In this case certain 
mutants may have the same fitness as the native and hence may remain 
in the population indefinitely. 

Finally, it is important to emphasize that the fundamental evolutionary 
postulate does not define a single process; instead, it defines a class of 
processes. Thus a native strategy may be stable under one dynamic or a 
class of dynamics, yet unstable under another. Hence sweeping claims 
that “strategy x is evolutionarily stable” always have to be predicated on 
the class of dynamics under which x is, in fact, stable. 

It is also important to note that the form of dynamics is directly related 
to certain fundamental social characteristics of people in the group. The 
rate at which an individual adjusts his behavior can be thought of as a 
measure of the inertia of the norms he holds. This inertia is affected by 
two principal determinants of human behavior—the sociological factor 
of group conformity and the economic factor of payoffs. Thus a pure 
Homo sociologicus would completely disregard payoff considerations and 
would only react to the conformity factor, or the proportion of the group 
that shares his norm—the more people share his norm, the higher his 
utility for the norm. On the other extreme there is Homo economicus, a 
player who reacts only to payoff considerations—the higher the payoff to 
a norm, the higher his utility for the norm. (See Bendor and Swistak 
[1997] for a more detailed discussion and a model of these effects.) In 
between the two ideal-types of Homo sociologicus and Homo economicus 
lies a large spectrum of behaviors where players’ incentives are partly 
affected by conformity and partly by payoffs. A specific form of a dynamic 
process is a function of the specific set of incentives held by people in the 
group. The dynamic will be different if all actors are the pure Homo 
sociologicus type, or if they all are Homo economicus, or if they all have
mixed incentives, or if a group consists of some Homo sociologicus, some 
Homo economicus and some mixed types, and so on. 

Robust equilibria.—There is, however, one happy circumstance in 
which the composition of individuals’ incentives in a group does not 
matter. Suppose strategy i is unbeatable: it gets the highest payoff once
it is sufficiently common. More precisely, consider any group in which
strategy i is played with frequency p_i. We refer to i as unbeatable if there
is a p* < 1 such that for any group where p_i > p*, V(i) ≥ V(j) for any
strategy j. Clearly, when i is unbeatable it must also be (weakly) stable
under all evolutionary processes. And it is easy to see that the reverse
claim holds as well: if a sufficiently frequent strategy is weakly stable 


* Others have referred to a weakly stable strategy as semistable (Selten 1983), neutrally 
stable (Sobel 1993), or neutral evolutionarily stable strategy (Warneryd 1993). 
under all evolutionary processes, it must be unbeatable. Hence, the existence
of an unbeatable strategy confers two great methodological boons: first, 
it is unnecessary to specify a replication process in order to study the 
qualitative properties of stability; second, the stability of an unbeatable 
strategy is unaffected by changes in process specification. No matter what 
specific dynamic holds in a given domain, once an unbeatable strategy 
has become sufficiently common, it will not decrease. 

There is, however, much more to the idea of seeking stability under all 
processes than a simple consideration of methodological robustness; this 
type of stability may be necessary for an equilibrium to be descriptively 
meaningful. There are at least two reasons for this claim. First, because 
how strategies replicate reflects the way people adjust their behavior and 
because different individuals adjust in different ways and may change 
the way they adjust, an equilibrium that cannot be sustained under all 
processes may be highly unstable. Second, a mix of learning and imitation, 
in the face of both social and economic incentives, may affect different 
people in different ways that are often complex and hard to pin down. 
Hence, assuming that the process is described by a specific type of dy- 
namics seems infeasible. Empirically it may be impossible to discern pre- 
cisely how strategies replicate or what equations approximate the dynam- 
ics. In conclusion then, equilibrium states that are empirically meaningful 
may have to be states that are stable under all processes. We will refer 
to strategies that are unbeatable, or equivalently (weakly) stable under 
all processes, as uniformly stable. It is this type of stability that we will 
seek to establish below. It is interesting that the answer to the question, 
Why are there norms? is tightly linked to the concept of uniform stability. 


THE ESSENCE OF THE PROBLEM AND THE CONCEPT OF A 
SOCIAL NORM 


Norms are ordinarily thought of as rules specifying actions regarded by 
a group as proper (prescriptive norms) or improper (proscriptive norms). 
It is typical, hence, to understand norms as codes of behavior regardless 
of whether this behavior involves merely two individuals” or more. 
While we have no argument with such a general conception of norms, 
we would not find the emergence of some rules belonging to this class 
particularly puzzling. Take the example of PD. It is not surprising that 
two actors locked in a long-term relationship may choose to cooperate in 
this game. It is quite clear that mutual cooperation can be supported in 


™ It is common to talk about a norm of “reciprocity” or a norm of “retaliation” (Axelrod
1984; Bicchieri 1993; Ullmann-Margalit 1977), a “TFT” norm of behavior, etc.
equilibrium by, for instance, a pair of TFT strategies: if one player believes
his partner is playing TFT, he cannot improve his payoff by playing 
anything but TFT himself (Axelrod and Hamilton 1981; Axelrod 1981, 
1984). This then explains the norm of “reciprocity” and “retaliation” both 
of which are properties of TFT. Since this explanation is pretty straight- 
forward, the associated norms are not particularly surprising. It is norms 
that cannot be explained this way that are puzzling. 

Hence, we see the essence of the puzzle of norms as turning on ones
that make us reward and punish other players not for what they did to
us but for what they did to other members of the group. In general, we 
will say that a behavioral rule followed by i is a social norm if i conditions 
his behavior toward some j not only on what has transpired between i 
and j but also on other interactions in the group to which i and j belong. 
A norm that says that a friend of your friend should be your friend or 
one that says that a foe of your foe should be your friend are examples 
of such rules. By the very nature of the interactions involved these types 
of rules cannot be reduced to pairwise interactions; they are intrinsically 
group phenomena. Thus social norms are social sine qua non. These are 
the types of norms that we want to explain. 

Consider a very specific instance of the general problem. Take, for 
example, three people, A, B, and C, and assume that they all interact 
with one another (pairwise) in an IPD. Suppose that initially all actors 
cooperate with others in all interactions: A cooperates with B, B coop- 
erates with C, and A cooperates with C. Assume, however, that at some 
point A starts defecting toward B whereas B loyally continues to cooperate 
with A. Assume, moreover, that as B is being exploited by A, cooperation 
between A and C and between B and C continues. C’s problem is now 
clear: should C punish A (by defecting toward A) for what A does to B 
or not? The norm that says that foes of your friends ought to be your 
foes requires that C defect toward A. But if C punishes A for what A 
does to B, C may jeopardize the beneficial cooperative exchange she has 
with A. By complying with the social norm, C would be decreasing her 
expected payoff—it may not be rational for C to do so. Indeed some have 
claimed (Elster 1989a) that the very essence of such norms is that they 
cannot be explained by rational considerations. It is precisely this type of 
behavior—one that involves third party sanctions and rewards—that is 
the focus of this article. Thus when we pose the question, Why are there 
norms? it is norms in this specific sense of “social norms” that we have 
in mind. To remind the reader about this qualified meaning we will some- 
times say social norms instead of norms. Whether we say it or not, how- 
ever, the article focuses on this specific sense of norm. 
THE PROBLEM: FREE RIDING ON PUNISHMENT AND DEFENSE


If we look more closely at the nature of the problem that faces the three 
actors in the example above, we would discover that complying with the 
norm creates a standard public good problem, also known as the second- 
order collective good problem (e.g., Axelrod 1986; Coleman 1990a; Flache 
and Macy 1996; Hardin 1995; Heckathorn 1990; Oliver 1980). Everyone 
is better off if the norm is obeyed, yet no one has an incentive to comply. 
The public good, in this case, is a code of behavior—a norm. 

The essence of the problem is best explained by an example (Boyd and 
Lorberbaum 1987) which extends the story of players A, B, and C above. 
Suppose people play each other in pairwise games of the iterated prisoner’s 
dilemma (IPD). Initially everyone plays TFT. Because everyone is using 
a nice strategy—one that never defects first (Axelrod 1984, p. 33)—each
player cooperates with everyone else in all periods. Therefore TFT’s prov- 
ocability, its readiness to punish defection, is never tapped. Because a 
latent property may decay, some TFT strategies may mutate into less 
provocable ones, say TF2T (cooperate in periods 1 and 2 and then defect
if and only if your opponent defected in the previous two periods). Now
most people are playing TFT; a few use TF2T. Because TF2T is, like 
TFT, nice, everyone continues to cooperate with all partners; hence the 
introduction of this new strategy is, as yet, unobservable. (Henceforth, 
invaders that are observationally indistinguishable from the native strat- 
egy will be called neutral mutants.) 

Now a behaviorally distinct strategy, STFT, appears. The nonnice 
cousin of TFT, STFT defects in period 1 and thereafter reciprocates its 
partner’s previous move. Because STFT is not nice, it is a genuine threat 
to the community’s code of constant cooperation. TFT responds to this 
deviation by punishing STFT in period 2. Unfortunately, this triggers a 
vendetta, with STFT defecting in odd periods and TFT in even ones. 
Thus, though TFT’s punishment is effective—if most people use TFT 
then STFT will perform the worst in the ecology—it also hurts TFT: a 
vendetta is not the best reply to STFT, if the future matters enough. (More 
precisely, TFT’s behavior is not a best response if δ, the continuation
probability, is above a critical threshold and 2R > T + S.) Under these
conditions, the best response to STFT is to ignore its first defection and 
to cooperate thereafter—exactly how TF2T behaves. 
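
The threshold can be made explicit by comparing only the two lines of play discussed here, with period-t payoffs weighted by δ^(t-1). This is a sketch of that comparison under the standard PD ordering T > R > P > S, not a derivation taken from Boyd and Lorberbaum, and it does not by itself rule out other replies to STFT.

```latex
\begin{align*}
V(\mathrm{TFT},\mathrm{STFT})
  &= S + \delta T + \delta^{2} S + \delta^{3} T + \cdots
   = \frac{S + \delta T}{1-\delta^{2}},\\
V(\mathrm{TF2T},\mathrm{STFT})
  &= S + \delta R + \delta^{2} R + \cdots
   = S + \frac{\delta R}{1-\delta},\\
V(\mathrm{TF2T},\mathrm{STFT}) > V(\mathrm{TFT},\mathrm{STFT})
  &\iff \delta\,(R - S) > T - R
  \iff \delta > \frac{T-R}{R-S}.
\end{align*}
```

The condition 2R > T + S is equivalent to (T - R)/(R - S) < 1, so it guarantees that some continuation probability below one clears the threshold.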

Thus in this case, the burden of defending a “nice” code falls on players 
using TFT; those using TF2T free ride on this enforcement. The result 
of this free riding is that, if δ exceeds the critical threshold, TF2T obtains
the highest payoff: V(TF2T) > V(TFT) > V(STFT).
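
The ranking can be checked numerically. The sketch below is an illustration only: the payoffs T = 5, R = 3, P = 1, S = 0, the frequencies (90% TFT, 5% TF2T, 5% STFT), the value δ = 0.9, and the function names are assumptions, and fitness is taken to be the frequency-weighted average of pairwise payoffs, V(i) = Σ_j p_j V(i,j), which is one common way to scale the sum of pairwise payoffs to a large group.

```python
from typing import Callable, Dict, List

# Assumed PD payoffs satisfying T > R > P > S and 2R > T + S.
T, R, P, S = 5.0, 3.0, 1.0, 0.0
PAYOFF = {("C", "C"): R, ("C", "D"): S, ("D", "C"): T, ("D", "D"): P}

Strategy = Callable[[List[str], List[str]], str]

def tft(own: List[str], other: List[str]) -> str:
    """Cooperate first, then copy the partner's previous move."""
    return "C" if not other else other[-1]

def tf2t(own: List[str], other: List[str]) -> str:
    """Defect only after two consecutive defections by the partner."""
    return "D" if len(other) >= 2 and other[-1] == other[-2] == "D" else "C"

def stft(own: List[str], other: List[str]) -> str:
    """Suspicious TFT: defect first, then copy the partner's previous move."""
    return "D" if not other else other[-1]

def pair_payoff(s_i: Strategy, s_j: Strategy, delta: float,
                horizon: int = 500) -> float:
    """Approximate V(i, j) with period-t payoffs weighted by delta**(t-1)."""
    h_i: List[str] = []
    h_j: List[str] = []
    total, weight = 0.0, 1.0
    for _ in range(horizon):
        m_i, m_j = s_i(h_i, h_j), s_j(h_j, h_i)
        total += weight * PAYOFF[(m_i, m_j)]
        h_i.append(m_i)
        h_j.append(m_j)
        weight *= delta
    return total

def fitnesses(strategies: Dict[str, Strategy], freqs: Dict[str, float],
              delta: float) -> Dict[str, float]:
    """V(i) = sum over j of p_j * V(i, j) (assumed frequency weighting)."""
    return {i: sum(freqs[j] * pair_payoff(strategies[i], strategies[j], delta)
                   for j in strategies)
            for i in strategies}

strategies = {"TFT": tft, "TF2T": tf2t, "STFT": stft}
freqs = {"TFT": 0.90, "TF2T": 0.05, "STFT": 0.05}  # hypothetical small invasion
V = fitnesses(strategies, freqs, delta=0.9)
V_low = fitnesses(strategies, freqs, delta=0.5)
```

With these assumed numbers, `V` reproduces the ordering V(TF2T) > V(TFT) > V(STFT). In `V_low`, where δ = 0.5 falls below the threshold implied by these payoffs, TFT scores higher than TF2T, so the free-riding advantage depends on the future mattering enough.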

The dynamic implications of this outcome depend, naturally, on the 
specifics of strategy replication, on how players adapt. Assume, for
instance, that players in the current generation are motivated only by payoff
considerations and they adjust by switching to the strategy that did best 
in the previous generation, hence imitating the winner. If this is how 
players adjust their strategies, in the next generation everyone in the 
community will play TF2T. And so the simple dynamic of imitating the 
winner destabilizes TFT, as its willingness to punish miscreants proves 
costly. Thus this famous strategy would not replicate itself in this selection 
environment. 

Further, this example generalizes enormously, as has been shown by 
Bendor and Swistak (1998) in the following theorem 1. This result contains 
two terms that need defining. First, a one-shot game is trivial if it has an 
action that yields the game’s maximal payoff no matter what the other 
player does. Second, a dyadic strategy i is one that bases its behavior 
toward another strategy j only on the history of play between i and j. 
Information about the history between j and a third strategy k is never
used by i in its conduct toward j. (As a convenient shorthand, we will
sometimes ignore the difference between a strategy and a player using 
that strategy.) If strategy i is not dyadic, then its conduct toward j will 
at least sometimes be based on what has happened outside the (i,j) re- 
lationship. Hence we will call these social strategies.” 

THEOREM 1 (Bendor and Swistak 1998).—In any repeated nontrivial
game in which the future is sufficiently important no pure dyadic strategy 
is uniformly stable. 

Because trivial games have no strategic interdependence whatsoever, 
they are trivial in every respect. Practically, then, the theorem covers every 
kind of two-person interaction that might be of substantive interest to 
social scientists. Thus, in all that follows, we restrict attention to nontrivial 
games. To avoid tedious repetition, statements of results will omit this 
restriction. In the corollary that follows, for instance, the phrase “for any 
repeated game” should be read “for any repeated nontrivial game.” 

To understand why social norms evolve, it is useful to state a corollary 
of theorem 1. 

COROLLARY.—If a pure strategy is uniformly stable in a repeated game
in which the future is sufficiently important, then it must be a social
strategy.

Logically speaking this corollary merely restates the theorem, but it 


” The connection between the formally defined idea of social strategies and the less
formal notion of a social norm should be clear: since a strategy is a formal,
game-theoretic model of a norm, social strategies represent social norms. Dyadic strategies,
which by definition are not based on third-party play, cannot represent social norms. 
(We will sometimes take advantage of this correspondence by slipping back and forth 
between “social norms” and “social strategies” in the text. The distinction is maintained
rigorously in the appendix.) 
emphasizes a different set of strategies: those that encode social norms.
Thus the corollary to theorem 1 gives part of the solution to the question, 
Why are there (social) norms? This part of the answer is simple: if there 
are any strategies that are stable in the face of all evolutionary dynamics, 
including harsh processes driven by extreme economic incentives as when 
everyone imitates the winner, then they must encode some type of social 
norm. 


WHY MUST FOES OF YOUR FRIENDS AND FRIENDS OF YOUR 
FOES BE YOUR FOES? A THEORY BEHIND THE FOLK WISDOM 


By showing that social norms are necessary for stability, we have cor- 
roborated half of Sugden’s (1989, p. 97) conjecture that norms “have 
evolved because they are more successful at replicating themselves than 
other patterns [of behavior].” To corroborate the rest of his conjecture, 
we must show now that in these same settings, there are social norms 
that do replicate reliably. We begin with an example and generalize it 
below. 

Let us return to the previous example of Boyd and Lorberbaum to see 
what kind of norm or strategy might work. For simplicity we replace 
TFT by the unforgiving grim trigger (GT); the mutants continue to be 
TF2T and STFT.” As in the original example, the native loses to TF2T 
because the latter maintains a cooperative relationship with both of the 
other strategies, whereas GT and STFT are locked in mutual defection 
from period 3 on. If, however, the native would punish TF2T for tolerating 
STFT’s uncalled-for defection in period 1, then the native would out- 
perform both mutants. A norm that would require the native to do that 
is easy to imagine. Consider a new native, GT’, which modifies the old 
one in the following way: This native regards any strategy that is never 
the first to defect in a group as socially nice (all nice strategies are socially
nice but, as we shall see, some socially nice strategies are not nice). GT’ 
is itself socially nice, and it defects forever against any strategy that either 
is not socially nice or that fails to punish one that is not socially nice. 
Thus GT’ defects from period 2 onward against STFT, and it punishes 
TF2T from period 3 onward because TF2T failed to sanction STFT,
which is not socially nice. It is easy to show that, given any sufficiently 
small invasion of any two distinct mutants, GT’ will have the highest 
fitness. (If both invaders are socially nice, then all three strategies are 
observationally indistinguishable and so will be equally fit.) The new


” GT is defined as cooperate in period 1, and cooperate in period k > 1 if and only if
the partner has cooperated in all previous periods. 
native, by being socially nice but not nice—it defects first against
TF2T—has the necessary flexibility to outperform TF2T. 

But this is not a general solution. GT’ can still be beaten by an arbitrarily 
small invasion. Ironically, the cause of its downfall is its “ancestor,” GT. 
Consider a small invasion by TF2T, STFT, and GT. GT behaves just as 
GT’ requires: it always cooperates with the native, and it immediately
punishes the strategy that is not socially nice, STFT. Therefore the native 
and GT cooperate forever. However, because GT is nice, whereas GT’ is 
only socially nice, the former cooperates constantly with TF2T, whereas 
the latter sinks into continual mutual defection with TF2T from period 
4 onward. Hence if the future is sufficiently important, GT will outscore 
its more sophisticated kin. 

Consequently, in order to outdo GT as well, the native must include a 
higher order rule, punishing GT for failing to punish TF2T’s failure to 
punish STFT. Call this new native GT’’. But the reader has probably
guessed that GT’’ is itself vulnerable to an invasion that includes the
previous three mutants plus the ancestor of GT’’—GT’. By the preceding
logic, GT’, by punishing TF2T and STFT but maintaining good relations
with GT, will score higher than GT’’ no matter how small the invasion.
But this will destabilize the native strategy, and the reasoning enters 
another loop. 

It is evident, therefore, that for this kind of strategy to be uniformly 
stable, it must have a metanorm structure, punishing all nth-order
deviations from its code. Player 1, who failed to punish player 2, who failed
to punish player 3, and so on, must also be sanctioned. It is of course 
logically possible to construct such codes. But doing so may not only seem 
a rather daunting violation of the spirit of evolutionary game theory, which 
is informed by a strong dose of bounded rationality (Mailath 1992); it also 
looks completely utopian from the perspective of any model of human 
cognition and behavior. 


HOW FRIENDS BECOME FOES—A NORM THAT SOLVES THE 
PROBLEM OF COMPLEXITY 


The answer to the perplexing information problem that would require 
constant monitoring of the entire network of interactions and perpetual 
analysis of its complex metanorm structure lies in the way Homo sociol- 
ogicus organizes information and acts upon it. The answer is a specific 
social norm. This norm corresponds to a very well known social 
mechanism. 

The essence of the solution is exhibited by the following new strategy 
that we shall call conformity (CNF). This strategy is based on a binary 
categorization of all players in a group (Heider 1958; Lévi-Strauss 1974).
In every period every player is considered either a friend or a foe. In the 
beginning, before play starts, everyone is deemed a friend. If someone is 
a friend in period t, they remain a friend in t + 1 if and only if they
cooperated with all other friends and defected against all foes in period 
t. Anyone who ever violates these rules becomes a foe thereafter. The 
following result is then easily established. 

PROPOSITION 1.—If the future is sufficiently important, CNF is uni- 
formly stable in the IPD. 

The proof is straightforward. For any invasion, either a mutant j be- 
haves exactly the same as CNF does or it behaves differently with at 
least one partner in at least one period. If the former, then j must obtain 
the same payoff as the native. If the latter, then j must have either defected 
toward a friend or cooperated with a foe, at some date t. In either case,
CNF will punish it thereafter. Given CNF’s response, the best that j can 
do is to defect against CNF from t onward. For the standard reasons, 
when the future is sufficiently important it is better to cooperate always 
with CNF than it is to cooperate in a finite number of periods, gain a 
one-time advantage of the temptation payoff T, and then get stuck in
mutual defection thereafter. Hence CNF does better with itself than does 
any such j, which for sufficiently small invasions ensures that 
V(CNF) > V(j). Therefore CNF is uniformly stable.
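
The friend-foe bookkeeping behind CNF is short enough to simulate directly. The following sketch is an illustration under stated assumptions, not a proof: the payoffs T = 5, R = 3, P = 1, S = 0, the value δ = 0.9, the group of eight CNF natives invaded by one TF2T and one STFT player, and all names are hypothetical. Standing is treated as public, and a player's score is the discounted payoff summed over all partners.

```python
from itertools import permutations

# Assumed PD payoffs; illustrative values only.
T, R, P, S = 5.0, 3.0, 1.0, 0.0
PAYOFF = {("C", "C"): R, ("C", "D"): S, ("D", "C"): T, ("D", "D"): P}

def tf2t(own, other):
    """Dyadic mutant: defect only after two consecutive defections."""
    return "D" if len(other) >= 2 and other[-1] == other[-2] == "D" else "C"

def stft(own, other):
    """Dyadic mutant: defect first, then copy the partner's previous move."""
    return "D" if not other else other[-1]

def simulate(types, delta=0.9, horizon=300):
    """Play all pairs each period; entries of `types` are "CNF" or a dyadic
    strategy function. Returns each player's discounted total payoff."""
    n = len(types)
    friend = [True] * n  # before play starts, everyone is deemed a friend
    # hist[i, j] records the moves of player i toward player j.
    hist = {(i, j): [] for i, j in permutations(range(n), 2)}
    score = [0.0] * n
    weight = 1.0
    for _ in range(horizon):
        move = {}
        for i, j in hist:
            if types[i] == "CNF":
                # CNF cooperates with friends and defects against foes.
                move[i, j] = "C" if friend[j] else "D"
            else:
                move[i, j] = types[i](hist[i, j], hist[j, i])
        for (i, j), m in move.items():
            score[i] += weight * PAYOFF[m, move[j, i]]
        # A friend stays a friend only if it cooperated with every friend and
        # defected against every foe this period; a foe remains a foe.
        friend = [
            friend[i] and all(move[i, j] == ("C" if friend[j] else "D")
                              for j in range(n) if j != i)
            for i in range(n)
        ]
        for key, m in move.items():
            hist[key].append(m)
        weight *= delta
    return score

group = ["CNF"] * 8 + [tf2t, stft]
scores = simulate(group)
```

In this run STFT becomes a foe after period 1 for defecting against friends, and TF2T becomes a foe after period 2 for cooperating with a foe, so each CNF native ends with a higher score than either mutant. Note that each player's standing tomorrow is computed only from today's standings and today's moves, which is the limited record keeping the strategy requires.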

CNF has several appealing behavioral and cognitive properties that 
correspond to some well-known social mechanisms. It embodies, for in- 
stance, an ancient political logic that is very intuitive: the friend of my 
foe is my foe, the foe of my friend is my foe, and the friend of my friend 
is my friend.” (One might view CNF as trying to impose Heider’s balance 
principles [1958, pp. 200-207], or perhaps more precisely, Davis’s [1967] 
cluster principles, on its environment.) Thus at any given time the strategy 
partitions the entire community into two camps: allies and enemies. From 
a cognitive perspective, CNF, although it embodies a complete metanorm 
structure, is simple enough to be represented as a strategy of little more 
complexity than tit for tat: to know a partner’s standing in the community 
tomorrow one only need keep track of that partner’s status today and, if 
the partner is currently a friend, his current actions. Social mechanisms 
like gossip greatly reduce the burden of the second requirement—news 
about defections among friends spread amazingly quickly in some very 
large and complex networks. (Due to its unforgiving nature, if the partner 


» Note, however, that CNF does not impose the fourth relation of this logic, i.e., it
does not mandate that foes of foes are friends. This is fortunate for the empirical status 
of the theory since there is more evidence for the first three relations than there is for 
the claim that foes of foes must be friends (e.g., Taylor 1970, p. 204). 
is a foe today, CNF does not need to observe that player’s current actions
in order to know his status tomorrow.) Thus the cognitive demands that 
solve the seemingly hopeless problem of complexity turn out to be sur- 
prisingly weak, well within the demands of bounded rationality. 


“COVENANTS WITHOUT THE SWORD ARE BUT WORDS?” 


CNF in the context of the PD has a number of appealing properties and 
provides several invaluable insights. It clearly illustrates how a social 
norm can work and proves that a uniformly stable strategy exists in at 
least one game (the IPD). Of course, the problem is much more general, 
going far beyond the logic of the IPD. Fortunately, it turns out that one 
can use CNF’s basic logic to establish that uniformly stable social strat- 
egies exist in several large classes of games. 

CNF has three key elements: (a) a friend-foe categorization of all part- 
ners which is applied recursively; (b) a socially desired action and a pun- 
ishment action in the one-shot game; and (c) a rule of playing the desired 
action toward friends and punishing foes. This logic carries over, with a 
few modifications, to almost all types of two-person games. Consider 
games that are symmetric.” In each stage game there is a finite set of 
actions a_1, ..., a_m. For notation, v(a_s, a_t) denotes the payoff obtained by
playing action a_s in the stage game against action a_t, and max_s v(a_s, a_t) is
the payoff of the best response to a_t. The subset of symmetric games
central to our next result is defined by two of Hobbes’s essential elements: 
first, covenants are possible; second, there are swords for sanctioning 
people who break covenants. To see how these two elements are related 
in symmetric games, focus on those payoffs produced when the players 
take identical actions (payoffs on the main diagonal of the payoff matrix), 
and consider an action which yields the highest payoff on this diagonal. 
Since this may reasonably be deemed a “cooperative” outcome, let us 
denote this action as a_c and its payoff—v(a_c, a_c)—as R, to remind us of
the PD notation. Now examine the subset of symmetric games with the
following property: there exists a punishment action, a_d, such that R
exceeds the best response to the punishment, max_s v(a_s, a_d). We will refer to
a game with such a punishment action as a game of enforceable cooper- 
ation. The next result shows that this property—a sword that can enforce 


* A one-shot game is symmetric if its payoff matrix is symmetric. the players have 
the same action sets and reversing the actions reverses the payoffs Further, it is 
typically assumed that the game is txformationally symmetric as well: players have 
no information that would allow them to distinguish each other as being in different 
positions, i.e., one as being the Row player and the other, Column. 
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a covenant—is decisive for the existence of pure uniformly stable strategies 
and the norms they encode. 

THEOREM 2.—Pure uniformly stable strategies exist in a repeated symmetric 
game with sufficiently important future if and only if the stage game 
is one of enforceable cooperation. 

It is easy to see why sufficiency holds. If the crucial payoff condition 
holds, then one can construct a native exactly on the lines of CNF: cooperate 
(play $a_c$) toward friends and punish foes with $a_d$. If anyone ever 
breaks these rules, they are a foe from the next period onward. This 
conduct implies that if a mutant ever acts differently with any partner 
than the native does, the native will punish it ever afterward, ensuring 
that the native does better with other natives than the mutant does with 
the native (when the future is sufficiently important). Hence the native’s 
overall fitness must exceed the mutant’s, if the natives are sufficiently 
numerous. Proving necessity is more troublesome; see the appendix for 
the proof. 

Many repeated symmetric games studied by social scientists are covered 
by the above result. For example, in a game of iterated chicken, a uni- 
formly stable social norm is to always be conciliatory with friends and 
always be aggressive toward foes. Since the payoff from mutual concil- 
iation exceeds the value of the best response to aggression, a uniformly 
stable pure social strategy exists in this game.* 
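
As an illustration that is not part of the original analysis, the enforceable-cooperation condition can be checked mechanically for a small stage game. The sketch below assumes the payoff matrix is stored so that `payoffs[i][j]` is $v(a_i, a_j)$. The PD uses Axelrod's payoffs (T = 5, R = 3, P = 1, S = 0); the chicken payoffs are made-up values chosen only to respect the usual chicken ordering. The function names are hypothetical.

```python
from typing import List, Sequence

Payoffs = Sequence[Sequence[float]]  # payoffs[i][j] = v(a_i, a_j)


def cooperative_payoff(payoffs: Payoffs) -> float:
    """R: the highest payoff on the main diagonal, v(a_c, a_c)."""
    return max(payoffs[i][i] for i in range(len(payoffs)))


def punishment_actions(payoffs: Payoffs) -> List[int]:
    """Indices d for which R exceeds the best-response payoff against a_d."""
    n = len(payoffs)
    r = cooperative_payoff(payoffs)
    return [d for d in range(n) if r > max(payoffs[i][d] for i in range(n))]


def is_enforceable_cooperation(payoffs: Payoffs) -> bool:
    """A stage game is one of enforceable cooperation if it has at least one sword."""
    return bool(punishment_actions(payoffs))


# Prisoner's dilemma, Axelrod's payoffs; actions: 0 = cooperate, 1 = defect.
PD = [[3, 0],
      [5, 1]]

# Chicken with assumed payoffs; actions: 0 = conciliate, 1 = aggress.
CHICKEN = [[3, 2],
           [4, 0]]

print(punishment_actions(PD))       # [1]: defection is the sword
print(punishment_actions(CHICKEN))  # [1]: aggression is the sword
```

In both games the only punishment action is the "tough" one, which matches the friend-foe norms described above: cooperate or conciliate with friends, defect or aggress against foes.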

For a symmetric game that is not a game of enforceable cooperation, 
consider a common problem in institutions, the division of labor. The 
game illustrated in figure 2 presumes that the organizational goal will be 
reached if and only if the players successfully divide the labor and thus 
complete the task. Conflict arises because one job confers higher status 
than the other. Thus, although each player prefers performing the low- 
status task to not attaining the goal, each prefers the structure in which 
he or she is top dog. 

Although this game is symmetric in terms of actions and payoffs, the 
“name of the game” is specialization. Hence any pure native gets a low 
payoff when it plays itself. It is therefore easily invaded by a pure mutant 
that always takes the action that complements the native’s choice.** 
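
A small numerical sketch can make the invasion argument concrete. The payoffs below are hypothetical (they are not the entries of fig. 2) and are chosen only to respect the stated preferences: failing the task is worst, and each player would rather hold the high-status job. The helper names are likewise illustrative, and the comparison is made at the level of a single stage, for a native that takes the same action as its clone.

```python
# Hypothetical stage payoffs for a division-of-labor game (not taken from fig. 2).
# Actions: 0 = take the high-status job, 1 = take the low-status job.
# PAYOFFS[i][j] is the payoff to a player choosing i against a partner choosing j.
PAYOFFS = [[0, 3],   # both grab the high-status job: the task fails
           [1, 0]]   # both take the low-status job: the task fails


def has_sword(payoffs):
    """True if some action d satisfies R > max_i v(a_i, a_d)."""
    n = len(payoffs)
    r = max(payoffs[i][i] for i in range(n))
    return any(r > max(payoffs[i][d] for i in range(n)) for d in range(n))


def complement_gain(payoffs, native_action):
    """Best-reply payoff against the native minus the native's payoff against itself."""
    best_reply = max(row[native_action] for row in payoffs)
    return best_reply - payoffs[native_action][native_action]


print(has_sword(PAYOFFS))                             # False
print([complement_gain(PAYOFFS, a) for a in (0, 1)])  # [1, 3]
```

Under these assumed payoffs no action works as a sword, and whichever job a pure native takes, a mutant that takes the complementary job earns strictly more against the native than the native earns against its own clone.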


* There is a natural interpretation to punishing doves or “softies” in chicken: if aggressors 
are true invaders and softies are weak-kneed allies who refuse to fight them, 
then punishing doves is a case of moralistic aggression (Trivers 1971; Frank 1988). 

** The division-of-labor game illustrates contexts which we might expect to become 
informationally asymmetric. As Sugden (1986) suggests, even though the game is sym- 
metric substantively—the payoff matrix of the one-shot game is symmetric—players 
might recognize seemingly superficial differences in their positions. It is not hard to 
show that if the game is informationally asymmetric, then pure uniformly stable strat- 
egies do exist in games such as those in fig. 2. The case of asymmetric games is also 
Fig. 2.—The division-of-labor game 


GENERALIZING CNF’S PROPERTIES: THE ESSENTIAL FEATURES 
OF UNIFORMLY STABLE NORMS 


Having shown that uniformly stable strategies and their corresponding 
norms exist in a wide variety of games, it is time to analyze their essential 
properties. A natural place to start is with our old standby CNF, which 
has proven so useful in providing insights into the nature of uniformly 
stable strategies. 

The proof that CNF is uniformly stable does not imply that all uni- 
formly stable strategies must be similar to CNF. In particular, it is im- 
portant to point out that a uniformly stable norm need not be as relentlessly 
unforgiving as CNF. A strategy that is uniformly stable does not have to 
punish deviants forever for a single violation of a group norm. If the 
punishment is enough to wipe out the violation’s gain and the deviant 
has returned to a proper behavior, a uniformly stable strategy may return 
to mutual cooperation or some other pattern of behavior required by the 
group norm. 

easy to solve. The following counterpart of theorem 2 holds for asymmetric games: 
THEOREM 2'.—Pure uniformly stable strategies exist in a repeated asymmetric game 
with sufficiently important future if and only if punishment is effective, i.e., 
$v(a_c, b_c) + v(b_c, a_c) > \max_i v(a_i, b_d) + \max_j v(b_j, a_d)$ in the stage game. In this formulation 
$a_1, \ldots, a_m$ denote Row’s actions in the stage game; $b_1, \ldots, b_n$ are Column's (m need not 
equal n, but we assume both are finite); $a_c$ and $b_c$ are socially desirable actions, and 
$a_d$ and $b_d$ are punishment actions. 


Naturally, how much punishment suffices (to wipe out a violation’s 
gain and to stabilize the native strategy) depends on the specific payoffs 
of the stage game. For some stage games the payoffs permit the 
corresponding rules of punishment to be particularly simple. For instance, in 
any IPD one can easily construct a uniformly stable strategy i that forgives, 
and does so in a rather natural way. Here is how i works. It uses 
the standard friend-foe categorization and cooperates with friends. If j 
(say) deviates, however, by defecting with i’s friends or cooperating with 
i’s foes, or if i’s foes cooperate with j, then i should punish j until j’s 
gains from violating the norm have been wiped out: as it turns out, in 
the IPD it suffices for j to submit to punishment (cooperate while being 
punished by i) as many times as it had violated the code. (Thus if j 
defected against three friends of i, then it would have to submit to 
punishment three times. In the IPD this would mean receiving three payoffs 
of S, to offset its prior ill-gotten gains of three T’s.) Once j has paid his 
debt, however, the transgressions are forgiven and mutual cooperation 
can be restored. Hence we conclude that the unforgiving quality of CNF 
is not an essential property: a strategy does not have to be unforgiving 
to be uniformly stable.* 
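
The bookkeeping behind "paying the debt" can be sketched with a little arithmetic. The example below is only an illustration: it uses Axelrod's payoffs (T = 5, R = 3, P = 1, S = 0), ignores discounting (that is, it treats the discount parameter as close to one), and relies on the common IPD assumption 2R > T + S. The function name is hypothetical.

```python
T, R, P, S = 5, 3, 1, 0  # Axelrod's IPD payoffs


def deviation_balance(violations: int) -> int:
    """Deviant's total minus a conformist's total over the same 2 * violations periods.

    The deviant collects T in each of `violations` periods and then accepts S
    in the same number of punishment periods; a conformist earns R throughout.
    Discounting is ignored, which is an assumption of this sketch.
    """
    deviant_total = violations * (T + S)
    conformist_total = 2 * violations * R
    return deviant_total - conformist_total


print(deviation_balance(3))  # -3: three T's plus three S's fall short of six R's
```

With these numbers the deviant ends up strictly behind a player who cooperated throughout, so matching each violation with one submitted punishment is enough to wipe out the gain.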

Which, then, of CNF’s properties are really essential? What features 
must a strategy have if it is to be uniformly stable in repeated symmetric 
games? The above remark about other, more forgiving norms suggests 
that few of CNF’s dyadic properties are essential. (This point will be 
reinforced by theorem 6 below, which shows that uniformly stable norms 
need not be “cooperative.”) We already know, however, that to be uni- 
formly stable a pure strategy must be social: it must attend to at least 
some third-party play in at least some periods. CNF, for instance, never 
stops monitoring third-party interactions and does not exclude any player 
from monitoring. (Even foes must be monitored, in their interactions with 
friends.) The next two results show that these two aspects of CNF—its 
temporal endurance and the social breadth of its monitoring—are not 
extreme; they are essential. 

We formalize the definition of these monitoring features (time and social 
space) as follows. First, we will call a strategy perpetually social if it does 
not become dyadic after a finite number of periods. Second, we will call 
strategy i comprehensively social if there is no strategy j and period t in 
the game such that in all periods following t, i’s actions toward other 
players in the group are independent of their interactions with j. Note 
that if this condition did not hold, that is, i were not comprehensively 
social, then following some period t i’s actions toward a certain j would 
have been independent of j’s interactions with other players. But this is 
equivalent to saying that, following period t, i would not need to monitor 
j’s interactions at all. (The proofs of the following results are in the 
appendix.) 


* That being unforgiving is inessential is good news, because this property causes 
problems if partners’ actions are imperfectly observed, as we will see later in this 
article. Fortunately, there are other uniformly stable social norms that are both 
forgiving and which sustain efficient outcomes as the stable state. 

THEOREM 3.—In any repeated game of enforceable cooperation in 
which the future is sufficiently important, all uniformly stable strategies 
must be perpetually social. 

This temporal property is required because otherwise a native could 
be invaded by a neutral mutant that patiently waited for the native to 
stop monitoring indirect pairs. Hence eternal vigilance is the price of 
stability. The next result shows that the norm embodied in a uniformly 
stable strategy must brook no exceptions in the community. 

THEOREM 4.—In any repeated game of enforceable cooperation in 
which the future is sufficiently important, all uniformly stable strategies 
must be comprehensively social.* 

Social comprehensiveness is required because if $A_1$ stops monitoring 
interactions of $A_2$ then player $A_3$ may behave identically as $A_1$ toward 
everyone in the group except $A_2$, with whom he may score higher than 
$A_1$. If this happens, $A_1$’s strategy i would lose to the strategy of $A_3$, 
rendering i not uniformly stable. As noted, CNF monitors all of its partners, 
indefinitely. Theorem 4 shows that the same is required of every uniformly 
stable norm: it must be comprehensively social. 

We now turn to examining the internal logic of uniformly stable norms. 
Again, CNF provides a clue. Recall that any CNF-like strategy that lacked 
a complete metanorm structure—failed to punish some mth-order devia- 
tion—could be invaded. The same logic applies to all uniformly stable 
norms. 

THEOREM 5.—In any repeated game of enforceable cooperation in 
which the future is sufficiently important, every uniformly stable strategy 
must have a complete metanorm structure. 

* In our conceptual framework, strategies (equated here with players) can distinguish 
amongst each other only on the basis of their actions; players lack ascriptive identities. 
Hence if two players have behaved identically toward everyone in the group then 
other strategies must treat them identically since they cannot tell them apart. A natural 
extension of this framework would endow strategies (players) with such ascriptively 
based information, so that they could behave differently toward people who have 
behaved identically (Bendor and Swistak 1999). It is worth noting that theorem 4 still 
holds in this more general formulation. 


The proof rests on the existence of mutants that reply to punishment 
with punishment (i.e., by playing $a_d$). When encountering such invaders, 
the native-punisher gets $v(a_d, a_d)$, which is less than $v(a_c, a_c)$, since it is a 
game of enforceable cooperation. So punishing such mutants is costly. 
Thus there is a potential advantage to free riding on enforcement: If the 
native, i, does not punish j’s failure to punish a lower-order deviation by 
k, then i can be beaten by j, since the latter avoids paying the cost of 
enforcing the code.* 

It is important to emphasize here that although requiring a complete 
metanorm code sounds rather daunting, it is actually easy to implement 
such rules: one simply codes all members of the group as being in either 
good or bad standing and treats them accordingly, thereafter applying a 
straightforward recursive logic to update an individual’s social standing. 
The recursive updating enables the metanorm structure to be complete 
yet simple enough to be implemented even by very boundedly rational 
agents. Certainly CNF, a paradigmatic example of a code with a complete 
metanorm structure, is very simple. 
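
The recursive good-standing update described above can be sketched in a few lines. This is a simplified illustration for the IPD, not the article's formal definition: it implements only the two core rules (cooperate with friends, punish foes; any violator is a foe from the next period on) and omits any further refinement. The player names and helper functions are hypothetical.

```python
from itertools import permutations

C, D = "C", "D"  # cooperate (a_c) and punish (a_d) in the IPD


def update_standing(good, actions):
    """One period of CNF-style bookkeeping.

    good    -- set of players in good standing at the start of the period
    actions -- actions[(x, y)] is what x played toward y this period
    A player keeps good standing only by cooperating with every friend and
    punishing every foe; foes stay foes, since CNF is unforgiving.
    """
    players = {x for pair in actions for x in pair}
    return {
        x for x in good
        if all(actions[(x, y)] == (C if y in good else D)
               for y in players if y != x)
    }


def period(players, overrides):
    """All-cooperate action profile, except for the listed (actor, target) moves."""
    profile = {pair: C for pair in permutations(players, 2)}
    profile.update(overrides)
    return profile


players = ["Ann", "Bob", "Cat"]
good = set(players)  # prior to play everyone is a friend

# Period 1: Ann cheats Bob.
good = update_standing(good, period(players, {("Ann", "Bob"): D}))
after_first = set(good)

# Period 2: Bob punishes Ann, but Cat keeps cooperating with the renegade.
good = update_standing(good, period(players, {
    ("Bob", "Ann"): D, ("Ann", "Bob"): D, ("Ann", "Cat"): D,
}))
after_second = set(good)

print(sorted(after_first))   # ['Bob', 'Cat']
print(sorted(after_second))  # ['Bob']
```

The same one-line test handles violations of every order: Cat's failure to punish Ann is caught by exactly the rule that caught Ann's original cheating, because the rule is always applied to the current standing list.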

Indeed, humans are capable of creating and using far more complex 
codes; CNF, which merely illustrates how to construct a complete yet 
simple metanorm, hardly exhausts what is possible. And in many cir- 
cumstances it is sensible to allow for nuances that CNF ignores. For 
example, higher-order violations may be less important than lower-order 
ones: A’s cheating B in a trade may matter more than C’s continuing to 
cooperate with the renegade A. Hence it might be collectively desirable 
for the metanorm to recognize such distinctions and prescribe punishments 
accordingly: the more serious the violation, the heavier the sanction. (Of 
course, all punishments must be effective in the sense of satisfying theorem 
2’s criterion.) 

Taken together, then, theorems 3-5 show that uniformly stable strat- 
egies must encode social norms in several fundamental ways: they hold 
for everyone in the community, they always hold, and any nth-order 
violation of the code must be punished. 

Note, however, that we have not claimed that all uniformly stable norms 
are functional in the sense of being Pareto efficient. There is a good reason 
for not making this claim: it is false. Efficiency is not an essential property 
of uniformly stable strategies, as we will demonstrate later. 

Even CNF is not strongly stable—but this is not so bad.—So far we 
have only been able to show that uniformly stable strategies like CNF 
are weakly stable. None of our results claimed that strongly stable strategies 
exist. The reason is simple: such strategies do not exist in repeated 
interactions. And it is easy to see why. If all members of a group play 
CNF and some of them mutate into always cooperate (ALL-C; cooperate 
unconditionally in all periods), then both strategies will have the same 
payoff. Indeed, absent other strategies the two will be behaviorally 
indistinguishable. ALL-C will be a neutral mutant of CNF, and no 
evolutionary process can restore the preinvasion state in which everyone plays 
CNF. Consequently, although CNF can ensure that it will never decrease 
under any evolutionary process, in some invasions the mutants cannot be 
eliminated. Strong stability is unattainable. 


* Using the same setting of a population of pairwise IPDs, Bendor and Mookherjee 
(1990) had found that under a Nash equilibrium analysis of norms, only one-level 
codes are necessary. It is interesting to note that changing to an evolutionary framework 
and requiring uniform stability implies the necessity of metanorms. (See Axelrod [1986] 
for a discussion of the importance of metanorms from an evolutionary perspective.) 

However, being only weakly stable is a much less serious problem for 
social strategies than it is for dyadic ones. A dyadic strategy may never 
recover from the random drift of a neutral mutant. For example (Young 
and Foster 1991), suppose a small fraction of a TFT ecology mutates into 
the simplest nice strategy, ALL-C. Since ALL-C is a neutral mutant, it 
can drift randomly in the group. Now suppose the ecology is invaded by 
the simplest nonnice strategy, ALL-D (defect unconditionally in all pe- 
riods). Even though ALL-D’s entrance makes manifest the once-latent 
differences between TFT and ALL-C, TFT—which does not encode a 
social norm—never punishes ALL-C for its deviation of tolerating ex- 
ploitation. Thus Young and Foster’s simulation (1991) of an ecology com- 
posed of TFT, ALL-C, and ALL-D showed that in the long run, random 
drift eventually destroyed cooperation, even if initially almost everyone 
played TFT. The breakdown occurred because ALL-C, at first behav- 
iorally indistinguishable from TFT, could by chance win many converts, 
setting the stage for the predatory ALL-D. 

In contrast, as soon as CNF detects a difference between itself and a 
strategy that had once been a neutral mutant, it punishes the deviant. So 
an ecology dominated by CNF exhibits only two patterns: either everyone 
obeys the code of universal cooperation, whence all strategic differences 
remain latent and unsanctioned, or someone deviates, revealing heretofore 
latent differences, and CNF attacks all overt deviators. As has long been 
argued (Durkheim 1933; Davis 1937; Merton 1956), there is a social func- 
tion to deviance: it clarifies the rules. In our model, deviance makes man- 
ifest what had been latent. Thus, so long as CNF remains sufficiently 
numerous, the social state that it enforces is, in a sense, strongly stable. 

Hence, if one were to rerun the Young and Foster simulation, replacing 
TFT by CNF, and if the process started out with sufficiently many players 
using CNF, then our analytical results imply that cooperation in this 
modified ecology will not break down. Cooperation will be sustained 
because CNF attacks ALL-C as soon as the latter fails to punish 
ALL-D.* Thus the process in which ALL-C takes over by random drift is 
blocked. This prevents the subset of cooperative (socially nice) strategies 
from being weakened by the infiltration of the toothless ALL-C. 


* None of the three strategies can become completely extinct in the Young-Foster 
simulation: there is always a small background mutation rate for each strategy. This 
matters substantively: the constant though rare presence of ALL-D reveals to 
conformity that ALL-C is nonprovocable and hence a deviant. Conceivably, CNF could 

Social norms can still experience random drift, of course. In an ecology 
composed of CNF, TFT, and ALL-D, TFT is indistinguishable from CNF: 
both strategies always cooperate with themselves and with each other, 
and punish ALL-D from period 2 on. 
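
The contrast between a dyadic and a social native can be traced in a toy three-strategy ecology. The sketch below is not a rerun of the Young and Foster simulation (it has no mutation, drift, or payoffs); it only records, period by period, how the native treats ALL-C when ALL-D is also present. It assumes one player per strategy, round-robin play, and a simplified CNF that uses just the two core rules (cooperate with friends, punish foes). The function name is hypothetical.

```python
from itertools import permutations

C, D = "C", "D"


def simulate(native, periods=4):
    """Return the native's moves toward ALL-C, with native in {"TFT", "CNF"}."""
    players = [native, "ALLC", "ALLD"]
    good = set(players)  # CNF's list of friends; everyone starts in good standing
    last = {}            # last[(x, y)]: x's previous move toward y
    history = []
    for _ in range(periods):
        moves = {}
        for x, y in permutations(players, 2):
            if x == "ALLC":
                moves[(x, y)] = C
            elif x == "ALLD":
                moves[(x, y)] = D
            elif x == "TFT":
                moves[(x, y)] = last.get((y, x), C)  # echo the partner's last move
            else:  # CNF: cooperate with friends, punish foes
                moves[(x, y)] = C if y in good else D
        # Standing update: stay a friend only by obeying the code this period.
        good = {
            x for x in good
            if all(moves[(x, y)] == (C if y in good else D)
                   for y in players if y != x)
        }
        last = moves
        history.append(moves[(native, "ALLC")])
    return history


print(simulate("TFT"))  # ['C', 'C', 'C', 'C']
print(simulate("CNF"))  # ['C', 'C', 'D', 'D']
```

TFT never sanctions ALL-C, whereas the CNF native turns on ALL-C in period 3, right after ALL-C fails to punish ALL-D in period 2.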


FUNCTIONALISM REVISITED: MUST NORMS BE OPTIMAL? 


So far all of our examples of uniformly stable norms have sustained a 
collectively optimal outcome. (Our running example, e.g., was that of 
ongoing cooperation in the IPD.) Maintaining optimality in the stable 
state is consistent with the strong functionalist thesis. This, however, is 
only a part of the analytical solution. Now it is important to find out if 
suboptimal outcomes can also be supported in equilibrium. The answer 
to this second part of the problem turns out to be yes. In fact, a whole 
raft of strategies and a wide spectrum of suboptimal states can be sus- 
tained in equilibrium. To see this, we focus once again on games of en- 
forceable cooperation. 

For any particular game, label the actions so that 
$v(a_1, a_1) \le \cdots \le v(a_m, a_m)$. Thus the most “cooperative” (pure) norm would prescribe $a_m$ as 
the socially desirable action, since it yields the highest payoff for a pure 
strategy when it plays itself. (Earlier we referred to this as a cooperative 
action, denoted as $a_c$.) A completely noncooperative norm prescribes 
playing $a_1$. Accordingly, we will often refer to playing $a_m$ as “cooperating” and $a_1$ 
as “defecting.” Take the smallest r (of r = 2,...,m) such that 
$v(a_r, a_r) > v(a_1, a_1)$.* This action, $a_r$, is the minimally cooperative one. For this class 
of games, the degree of cooperation that a strategy induces in the stable 
populational state is a meaningful notion. For example, in the IPD we 
say, following Bendor and Swistak (1997), that a stable state supports x 
degrees of cooperation if x is the limiting proportion of mutually 
cooperative moves as the number of periods goes to infinity. (This index gives 
an expected frequency of cooperative moves in a stable state.) More 
generally, we say that a stable state supports x degrees of $a_t$-cooperation 
(t > r) if x is the limiting proportion of periods in which players choose 
$(a_t, a_t)$ as the number of periods goes to infinity. We then obtain the 
following result. 





be destabilized if ALL-C invaded and then took over the ecology by random drift 
before any ALL-D invaded. Since random drift takes a long time to work, this scenario 
is unlikely. 

* There must be such an r because a game of enforceable cooperation has a punishment 
action $a_d$ such that $v(a_m, a_m) > \max_i v(a_i, a_d)$, which implies that $v(a_m, a_m) > v(a_1, a_1)$. 
THEOREM 6.—In a repeated game of enforceable cooperation in which 
the future is sufficiently important, a uniformly stable state with social 
strategies can support any strictly positive degree of $a_t$-cooperation. 

Thus the only unstable state is the one of complete “defection.”* This 
result shows that the strong functionalist thesis does not hold in our model: 
evolutionary forces do not ensure (Pareto) optimal outcomes. 


FUNCTIONALISM MODIFIED: WHY FUNCTIONAL (EFFICIENT) 
NORMS MIGHT PREVAIL IN THE LONG RUN 


Yet it seems intuitively reasonable to expect some quantitative difference 
between the stability of a cooperative strategy such as CNF and of a 
strategy that usually defects with its clones. Axelrod (1984) has long argued 
that cooperative strategies like TFT have evolutionary advantages over 
noncooperative ones. Indeed, elsewhere we (Bendor and Swistak 1997) 
have proved this to be the case: more efficient (cooperative) strategies 
require smaller frequencies in order to stabilize in a group. If a similar 
result can be established under the assumptions of our model then co- 
operative strategies like CNF would be more robust in two related ways. 
First, a strategy that is maximally cooperative with itself (always plays 
$a_m$ with its clones) would be able to invade a native that is not socially 
nice with a lower frequency than one required by a mutant that sometimes 
defects with itself. Second, once established, a socially nice native would 
be able to resist larger invasions. Let us therefore focus on a strategy’s 
stabilizing frequency—the lowest frequency that ensures that a strategy 
is stable (under a specified dynamic). First, we establish the minimal 
stabilizing frequency of any strategy in an important subset of games of 
enforceable cooperation. Games in this subset have a well-defined 
punishment action, $a_d$, because either (1) there is only one action that satisfies 
the “swords and covenants” condition,** as in binary choice games such 
as chicken or the PD; or (2) there are several such actions, but among 
them there is one, call it $a_d$, that is clearly the best punishment since, 
while ensuring that the cost of imposing the punishment is as low as 


* To see why the pure Hobbesian state is not stable, consider the IPD. Let i, the native, 
be nasty (never the first to cooperate) so that it always defects with its clones and its 
neutral mutants. The invaders are STFT and some nonnasty strategy j. We know that 
in nontrivial games one can always design a j such that $V(\mathrm{STFT}, j) > V(i, j)$. So to be 
uniformly stable, i must punish STFT for doing better with j than it does. But since 
i is already defecting in every period against STFT, it has exhausted all possible 
punishments. So no version of i can work. Hence any nasty native can be beaten, and 
so the corresponding population state of complete defection is unstable. 

** That is, there is only one punishment action $a_d$ such that $\max_i v(a_i, a_d) < v(a_m, a_m)$. 
Fig. 3.—A game with two punishment actions: both y and s are punishments, but only 
s is an obvious punishment action 


possible, it gives deviants the lowest payoff;* that is, it maximizes the 
native’s minimum payoff.** We will refer to these situations as games with 
obvious punishment. (For an example of a game with more than one 
punishment action but only one obvious punishment see fig. 3.) All sub- 
sequent theorems will pertain to this subset of games of enforceable co- 
operation. Clearly, any binary choice game that is a game of enforceable 
cooperation must involve obvious punishment. 

To keep notation simple, we denote the following payoffs to correspond 
to the notation in the PD: T is the “temptation” payoff, the game’s largest 
payoff; R is the payoff to the maximally cooperative action $a_c$, that is, 
$v(a_c, a_c)$; $P_1$ is the payoff of the deviant’s best response to punishment 
(where $a_d$ is the well-defined punishment action), while $P_2$ denotes the 
lowest payoff that a player can get when implementing punishment. (In 
the symmetric PD, $P_1 = P_2$, but this is not generally the case.) We use $a_d$ 
to denote the best punishment action in a game with obvious punishment. 

Consider now a class of strategies that generalizes CNF in the following 
natural way. Any strategy in this class uses the standard friend-foe cat- 
egorization and embodies the following two fundamental and intuitive 
principles of friend-foe designations: 


* That is, $\max_i v(a_i, a_d) \le \max_i v(a_i, a_j)$ for all $a_j$. 
** That is, $\min_i v(a_d, a_i) \ge \min_i v(a_j, a_i)$ for all $a_j$. 
1. The foe of a friend is a foe: specifically, anyone who does not 
“cooperate” (play $a_c$) with a friend is a foe. 

2. The friend of a foe is a foe. There are two manifestations of being 
a friend of a foe: (a) anyone who does not punish (play $a_d$ toward) a 
foe is a foe; (b) anyone toward whom a foe is “too friendly” is a foe.* 


Strategies in this class thus play $a_c$ with friends and $a_d$ toward foes. 
Anyone who violates any of the fundamental principles today is a foe 
tomorrow; a friend in t who obeys the fundamental principles remains a 
friend in t + 1. Prior to play everyone is considered a friend. Any strategy 
with these properties we will call normatively nice and retaliatory. 

THEOREM 7.—In any repeated game of enforceable cooperation with 
obvious punishment and sufficiently important future, strategies which 
are normatively nice and retaliatory require the smallest minimal fre- 
quency in order to be uniformly stable. 

Thus normatively nice and retaliatory strategies are not only uniformly 
stable; they are also the most robust ones. Of all uniformly stable strat- 
egies, normatively nice and retaliatory ones require the smallest frequency 
to be stable under all evolutionary processes. This double robustness sug- 
gests that there is something powerful indeed about normatively nice and 
retaliatory strategies. 

It is interesting to note that any normatively nice and retaliatory native 
strategy creates a world with a very simple sociometric structure: from 
the perspective of the native strategy, people are divided into those who 
are in good standing and those who are not. Everyone in good standing 
must be on good terms with each other, and on bad terms with those in 
bad standing. What emerges, then, is the distinction, critical for group 
formation, between “us” and “them.” Hence the CNF mechanism, as im- 
plemented by normatively nice and retaliatory strategies, becomes the 
cornerstone of the group formation process. The resulting sociometric 
structure would be familiar to Heider (1958) and Lévi-Strauss (1974), 
among others.*** 

* Specifically, if i is the focal strategy that is categorizing its partners as friends or 
foes, then in period t strategy i regards strategy k as having been treated in “too 
friendly” a manner by a foe, $k'$, if $V(i, k') < V(k, k')$, where $V(i, k)$ denotes i’s payoff, 
through period t, when it plays k. 

** It might be useful to illustrate this result via a numerical example from the iterated 
PD. Using the payoffs from Axelrod's book (T = 5, R = 3, P = 1, and S = 0), the 
minimal stabilizing frequency (as $\delta$ goes to one) of a normatively nice and 
retaliatory strategy is two-thirds. (See the proof of theorem 7 in the appendix for a formula for 
the minimal stabilizing frequency.) This means that invasions of nearly one-third can 
be repelled. Such natives have a sizable basin of attraction. 

*** We would like to thank John Padgett for this latter point. 


Finally, let us directly compare the minimal stabilizing frequencies of 
strategies with varying degrees of efficiency. To keep the analysis 
straightforward and the comparisons “clean,” we will compare the efficiencies of 
strategies that belong to the class of CNF-like strategies: all use the re- 
cursive friend-foe construction and are unforgiving (once a foe, always a 
foe). 

Given that the actions are labeled so that $v(a_1, a_1) \le \cdots \le v(a_m, a_m)$, it 
follows that $V(i, i)(1 - \delta)$ is between $v(a_1, a_1)$ and $v(a_m, a_m)$ for any pure 
strategy i. We can thus write $V(i, i)(1 - \delta)$ as $x \cdot v(a_m, a_m) + (1 - x)\, v(a_1, a_1)$, 
where $0 \le x \le 1$, and so interpret x as an index of strategy i’s efficiency. 
For a completely efficient strategy, x = 1; for a completely inefficient 
strategy, x = 0. For example, in the IPD, if a CNF-like native prescribes 
cooperating in even periods and defecting in odd ones, then x = .5. 
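
The efficiency index can be computed directly for the alternating example. The sketch assumes that V(i, i) is the discounted sum of stage payoffs with discount parameter δ, which is how the normalization by (1 − δ) reads, and it uses Axelrod's R = 3 and P = 1 as the highest and lowest diagonal payoffs of the PD. The function name and the truncation at 40,000 periods are choices made for this illustration.

```python
def efficiency_index(stage_payoffs, delta, v_low, v_high):
    """Solve (1 - delta) * V = x * v_high + (1 - x) * v_low for x.

    stage_payoffs -- the payoffs a native earns against its clone, period by period
    """
    normalized = (1 - delta) * sum(u * delta ** t for t, u in enumerate(stage_payoffs))
    return (normalized - v_low) / (v_high - v_low)


R, P = 3, 1      # mutual cooperation and mutual defection in the PD
delta = 0.999    # a "sufficiently important" future

# Defect in odd periods (1, 3, ...) and cooperate in even ones.
stream = [P if period % 2 == 1 else R for period in range(1, 40001)]
x = efficiency_index(stream, delta, v_low=P, v_high=R)

print(round(x, 3))  # close to one-half
```

For this alternating pattern the infinite-horizon value works out to x = δ/(1 + δ), which tends to .5 as δ goes to one, in line with the figure quoted above.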

To ensure meaningful comparisons, in the next result we will only 
compare strategies that are maximally robust for any given degree of 
efficiency. That is, we will compare strategies that are the best 
representatives of their class of efficiency.* 

THEOREM 8.—In any repeated game of enforceable cooperation with 
obvious punishment and sufficiently important future, the more efficient 
the (maximally robust) CNF-type native, the larger the maximal invasion 
this strategy can repel. 

For example, consider a work-shirk game, where action is effort, which 
can be any integer in (0,..., 100). Assume that complete shirking (zero 
effort) is a strictly dominant strategy in the stage game, while the more 
effort one exerts the better off is one’s partner. Suppose the unique sym- 
metric optimal action is 75, so that if both people work flat out then the 
result is an inefficient rat race. Provided that $v(100,100) > v(0,0)$, working 
flat out can be supported by a uniformly stable norm-encoding strategy 
of the CNF type: everyone puts in 100% effort with friends and shirks 
completely with foes. However, the maximal invasion that this strategy 
can repel is smaller than that repelled by the CNF norm, which prescribes 
the best symmetric effort of (75, 75). 
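
A concrete payoff function makes the work-shirk example easier to check. The function below is purely an assumption (the article does not specify one): a player gains 1.5 per unit of the partner's effort and pays a convex cost of own effort squared over 100. It was chosen so that shirking is strictly dominant, 75 is the unique best symmetric effort, and v(100,100) > v(0,0). The sketch computes only the efficiency index of each norm; the sizes of the invasions each norm can repel depend on the appendix formula and are not computed here.

```python
EFFORTS = range(101)  # effort levels 0, 1, ..., 100


def v(own, partner):
    """Assumed stage payoff: gain from the partner's effort minus a convex own cost."""
    return 1.5 * partner - own ** 2 / 100


def efficiency(effort):
    """Index x of a norm prescribing `effort` with friends, from diagonal payoffs."""
    diagonal = [v(e, e) for e in EFFORTS]
    low, high = min(diagonal), max(diagonal)
    return (v(effort, effort) - low) / (high - low)


best_symmetric = max(EFFORTS, key=lambda e: v(e, e))
shirking_dominant = all(v(0, p) > v(e, p) for p in EFFORTS for e in EFFORTS if e > 0)
sword_works = v(100, 100) > max(v(e, 0) for e in EFFORTS)

print(best_symmetric, shirking_dominant, sword_works)  # 75 True True
print(efficiency(75), round(efficiency(100), 3))       # 1.0 0.889
```

Under these assumed payoffs the flat-out norm is enforceable but has an efficiency index of about .89, so theorem 8 ranks it below the (75, 75) norm in the size of invasion it can repel.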


* In the following result, we compare CNF-type natives. Two equally efficient CNF- 
type natives might not be equally robust because many games of enforceable 
cooperation have more than one type of feasible punishment action in the stage game, and 
different punishments can be differentially effective in repelling invaders. By restricting 
our comparison to maximally robust strategies, we are implicitly requiring that 
strategies use their most effective punishment action, i.e., the one that permits the repulsion 
of the largest possible invasion, given the native's degree of efficiency. (If there are 
several equally effective punishments, it does not matter which one is used.) 
EXTENSIONS 


Important note on the robustness of norm-governed behaviors.—Theorem 7 
confers a very significant methodological benefit. In its 
proof we have established not only the stability of certain norms but also 
their “degree of robustness,” that is, the precise value of the minimal 
frequency they require in order to stabilize in any group under any ev- 
olutionary dynamic. This means that if this threshold is not breached 
(invasions are smaller than the biggest that the native can repel), the 
native strategy will remain stable—no matter the source or nature of the 
mutation. Since mutations can arise from any variation in the model’s 
parameters, including, for instance, noise and uncertainty (see below), 
theorem 7 establishes the stability of normatively nice and retaliatory 
strategies under any small variation on the model’s parameters, one that 
does not produce “too many” mutants. This is a very powerful kind of 
robustness indeed. 

Network properties.—We will now elaborate on some of the most obvious 
and also perhaps most interesting departures from the assumptions 
of our model. Up to this point we have examined norms in dense social 
networks. These networks are dense both in terms of interaction— 
everyone plays everyone else in every period—and information— 
everyone knows, at the end of every period, what happened in all inter- 
actions in the community. A natural question, therefore, is, What happens 
in networks that do not meet these conditions? 

First, it can be easily seen that social norms are just as necessary in 
sparse networks as they are in dense ones. If we consider, for instance, 
an environment in which both interaction and information are 
sparse—each person plays only one other (randomly matched) person in 
each period, and the outcome of each interaction becomes known to only 
one third party (rather than becoming known to the entire commu- 
nity)—then the following result is virtually a corollary of theorem 1. 

PROPOSITION 2.—In all repeated symmetric games where the future is 
sufficiently important, with random matching of single partners and sin- 
gleton bystanders, any pure strategy that is uniformly stable must be 
social." 


* The proof of this fact essentially follows that of theorem 1, with a few minor mod- 
ifications. First, note that since dyadic strategies never use information about what 
happened in other interactions, it is as if they function in a world with no bystander 
observations at all. Hence the proof of theorem 1 does not depend on third-party 
monitoring in any way, and so this part of proposition 2 follows immediately from 
theorem 1. Further, the essential logic of the theorem's proof, which turns on the
existence of neutral mutants and of another mutant to which the neutral mutant is
preadapted, owes nothing to the density of interaction or information. Indeed, we can
make the networks arbitrarily sparse, and the result still holds. 
Thus social norms are indeed necessary in a much broader set of
environments than the world of theorem 1.

A separate issue concerns the existence of uniformly stable norms in 
sparse networks. Would, for instance, the normatively infused CNF con- 
tinue to be sufficient? Let’s consider again the effects of sparse interactions 
versus the effects of sparse third-party monitoring. It turns out that sparse 
interaction by itself creates no qualitatively new problems for social norms. 
The following example shows why this is so. Suppose that in period 1 A 
and B happen to pair up and A cheats B. In period 2 A and E happen 
to meet and they cooperate. Since this is an informationally dense com- 
munity, everyone knows that E failed to punish A in period 2. Conse- 
quently, from period 3 on, any player using the strategy of CNF will 
punish E whenever they meet her. Thus despite the sparser interaction, 
it is still true that no one can get away with free riding on the cooperative 
code. Hence CNF remains uniformly stable (provided, of course, that the 
future is sufficiently important). Sparse interaction, by itself, has no qual- 
itative impact. 

Similarly, delays in transmitting information about outcomes of third- 
party interactions do not alter any of our results. The reason is, once 
again, quite simple. Suppose that actor E’s violation in the above example 
becomes known to other members of the group only after some time has 
passed. If the discount parameter δ is sufficiently close to 1 (i.e., if future
payoffs are sufficiently important), the “payoff advantage” that E can gain 
while her transgression remains unknown will be offset by the loss from 
the ensuing punishment (assuming, of course, that it is severe [long] 
enough). 
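
The offsetting argument can be illustrated with a small numerical sketch. It assumes prisoner's dilemma stage payoffs T > R > P (temptation, reward, punishment), a violator who collects T in each of `delay` undetected periods, and a punishment phase that then lasts forever at P. The payoff labels, the function names, and the unending punishment are illustrative assumptions, not part of the model above.

```python
import math


def cooperate_value(R, delta):
    """Discounted payoff from cooperating in every period."""
    return R / (1 - delta)


def deviate_value(T, P, delta, delay):
    """Discounted payoff from violating the code when the violation stays
    unknown for `delay` periods (payoff T each) and is punished forever
    afterward (payoff P each). Both phases are hypothetical simplifications."""
    undetected = T * (1 - delta**delay) / (1 - delta)
    punished = P * delta**delay / (1 - delta)
    return undetected + punished


def max_tolerable_delay(T, R, P, delta):
    """Longest delay for which violating is still unprofitable.

    Violating does not pay when delta**delay >= (T - R) / (T - P)."""
    return math.floor(math.log((T - R) / (T - P)) / math.log(delta))


# Hypothetical payoffs T=5, R=3, P=1 with discount parameter 0.9.
longest = max_tolerable_delay(5, 3, 1, 0.9)
```

Under these assumed numbers, any fixed delay becomes tolerable once δ is close enough to 1, which is the sense in which the lag does not alter the result.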

Sparse third-party information, however, can be damaging—naturally 
so, since this information is the lifeblood of social norms. To see this, note 
that at one extreme, when there is no third-party information at all, social 
norms are simply infeasible: player E cannot punish A for A’s cheating 
B if E never learns what A did. A continuity argument suggests that 
when it exists but is extremely sparse, such norms should also be impos- 
sible to sustain. Either too few people will know about deviations to 
support the native’s code, or the native strategy will be so hair-trigger 
sensitive, to offset the infrequency of third-party information, that the 
native will eventually turn on and destroy itself. Near the other end of 
the informational continuum, if almost everyone gets third-party information,
then uniformly stable norms do exist. People who are “out of the
loop” are, in effect, playing mutant strategies, and we already know that 
if there are sufficiently few deviants, CNF and other norm-encoding strategies


“ Raub and Weesie (1990) and Buskens (1999) study the effects of informational lags 
about third-party interactions on the efficiency of outcomes. 

are uniformly stable. Hopefully the breakdown of social norms is 
analytically well behaved in that we can parameterize the amount of third- 
party information so that norms are uniformly stable if and only if the 
parameter exceeds a threshold. In general, however, this remains a matter 
for further research. 

Uncertainty.—In the preceding analyses, norms are enforced flawlessly 
because players always know how their partners had behaved in previous 
periods. In the real world, uncertainty intrudes in various ways. Two of 
the most important kinds of uncertainty concern perceptions and slippages 
between intentions and actions (Axelrod and Dion 1988). First, players 
may misperceive each other’s actions. Thus A thinks B cheated C when 
in fact B cooperated. Incorrectly believing that B violated a code of co- 
operation, A punishes B. The opposite error can also occur: A, mistakenly 
believing that B helped C, might fail to punish B’s actual transgression. 
The second type of uncertainty involves implementation: A intends to 
cooperate but—possibly due to external shocks (A’s computer crashes and 
the report cannot be completed)—fails to do so. B, observing only what 
A has done and not A’s intention, responds accordingly. 

At the beginning of this section we have already noted that if the “noise” 
clouding any parameters of the game does not create “too many” mutants, 
then theorem 7 ensures that the native norm remains stable. This need 
not be true if the amount of uncertainty increases. Clearly, sufficient 
amounts of either type of noise, misperceptions or implementation glitches, 
can make the enforcement of norms—or indeed the use of any strat- 
egy—problematic (see, e.g., Downs, Rocke, and Siverson 1985; Molander 
1985; Donninger 1986; Bendor 1987; Mueller 1987; Nowak and Sigmund 
1989, 1990, 1992, 1993; Nowak 1990; Bendor, Kramer, and Stout 1991; 
Kollock 1993; and Lomborg 1996). 

If, for instance, the noise is such that within a generation everyone is 
bound to implement an action that he or she did not intend, social norms 
turn out not to be necessary to stabilize behaviors—in this case dyadic 
strategies can be uniformly stable as well. In a paradoxical twist the 
introduction of what is usually taken to be a problem—uncertainty—has 
a stabilizing effect on some dyadic strategies. 

But since some of these now-stable strategies are dyadic, it is no longer 


*” A constant probability of a tremble in every period of the game implies that in an 
infinite game trembles will reveal all latent differences among otherwise observationally 
indistinguishable strategies. Thus if trembles can occur in every period, neutral mutants 
are eliminated. This in turn allows dyadic strategies such as ALL-D to be uniformly 
stable. Indeed, since neutral mutants do not exist when trembles occur, a higher per- 
formance standard can be achieved: there are dyadic strategies (such as ALL-D) that 
are unique best responses to themselves (Selten 1983) and so are strongly stable under 
all evolutionary processes, once sufficiently numerous (Boyd 1989). 
true that under this type of uncertainty stability requires social strategies. 
Would, then, social strategies retain some other evolutionary advantage 
over dyadic ones? Based on some simulations that we have done (not 
reported here) we conjecture that they would. It seems that turning a 
dyadic strategy into a social one can yield two types of advantages: first, 
it can turn an evolutionarily unstable strategy into a stable one; second, 
it can increase a strategy’s robustness (i.e., decrease its minimal stabilizing 
frequency). 

Consider, for instance, the Boyd-Lorberbaum ecology, discussed at 
length earlier, in which a native TFT is invaded by TF2T and STFT. 
Recall that in the absence of noise TFT is not uniformly stable in this 
ecology. (TF2T garners the highest payoff.) In fact, the same holds true 
if there is noise but trembles are sufficiently infrequent. Thus for small 
levels of noise dyadic TFT remains evolutionarily unstable.” This would 
no longer be true, however, if one turns the dyadic TFT into the social 
one. The social version of TFT uses the standard friend-foe categorization 
in a TFT-like way: someone is a friend tomorrow if and only if they 
cooperate with friends and defect against foes today. This social TFT is 
stable. In fact our simulations show that it can repel invasions in the 
Boyd-Lorberbaum ecology even under sizable levels of noise.” 
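
The friend-foe bookkeeping behind the social TFT can be sketched in a few lines. This is only an illustration of the standing rule stated above, not the simulation code referred to in the text: the function names, the set-based representation, and the convention that players who are not observed in a period keep their previous standing are all assumptions made for the example.

```python
def prescribed_action(partner, friends):
    """Cooperate ("C") with friends, defect ("D") against foes."""
    return "C" if partner in friends else "D"


def update_standing(friends, interactions):
    """Return tomorrow's set of friends.

    interactions: iterable of (actor, partner, action) observed today.
    A player is a friend tomorrow iff every action taken today matched
    the prescription: cooperate with friends, defect against foes.
    """
    complied = {}
    for actor, partner, action in interactions:
        ok = action == prescribed_action(partner, friends)
        complied[actor] = complied.get(actor, True) and ok
    # Assumption: players not observed today keep yesterday's standing.
    unobserved_friends = {p for p in friends if p not in complied}
    return unobserved_friends | {p for p, ok in complied.items() if ok}


friends = {"A", "B", "C"}
# Period 1: A cheats B, who is a friend; B cooperates as prescribed.
friends = update_standing(friends, [("A", "B", "D"), ("B", "A", "C")])
after_period_1 = set(friends)
# Period 2: C cooperates with A, who is now a foe, so C fails to punish.
friends = update_standing(friends, [("C", "A", "C"), ("A", "C", "C")])
after_period_2 = set(friends)
```

In this toy run C loses standing for failing to sanction A, while A regains it by behaving as prescribed, which is the TFT-like (forgiving) feature of the rule.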

The relation between norms and various kinds of uncertainty is a com- 
plex and important topic that clearly merits further research. Some issues 
are of obvious importance for theories of institutions and public policy. 
Consider, for instance, the following problem: How does increasing un- 
certainty affect the relative effectiveness of enforcing codes by centralized 
institutions (e.g., a formal system of police and courts) versus enforcement 
by the more decentralized institutions of norms? In general, we suspect 
that enforcement by social strategies—particularly stringent ones such as 
CNF—will become increasingly “brittle” as relations become noisier, com- 
munities larger, and monitoring more difficult.” Hence, we believe that 
a classical thesis about enforcing codes of behavior in complex societies—if 


* This in turn remains true even if one modifies TFT (Sugden 1986) to enable it to 
cope more effectively with noise. Sugden’s variant (contrite tit for tat) is well-designed 
to handle trembles, but it is still a dyadic strategy and so cannot enforce a code of 
conduct on third parties. 

£ Even when the probability of making an implementation error by each player in 
every period of the IPD was set at 10%, the social TFT was still able to repel all 
invasions in the Boyd-Lorberbaum ecology. 

" This interaction between size and monitoring problems has been studied via Nash 
equilibrium analysis in, e.g., Bendor and Mookherjee (1987) and Radner (1986). The 
general conclusion is that, given informational imperfections, the larger the community 
the harder it is to sustain cooperative outcomes by decentralized regimes (e.g., trigger 
strategies), and centralized institutions become relatively superior. 
rules in such societies are to be effectively enforced then formal institutions 
are required—will be supported by a careful evolutionary analysis. 


APPLICATIONS AND INTERPRETATIONS 


This article has tried to answer the question, Why are there norms? The 
core of our answer is that in demanding selection environments, social 
norms enjoy the crucial evolutionary advantage of replicating more easily 
than do other behaviors. We have also established that this evolutionary 
advantage of social strategies and their corresponding norms continues 
to hold if perturbations of the basic model do not produce “too many” 
deviant strategies. Hence our result is robust under small perturbations 
of the parameters of the model. It is not, however, universal: it does not 
hold for all values of the parameters. Norms do not always replicate more 
easily. As with any formal result, we can only expect the deductive con- 
clusions to be observed in reality if this reality conforms to the assumptions 
of our model. Hence in conclusion, we would like to review the meaning 
of several major assumptions and provide interpretations of the model’s 
crucial parameters. 

At the most general level we should note that the cognitive requirements 
of using norms are fundamental to a general (transspecific) theory of 
strategic behavior but irrelevant for sociology as long as it is confined to 
humans. Clearly, social strategies require more cognitive sophistication 
than do dyadic ones. Studies of animal behavior have shown that, for 
example, primates easily meet these higher requirements.” But some less 
intelligent animals, such as bats (Wilkinson 1984), reciprocate dyadically 
without using third-party sanctions. Thus the evolution of social norms 
depends on the evolution of big brains. Because all normal humans satisfy 
this condition, this parameter is a constant in the study of human societies 
and hence irrelevant in this domain. However, it would matter when 
studying the different social systems of mammals, for example. 

A second general observation is that our models assume that the pri- 
mary form of interaction involves two people.” This is a substantive 
assumption which may not be adequate in many situations. Such is the 
case with common dilemmas. The problem of collective goods has a dif- 
ferent structure and one that is more appropriately represented as an N- 
person game. Such games have a different formal and conceptual structure 


* Chimpanzees, e.g., form intricate coalitions that involve third-party interventions 
(de Waal 1982). 

H Our deductive conclusion is that players will be linked to others by social strategies 
that impose third-party sanctions, but the basic building blocks in our model, as in 
Homans (1950), are two-person interactions. 
in which the notion of third-party sanctions, as it was used in this article, 
would have to be redefined and reanalyzed. 

To gain insight into the more specific assumptions embedded in our 
theory, consider the following hypothetical application. Assume, for in- 
stance, that we want to understand why community A has converged on 
prosperous cooperation and thrives economically while community B, 
despite many underlying similarities, is burdened with defections and 
ensuing economic stagnation. First, we should note that theorems 2-8 all 
apply to games of enforceable cooperation. Hence if we want to use these 
results to draw conclusions about behaviors in communities A and B, 
these conclusions must be limited to behaviors that can be modeled as 
games of enforceable cooperation. Second, since our evolutionary model 
allows only for an ε amount of noise and requires sufficiently large values 
of δ, it is best suited for modeling small communities where gossip is an 
efficient source of information and probabilities of future interactions are 
high. 

But restrictions on the types of communities and interactions among 
players are not the only necessary caveats. Other conditions must hold 
as well. For instance, since all our theorems specify equilibrium conditions 
they should be applied only to communities that have attained stable 
states, that is, remained stable for a reasonably long time. Similarly our 
analysis proclaims conditions that are necessary and/or sufficient to obtain 
uniform stability—that is, stability under all types of evolutionary dynamics.
Yet A and B can be stable in a weaker sense: they could be stable 
under some but not all evolutionary processes. (For example, elsewhere 
we have shown [Bendor and Swistak 1997] that if conformity incentives 
are sufficiently strong, then dyadic strategies such as TFT can be stable 
under certain evolutionary dynamics, even though certain other dynamics 
would destabilize TFT—and any other—dyadic strategy.) 

If we can successfully assume that all of the above conditions hold, our 
theorems can finally be used to make a number of interesting predictions. 
First, theorem 1 and its corollary tell us that both the efficient equilibrium 
of A and the inefficient equilibrium of B must be supported by social 
norms. Neither a cooperative state nor a defective one can, in the long 
term, be supported by dyadic rewards and punishments—third-party 
sanctions are necessary to keep them stable. These social norms, as the- 
orems 3, 4, and 5 tell us, must be perpetually social (they can never become 
dyadic), comprehensively social (no players can be excluded from the 
group enforcement), and thorough (they have to punish deviations of all 
orders). 


5 As noted earlier, however, it is not required that these social norms be unforgiving; 
this feature of CNF is not essential. 




American Journal of Sociology 


The fact that uniform stability is not possible without a social structure 
of interactions is perhaps the most important conclusion. (It is also in- 
teresting that social structure is needed regardless of the efficiency of the 
group’s output.) 

Second, theorem 6 tells us that both the efficient state of community 
A and the inefficient state of community B can be explained as uniformly 
stable equilibria. In fact our theory predicts that any state, from the state 
of almost pure defection to the state of pure cooperation, can be realized 
as a uniformly stable equilibrium. 

Third, uniform stability means that both communities are robustly sta- 
ble. For instance, if we had the power to change players’ incentives and 
push them all in the direction of Homo economicus, this change would 
not affect the equilibrium. Thus, for example, if the defective norms that 
imprisoned Banfield’s impoverished southern Italian town (1958) had 
been robustly stable, then even if the citizens’ values had drifted in the 
direction of Homo economicus the town’s inefficient equilibrium would 
have persisted. Changing values would not have sufficed to destabilize 
that steady state of poverty. Hence, by telling us what kinds of changes 
will not work, such considerations can help us design policy mechanisms 
that can overturn inefficient, defection-ridden equilibria (Stinchcombe 
1997). 

Fourth, despite their qualitative similarity—both A and B being ro- 
bustly stable—these communities exhibit an important quantitative dis- 
similarity. This difference is revealed by theorems 7 and 8, which tell us 
that the more cooperative a strategy (other properties equal) the lower 
the frequency it needs to be stable, and the lowest stabilizing frequency 
is attained by strategies that are cooperative and retaliatory. These in turn 
imply that the efficient equilibrium of A is more robust than the less 
efficient equilibrium of B: A will be able to resist larger invasions of 
mutant behaviors than would B. In other words, to maintain an inefficient 
equilibrium a community has to be better isolated from external shocks 
(immigration, cultural diffusion, etc.) than a community in an efficient 
equilibrium. Note that this conclusion has policy implications for the 
possibilities of overturning certain types of collectively undesirable 
equilibria. 

We hope the above discussion provides a better understanding of the 
concepts and assumptions that underlie the deductive results of this article. 
We also believe that our results throw new light on some old but always 
vital and controversial debates about how social and economic forces are 
related to each other. One can clearly see, for instance, that our hypo- 
thetical example of two communities—one enjoying a good equilibrium, 
the other mired in a bad one—resembles the approach taken by, for 
example, Putnam (1993), among others, in his argument that differences 
in social capital can explain the economic success of Northern Italy and 
the failure of the South. Our findings provide new insights and new 
interpretations for such issues. Indeed, Coleman’s concept of social capital 
receives a new interpretation from the robust, efficient equilibria attained 
by normatively nice and retaliatory strategies. As the phrasing of his 
intriguing idea—the juxtaposition of “social” with “capital”—suggests, 
Homo sociologicus and Homo economicus must work together if societies 
are to reach their potential. 


APPENDIX 


Proofs 
Theorem 2 


It is easy to see that if a stage game is one of enforceable cooperation 
then there exist uniformly stable strategies. Consider, for instance, the 
strategy of CNF, as defined in the text. Take now any group $E = 
\{(i,p_i),(j_1,p_1),\ldots,(j_N,p_N)\}$, where $i$ denotes CNF. For any strategy $j_k$ ($k = 
1,\ldots,N$), either $j_k$'s payoff is identical to $i$'s (for all values of $\delta$ and $p_i$) or 
$i$ defects with $j_k$ from some period on. In this second case we have 
$V(i,i) > V(j_k,i)$ for sufficiently large $\delta$, which implies that $V(i) > V(j_k)$ if 
$p_i$ is sufficiently large. Thus, for sufficiently large $p_i$ there is a $\delta_0$ such that 
in all groups where $\delta > \delta_0$ we have $V(i) \geq V(j_k)$, which implies that $i$ is 
uniformly stable. 

To show that if there is a pure uniformly stable strategy in a nontrivial 
repeated game with sufficiently large $\delta$ then the stage game has to be a 
game of enforceable cooperation is more complex. Assume, by contradiction, 
that there exists a game which has a pure uniformly stable strategy 
$i$ and yet it is not a game of enforceable cooperation. Since the stage game 
is not a game of enforceable cooperation there is no effective punishment 
action, that is, for any distinct actions $a_r$ and $a_s$ we have 


$v(a_r,a_r) \leq \max_t v(a_t,a_s)$. (A1) 


Suppose now that strategy $i$ is uniformly stable for sufficiently high 
$p_i$ and $\delta$ and consider the following ecology $E = \{(i,p_i),(j,p_j),(k,p_k)\}$. In this 
ecology the mutant $k$ is constructed so as to give the mutant $j$ the highest 
possible payoff while ensuring that the native $i$ does not obtain this payoff 
when it plays $k$. This will allow $j$ to beat $i$, whence $i$ cannot be uniformly 
stable. So we construct strategies $j$ and $k$ as follows. 

First, take $j$ to be such that $V(i,i) \leq V(j,i)$. Note that since $i$ is a pure 
strategy and (A1) holds, it is possible to obtain such a $j$. Before we construct 
strategy $k$ and refine our construction of $j$, let's permute the $m$ 
actions of the stage game so that $v(a_1,a_1) \geq \cdots \geq v(a_m,a_m)$. Thus if any of 
the diagonal payoffs equal the stage game’s maximal payoff of 
$\max_{s,t} v(a_s,a_t)$, then $v(a_1,a_1)$ must do so. In general, however, there may be 
more than one action which yields the game’s maximal payoff on the 
diagonal. Thus the payoffs on the diagonal must belong to one of two 
distinct subsets, maximal and nonmaximal, that is, there is an integer $w$, 
where $0 \leq w \leq m$, such that $v(a_r,a_r) = \max_{s,t} v(a_s,a_t)$ for all $r = 1,\ldots,w$ and 
$v(a_r,a_r) < \max_{s,t} v(a_s,a_t)$ for all $r = w + 1,\ldots,m$. (If no diagonal payoff is 
maximal, then $w = 0$.) 

Note that if for any $r \leq w$ $v(a_r,a_r) = v(a_r,a_t)$ for all $t$ then Row would 
get the maximal payoff no matter what Column did, that is, the stage 
game would be trivial. Since we have assumed that it is not trivial, for 
any $r \leq w$ there must be a $t$ such that 


$v(a_r,a_r) > v(a_r,a_t)$. (A2) 


For any $r \leq w$ denote an action $a_t$ for which (A2) holds as $a_{t_r}$. Now 
consider the following mixed strategy: 


$k^* = (1/m)a_{t_1} + \cdots + (1/m)a_{t_w} + (1/m)a_{w+1} + \cdots + (1/m)a_m$, 


which means that $k^*$ plays $a_{t_1},\ldots,a_{t_w},a_{w+1},\ldots,a_m$, each with probability 
$1/m$. (This construction ensures that no strategy which plays $k^*$ can get 
the stage game’s maximal payoff.) Let $a_x$ and $a_y$, where $1 \leq x,y \leq m$, be 
such that $v(a_x,a_y) = \max_{s,t} v(a_s,a_t)$. Now, construct strategy $k$ and refine 
strategy $j$ as follows. In period 1 $k$ plays any action that is different than 
the action of $i$ and $j$ in period 1. In period 2 $j$ plays any action toward 
$k$ which is different than what $i$ plays against $k$. Now, from period 3 on 
$k$ plays $k^*$ against $i$ and $a_y$ against $j$, whereas $j$ plays $a_x$ against $k$. Note 
that in any period $t > 2$, $v^t(j,k) = \max_{s,t} v(a_s,a_t)$ and, given the construction 
of $k$, $v^t(i,k) < \max_{s,t} v(a_s,a_t)$, where $v^t(i,j)$ denotes payoff in period $t$ in a 
game between $i$ and $j$. Thus, for sufficiently high $\delta$ we have $V(j,k) > 
V(i,k)$. For any $p_i$ and sufficiently large $\delta$, as $p_j$ converges to zero, $V(i)$ 
converges to $p_i V(i,i) + (1 - p_i)V(i,k)$ whereas $V(j)$ converges (the left-hand-side 
limit only) to $p_i V(j,i) + (1 - p_i)V(j,k)$. Since $V(j,k) > V(i,k)$, for any $p_i$ 
we can find $p_j$ and $p_k$ such that $V(i) < V(j)$ for sufficiently large $\delta$’s. This, 
however, means that $i$ is not uniformly stable, and so we have a contradiction. 
Q.E.D. 


Theorem 3 


Suppose that strategy $i$ is not perpetually social. Then there is some period 
$t$ after which it becomes dyadic. Then let there be an arbitrarily small 
invasion by strategies $j$ and $k$, where these strategies are designed as 
follows. Both are neutral mutants of $i$ for periods $1,\ldots,t$. After $t$, strategy 
$j$ continues to behave as $i$ does in all encounters with $i$ and itself; hence 
$V(i,i) = V(i,j) = V(j,i) = V(j,j)$. However, $k$ differentiates itself from the 
native in $t + 1$, and in $t + 2$ $j$ behaves differently toward $k$ than $i$ does. 
This differentiation permits $k$ to play with $j$ differently than it does with 
$i$, ensuring (for any nontrivial game) that $V(j,k) > V(i,k)$, and hence 
$V(j) > V(i)$, for sufficiently large $\delta$. Q.E.D. 


Theorem 4 


Suppose that strategy $i$ is not comprehensively social. Then there is a set 
of strategies $\{i,j,k,\ldots\}$ and period $t$ in the game such that in all periods 
following $t$ $i$'s actions toward all strategies in the group are independent 
of their interactions with $j$. Assume now that strategy $k$ is a neutral mutant 
of $i$ up to and including period $t$. From period $t + 1$ on $k$ behaves the 
same way $i$ does toward all strategies in the group, except for strategy $j$. 
More specifically, consider $k$ and $j$ as in the proof of theorem 3. Strategy 
$i$ will score the same as $j$ with all strategies in the group except for $k$. 
And since $j$ scores higher with $k$ than $i$ does, for sufficiently large $\delta$ we 
will get $V(j) > V(i)$. But this means that $i$ is not uniformly stable. Q.E.D. 


Theorem 5 


Suppose in any game of enforceable cooperation, a strategy $i$ does not 
have a metanorm structure. Then there must be some $m$th-order deviation 
that $i$ does not punish. Construct a mutant $i'$ that is identical to $i$ except 
that $i'$ commits $m$th-order deviations. Now invade $i$ by a combination of 
$i'$ plus a set of $m - 1$ distinct mutant strategies, where $j_1$ commits a first-order 
deviation, mutant $j_2$ commits a second-order deviation (does not 
punish first-order deviations), and so on until mutant $j_{m-1}$, which does not 
punish $(m - 2)$th-order deviations. All of these strategies respond to punishment 
by playing $a_d$. The native, $i$, punishes all strategies (i.e., 
$j_1,\ldots,j_{m-1}$), while $i'$ punishes all except $j_{m-1}$, with whom it continues in a 
“cooperative” relationship where both play $a_c$ in every period. Thus with 
strategy $j_{m-1}$, the native gets only $v(a_d,a_d)$ as a per period average payoff, 
while the mutant $i'$ gets $v(a_c,a_c)$, which must be higher. Since 
by construction $i'$ behaves identically to $i$ with all other strategies, it gets 
the same payoff with them as $i$ does, whence $V(i') > V(i)$ for sufficiently 
large $\delta$. Q.E.D. 


Theorem 6 


We will say that a pure strategy $i$ is ecology neutral with $i^*$ in ecology $E$, 
where $i, i^* \in E$, if an action taken by $i$ in any period against any opponent 
$j \in E$ is the same as $i^*$'s action against $j$. Two strategies which are ecology 
neutral in $E$ are behaviorally indistinguishable in $E$ since they act the 
same against all other strategies in $E$. Consider now an infinite sequence 
of zeros and ones. Take the initial $n$ elements of the sequence and denote 
by $n(1)$ the number of ones among them. Any real number $0 \leq r \leq 1$ can 
be represented (usually in more than one way) by the following limit: 
$\lim_{n \to \infty} [n(1)/n]$. Take now a pure strategy $i$, which “cooperates” (plays $a_c$) 
only in periods corresponding to the element “1” in the infinite series as 
long as its opponent $j$ does the same in a game with $i$ as well as in all 
other pairwise games in the ecology (i.e., if $j$ is ecology neutral with $i$); if 
$j$ moves differently in any period of the game, $i$ responds by “defecting” 
(playing some punishment action) in all periods that follow. By the definition 
of $i$, $i$ supports $r$ degrees of $a_c$-cooperation when universal in the 
population. 

If $r > 0$, then a strategy $i$ that supports $r$ degrees of $a_c$-cooperation must 
cooperate when playing its clone in an infinite number of periods. If a 
strategy $j$ is not ecology neutral with $i$, then $j$ moves differently than $i$ 
against some strategy $k$ in some period $t$ of the game. By the definition 
of $i$, from period $t + 1$ on, $i$ will support $r$ degrees of $a_c$-cooperation when 
playing another $i$ while “defecting” with $j$ in all moves. Thus for sufficiently 
high $\delta$ we will have $V(i,i) > V(j,i)$, which ensures that $i$ is uniformly 
stable. Q.E.D. 
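
The representation of a degree of cooperation r by a sequence of zeros and ones can be made concrete with a short sketch. The greedy rule below is one hypothetical way (among many) to build such a sequence: it places a "1" whenever the running count of ones has fallen behind r times the number of periods elapsed, so the share of ones in the first n periods stays within 1/n of r. The function name and the greedy construction are assumptions for illustration and do not come from the proof.

```python
def cooperation_schedule(r, n_periods):
    """Return a 0/1 list whose running share of ones tracks r (0 <= r <= 1).

    A 1 marks a period in which the strategy would "cooperate".
    """
    if not 0 <= r <= 1:
        raise ValueError("r must lie between 0 and 1")
    schedule = []
    ones = 0
    for t in range(1, n_periods + 1):
        if ones < r * t:      # behind the target share: cooperate now
            schedule.append(1)
            ones += 1
        else:                 # at or ahead of the target share: defect
            schedule.append(0)
    return schedule


schedule = cooperation_schedule(0.75, 1000)
share_of_ones = sum(schedule) / len(schedule)
```

Because the count of ones after t periods always lies between r·t and r·t + 1 under this rule, the share converges to r as the number of periods grows.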


Theorem 7 


We will first show that, in any repeated game of enforceable cooperation 
with obvious punishment and sufficiently high $\delta$, the minimal frequency 
that stabilizes a strategy under any evolutionary dynamic is 
$(T - S)/(T + R - P - S)$. Assume, by contradiction, that there is a strategy $i$ with 
the minimal stabilizing frequency $[(T - S)/(T + R - P - S)] - \epsilon$, 
where $\epsilon > 0$. Consider $E = \{(i,p_i),(j,p_j),(k,p_k)\}$, where $p_i = 
[(T - S)/(T + R - P - S)] - \epsilon$. Let $j$ and $k$ be such that after some initial trigger 
moves, $j$ always defects with $i$ and $k$ and always cooperates with other 
$j$'s, while $k$ always defects with $i$ and cooperates with $j$. For example, if 
$i$ cooperates in the first period take $j$ and $k$ as follows: $j$ defects in periods 
1 and 2 (unconditionally), and then cooperates with all strategies which 
defected in the first two periods and defects unconditionally with all other 
strategies; $k$ defects in period 1 and then cooperates unconditionally with 
all strategies that defected in period 1 and defects unconditionally otherwise. 
(A similar construction is possible when $i$ defects in period 1.) 
For a moment we do not assume anything about the frequencies of $j$ 
and $k$ in the population, other than, obviously, $p_i + p_j + p_k = 1$. In such 
an ecology, min lim, VGX1 — 6) = pR +PR+9,P and max lim,,, 
VXL — 8) = HB + p,R + p,T. For i to be stable under all evolutionary 
processes it is necessary that max lim, ., VGX1 — 8) > min lim,., VQ) 
(1 — 8), thatis,p,.R+/,2+?,R2/.R +p,R + p,T. Denoting p, = e, we 
can write this inequality as 


$p_i \geq (T - S)/(T + R - P - S) - [e(T - R)/(T + R - P - S)]$. 


We have assumed, however, that $p_i = [(T - S)/(T + R - P - S)] - \epsilon$, 
which gives us $\epsilon \leq e[(T - R)/(T + R - P - S)]$. Since nothing was assumed 
about the value of $e$, we can clearly take $e$ small enough to get 
$\epsilon > e[(T - R)/(T + R - P - S)]$, which contradicts $i$’s minimal stabilizing 
frequency being $[(T - S)/(T + R - P - S)] - \epsilon$. 

We will now prove that in any repeated game of enforceable cooperation 
with obvious punishment and sufficiently high $\delta$, strategies which are 
normatively nice and retaliatory require the smallest minimal frequency 
of $(T - S)/(T + R - P - S)$ to be uniformly stable. 

Take any group $E = \{(j_1,p_1),(j_2,p_2),\ldots,(j_N,p_N)\}$ in which $j_1$ is normatively 
nice and retaliatory. Consider any $j_m$ ($m = 2,\ldots,N$). We will examine the 
following two cases: (1) $j_m$ cooperates infinitely often with $j_1$, that is, for 
each period $n$ there is a period $k$, $k > n$, such that $j_m$ cooperates with $j_1$ 
in period $k$; and (2) $j_m$ cooperates finitely often with $j_1$, that is, there exists 
a period $k$ such that in each period $n$, $n > k$, $j_m$ defects with $j_1$. 

Case 1.—If case 1 holds, then take any period $k$ such that $j_1$ cooperates
with $j_m$ in period $k + 1$. Since $j_1$ is normatively retaliatory, we get
$V^k(j_1, j_t) \ge V^k(j_m, j_t)$ for all $j_t$ ($t = 2, \ldots, N$). Moreover, since for every $n$
there is $k > n$ such that $j_1$ cooperates with $j_m$ in period $k + 1$, there is an
infinite series $k_1, k_2, \ldots$, such that for every $k_i$, $V^{k_i}(j_1, j_t) \ge V^{k_i}(j_m, j_t)$. Con-
sequently, $V(j_1, j_t) \ge V(j_m, j_t)$. Thus $V(j_1) \ge V(j_m)$ for all values of $\delta$, which
completes the proof of case 1.

Case 2.—Consider, again, an arbitrary $j_m$ ($m = 2, \ldots, N$). In case 2 there
is a period $k + 1$ such that in all periods $n$, $n > k$, $j_1$ defects with $j_m$. Let's
decompose the total payoff $V$ into $V = V^k + V^{k\infty}$, where $V^k$ is the payoff
after the first $k$ periods and $V^{k\infty}$ is the continuation payoff (in the remaining
infinite part of the game). Consider the second part of the game, that is,
the $V^{k\infty}$'s. In this part $j_1$ defects with $j_m$ in all periods. From case 1, we
know that for any strategy $j$, if $j_1$ cooperates with $j$ in infinitely many
periods then $\min[V(j_1, j) - V(j_m, j)] \ge 0$; if, however, $j_1$ defects with $j$
from some period on, then $\min[V(j_1, j) - V(j_m, j)] = \min V(j_1, j) -
\max V(j_m, j) = S - T < 0$. Since this second expression is smaller,
$\min[V(j_1, j) - V(j_m, j)] = S - T$. Thus,

$$
\begin{aligned}
\min[V^{k\infty}(j_1) - V^{k\infty}(j_m)] &= \min[V^{k\infty}(j_1, j_1) - V^{k\infty}(j_m, j_1)]\,p_1 \\
&\quad + \min\Big[\sum_{t \ge 2} \big(V^{k\infty}(j_1, j_t) - V^{k\infty}(j_m, j_t)\big)\,p_t\Big] \\
&= p_1(R - P) + (1 - p_1)(S - T).
\end{aligned}
$$

Considering now that $\lim_{\delta \to 1} V^{k\infty}(j^*) = \lim_{\delta \to 1} V(j^*)$ for any strategy $j^*$,
$V(j_1) - V(j_m)$ approaches 0 as $\delta$ approaches 1 and $p_1 = (T - S)/(T +
R - P - S)$, which is equivalent to saying that the minimal stabilizing
frequency of $j_1$ approaches $(T - S)/(T + R - P - S)$. This completes the
proof of case 2 and theorem 7. Q.E.D.
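
The threshold in the proof of case 2 can be checked numerically. The sketch below is illustrative only: it assumes the garbled payoff symbols stand for the usual temptation, reward, punishment, and sucker payoffs with T > R > P > S, and it reads the worst-case continuation gap as p1(R - P) + (1 - p1)(S - T). The function names and the payoff values 5, 3, 1, 0 are hypothetical and do not come from the article.

```python
from fractions import Fraction


def payoff_gap(p1, T, R, P, S):
    """Worst-case continuation payoff gap p1*(R - P) + (1 - p1)*(S - T)."""
    return p1 * (R - P) + (1 - p1) * (S - T)


def minimal_stabilizing_frequency(T, R, P, S):
    """Frequency p1 at which the worst-case gap is exactly zero."""
    if not (T > R > P > S):
        raise ValueError("expected payoffs ordered T > R > P > S")
    return Fraction(T - S, T + R - P - S)


# Illustrative payoffs (an assumption, not taken from the article).
T, R, P, S = 5, 3, 1, 0
p_star = minimal_stabilizing_frequency(T, R, P, S)
print(p_star)                           # 5/7
print(payoff_gap(p_star, T, R, P, S))   # 0
```

Because T - S > R - P whenever T > R and P > S, the threshold always exceeds one-half under this reading.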


Theorem 8 


Since a maximally robust native must use the most effective obvious 
punishment action, without loss of generality we restrict attention to the 
case where all the strategies being compared use the obvious punishment 
action a,. Suppose now that a CNF native strategy i has level of efficiency 
x, where lim,.., V(¡X1 — 8) = [rv(a, 0.) + (1 — xju(a,8,)]. 

Consider an ecology where CNF would be most vulnerable to invasion. 
Such a “test” invasion for a CNF native, i, is one in which there are two
mutants, say j and k, where $\epsilon_j + \epsilon_k = \epsilon$ and the invaders are constructed
so as to boost the fitness of one of the strategies (say, j) and to reduce the
fitness of the native as much as possible, in order to create a winning
mutant ($V(j) > V(i)$) with the smallest possible invasion size. Thus j and
k are constructed as follows: j plays a best response to the native’s pun- 
ishment and plays a, with itself. Mutant 2, after “recognizing” j, plays so 
as to maximize j’s pairwise score; thus lim,., V(J,kX1 — 6) = T. Mean- 
while, & responds to i’s punishment so as to minimize wa, ,a,). Given this 
construction, V(y) is increasing as e, > 0, so we consider invasions in which 
the proportion of j-mutants is negligible. 

For such invasions, 


lim, ..lim,., VEXI- = (1 - EXA Ay) + (1- x)o(a,,4,)] + eB, 


whereas lim, ,,lim, ., V(jX1 — 8) =(1 — OR + eT. Since ofa, ,a,) > v(4,,4,), 
$V(i)$ is increasing in $x$, while $V(j)$ is unaffected. Hence as $x$ increases it
follows that $\epsilon$, the invasion size, must increase in order to ensure that
$V(j) > V(i)$. This means that the more efficient i is, the larger the maximal
invasion it can repel under any evolutionary dynamic. Q.E.D.
Sociological investigations of economic exchange reveal how insti-
tutions and social structures shape transaction patterns among ec- 
onomic actors. This article explores how interfirm networks in the 
U.S. venture capital (VC) market affect spatial patterns of exchange. 
Evidence suggests that information about potential investment op- 
portunities generally circulates within geographic and industry 
spaces. In turn, the circumscribed flow of information within these 
spaces contributes to the geographic- and industry-localization of 
VC investments. Empirical analyses demonstrate that the social net- 
works in the VC community—built up through the industry’s ex- 
tensive use of syndicated investing—diffuse information across 
boundaries and therefore expand the spatial radius of exchange. 
Venture capitalists that build axial positions in the industry’s coin- 
vestment network invest more frequently in spatially distant com- 
panies. Thus, variation in actors’ positioning within the structure 
of the market appears to differentiate market participants’ ability 
to overcome boundaries that otherwise would curtail exchange. 


INTRODUCTION 


The role played by geography and social topography in structuring in- 
teraction has long interested sociologists. Beginning with Bossard (1932), 
many studies have investigated the importance of propinquity in deter-
mining the likelihood of friendship and marriage. These studies consis-
tently find that the probability of a relationship increases sharply when 
two individuals live near one another. A parallel line of research estab- 
lishes that the likelihood of forming a social relationship declines as a 
function of distance in social space (Lazarsfeld and Merton 1954; Blau 
1977; Blau and Schwartz 1984). The similar findings relating social and 
physical distance to the likelihood of a relationship reflect the fact that 
both operate by influencing the probability of random interaction. To 
form a relation, two individuals typically must meet in space and time. 
Because both physical and social locations strongly influence people’s 
activities, proximity on these dimensions increases the likelihood of a 
chance encounter (Blau 1977). Although the research on geographic pro- 
pinquity and homophily has focused primarily on the formation of friend- 
ships and marriages, the same processes that localize these forms of in- 
teraction may also structure economic exchange relations in physical and 
social space. 

The literature documents the decline in interpersonal interaction with 
an increase in geographic and social distance, but it has offered few ex- 
planations for heterogeneity in the salience of these dimensions over time 
and across actors. The effect of geographic distance on the likelihood of 
interaction varies from actor to actor. Similarly, demographic character- 
istics differ in the degree to which they structure interaction (Blau and 
Schwartz 1984; McPherson, Popielarz, and Drobnic 1992). In the latter 
case, two factors might explain this variation. First, individuals’ prefer- 
ences for interacting with similar others might produce homophily (La- 
zarsfeld and Merton 1954; Rogers and Kincaid 1981). According to this 
view, differences in the extent to which we observe homophily on so- 
ciodemographic characteristics derive from the underlying preference dis- 
tribution for similarity. Alternatively, others argue that homophily stems 
primarily from the structure of opportunity (Blau 1977). In this perspec- 
tive, sociodemographic dimensions vary in the degree to which they gen- 
erate homophilous interactions based on the salience of these dimensions 
in the arrangement of daily activities. 

Consistent with the second view, we argue that differences in the in- 
fluence of propinquity and homophily in economic exchange systems stem 
from variation across actors in their opportunities to trade. We explore 
this idea in a market context by investigating empirically patterns of 
exchange in the U.S. venture capital (VC) industry from 1986 to 1998. 
Our analyses first demonstrate the prevalence of localized exchange by 
showing that the likelihood that a venture capitalist invests in a new 
venture (“target”) declines sharply with the distance between venture cap- 
italist and target. The data reveal strong evidence of localized exchange 
both in terms of the physical and “industry” distance between the venture 
capital organization and the potential investment target, where the latter
quantity represents the level of dissimilarity between the VC firm’s pre- 
vious investment experiences and the industry classification of a given 
target company. We then show that certain characteristics of a venture 
capitalist’s position in the industry’s information network influence the 
degree of localization in its exchange patterns. Thus, we investigate the 
social structural factors that determine variation in the extent to which 
VC investments are “homophilous” (i.e., localized) with respect to geog-
raphy and industry.

We contend that the structure of relationships in the VC community 
contours the movement of capital for at least two reasons. First, investors 
must be aware of investment opportunities before they may capitalize on 
them. In the public financial markets, many organizations disseminate 
information about potential investments; however, young, private entre- 
preneurial ventures typically fall outside the scope of these organizations’ 
activities. In the absence of public information about early stage com- 
panies, personal and professional relationships provide one of the primary 
vehicles for disseminating timely and reliable information about promising 
new ventures. Second, investors carefully evaluate the quality of an in- 
vestment opportunity before determining to support it. New ventures 
typically represent risky organizational propositions (Stinchcombe 1965). 
Not only do they face liabilities of newness and smallness, but also they 
often operate under unproven business models and in inchoate markets. 
In turn, the lack of a performance history upon which to base quality 
assessments tremendously increases the importance of trust—which we 
know to be built through repeated interaction—in the VC investment 
relation. Both information and trust require social interaction, yet the 
likelihood of this interaction varies across actors. The sociological liter- 
ature on patterns of affiliation consistently finds that two factors, pro- 
pinquity and homophily, influence the likelihood of interaction. 

Because useful information regarding exchange opportunities travels 
across private networks, geographic and industrial spaces—areas within 
which interpersonal ties concentrate—represent spatial dimensions that 
contain the transmission of information about potential investments. We 
conjecture that the relatively more modest flow of information across 
geographic and industrial spaces deters geographically and industrially 
distant investments. Nevertheless, we argue that interfirm relationships 
in the venture capital community effectively reduce spatial limitations on 
the flow of information. Our empirical analyses focus on how the network 
connecting the members of the VC community—built up through the 
industry’s widespread use of syndicated investing—facilitates the diffu- 
sion of information across spatial boundaries, thereby decreasing the 
space-based constraints on economic exchange. Investors that build cen-
tral positions in the syndication network concomitantly extend their access
to information about spatially distant targets and expand the radius of 
their investment activity. Therefore, consistent with the sociologist’s gen- 
eral view of economic transactions, the social structure of the market 
determines the ability of participants to overcome the informational con- 
straints that would otherwise restrict market exchange. 

The analyses presented in this article establish a series of findings that 
contribute to the literature in economic sociology. First, the article rep- 
resents the beginning of a sociological explanation of the spatial organi- 
zation of economic activity. Economic geographers studying the spatial 
concentration of industry frequently call attention to the conundrum that 
production activity often concentrates in high technology industries even 
though the primary production input—knowledge—can be inexpensively 
transported across large distances. In the absence of physical or economic 
constraints on the mobilization of resources through space, contemporary 
economic explanations of spatial concentration generally emphasize pos- 
itive feedback processes that, for example, result from the geographic 
spillover of knowledge across firm boundaries but within small areas 
(Arthur 1990; Krugman 1991). According to this view, a sequential entry 
process leads new firms to locate near existing firms of the same type to 
establish local markets for scarce inputs (such as skilled labor) or to gain 
early exposure to knowledge produced by nearby firms. As we discuss in 
the conclusion, we believe that the spatial clustering of social and pro- 
fessional relations offers an alternative explanation for the spatial con- 
centration of industry. 

We also believe that the article’s findings have implications for regional 
economic development policies. VC firms now play an important role in 
the U.S. economy. In the first half of 1999 alone, VC funds invested more 
than $12.5 billion in early-stage companies. As the industry amasses ever- 
larger pools of capital to dispense, venture capitalists expand their influ- 
ence in determining who receives funding to pursue their entrepreneurial 
visions. To the extent that these spells of entrepreneurship affect socio- 
economic trajectories, venture capitalists become agents for social strat- 
ification. Similarly, VC firms have been critical catalysts in the devel- 
opment of many new high-technology industries. Because young 
companies in these areas make large investments in technology devel- 
opment significantly in advance of their ability to generate the cash flows 
to finance these investments, they must rely on capital infusions from 
venture capitalists and other investors. As these industries become im- 
portant engines for economic growth and wealth creation, access to ven- 
ture capital funding might significantly affect the macroeconomic health 
of regions and nations. 
THE PATTERN OF EXCHANGE IN THE VENTURE CAPITAL
MARKET 


In the last three decades, a new organizational form, the VC firm, has 
emerged as a substantial contributor to the financing of fledgling com-
mercial enterprises. VC firms broker the relationship between investors
and entrepreneurs. Using funds raised primarily from institutional in- 
vestors and wealthy individuals, they search out promising, yet risky, 
investment opportunities. In this search, one might expect that firms 
would benefit by evaluating the broadest possible set of potential in- 
vestments; on average, the best five investments from a set of a thousand 
possibilities should outperform the top five investments from a set of 10 
candidates. Despite the incentives for choosing from a broad array of 
opportunities, however, venture capitalists exhibit highly localized in- 
vestment patterns in both physical and industry space (Gupta and Sap- 
ienza 1992; Norton and Tenenbaum 1993). 
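
The order-statistics intuition behind the comparison of a thousand candidates with 10 can be checked with a small simulation. This is a minimal sketch under a stated assumption: investment returns are treated as independent draws from a standard normal distribution, which is an illustrative choice and not a claim from the article. The function names are hypothetical.

```python
import random
import statistics


def mean_of_top(values, k=5):
    """Average of the k largest values in a pool of candidate returns."""
    return statistics.mean(sorted(values, reverse=True)[:k])


def simulate(pool_size, k=5, trials=300, seed=0):
    """Average, over many simulated pools, of the mean return of the top k."""
    rng = random.Random(seed)
    return statistics.mean(
        mean_of_top([rng.gauss(0.0, 1.0) for _ in range(pool_size)], k)
        for _ in range(trials)
    )


small = simulate(10)     # best five of ten candidates
large = simulate(1000)   # best five of a thousand candidates
print(f"top 5 of 10: {small:.2f}; top 5 of 1000: {large:.2f}")
```

Under these assumptions the larger pool yields the higher average among its five best candidates, which is the incentive that the observed localization of investments runs against.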

Two types of explanations can account for these insular investing strat- 
egies. One set focuses on the preinvestment activities of the venture cap- 
italist, in particular the conditions that favor opportunity identification 
and evaluation. The second set of explanations addresses the postinvest- 
ment role of the VC firm, specifically the ease of monitoring the new 
venture and the facility with which the VC firm can provide value-added 
services. 


The Preinvestment Role: Opportunity Identification and Appraisal 


Finding investment targets entails two important tasks. First, venture 
capitalists must acquire information about the existence and character- 
istics of investment opportunities. Second, they must assess the quality 
of these opportunities. Because each of these tasks becomes increasingly 
difficult at a distance, we believe that even passive investors—those who 
invest without intending to play an active role in managing the new 
venture—will likely invest locally. Interpersonal relations act as primary 
pathways structuring the transmission of information within communities 
of actors. This observation underpins diverse sociological literatures; for 
example, the network-based explanations of the diffusion of innovations 
(Coleman, Katz, and Menzel 1966) and the more general sociological re- 
search on interpersonal influence processes (e.g., Friedkin 1998) reflect 
this fact. The capacity of networks to disseminate reliable information 
also underlies sociological investigations of economic markets. For ex- 
ample, Granovetter’s (1973) seminal article on the strength of weak ties 
motivated a number of studies of how network shapes determine the 
transmission of transaction-relevant information across the participants 
in a market. Many subsequent studies specifically elaborate the mecha-
nisms through which ties among actors affect both the patterns and gov- 
ernance of economic exchanges (e.g., Raub and Weesie 1990; Granovetter 
1985). 

Following this rationale, the structure of social and professional rela- 
tions likely influences which actors in the VC business become aware of 
promising, early-stage investment opportunities. The majority of invest- 
ment targets are small, inchoate entities. Timely information regarding 
high-quality investment opportunities in this domain often reaches a ven- 
ture capitalist through her network. Thus, as economic sociologists have 
noted in other market contexts, the circumscribed diffusion of reliable 
information across networks plays a central role in the formation of 
exchange relations in venture capital. 

The importance of networks in generating investment leads affects the 
spatial distribution of investment activity because social relations tend to 
cluster in both geographic and social spaces. Since the writings of Park 
(1926), studies have found that social actors form ties more frequently 
when they occupy proximate positions in physical space. In human ecol- 
ogy, the law of distance interaction states that the probability of interaction 
between social elements declines as a multiplicative function of the dis- 
tance between them (Hawley 1971). Sociologists believe this law arises in 
large part because the costs of interacting—including finding and screen- 
ing exchange partners and maintaining relationships—increase with dis- 
tance (Zipf 1949). The seminal studies of Festinger, Schachter, and Back
(1950) and the work of the human ecologists have refined these insights
by showing an increased interaction frequency when actors meet in “func- 
tional” space (e.g., when the tenants in an apartment complex meet in the 
laundry room). 

Although only a small body of literature examines how location affects 
the functioning of markets and the organization of economic activity, these 
studies reveal strong spatial effects (cf. Bothner 2000). In an analysis of 
the geographic diffusion of labor unions in Sweden, Hedström (1994) 
demonstrates that the geographic configuration of communication net- 
works in the Swedish population can explain the spatial contagion in the 
establishment of these organizations. Research on interlocks among cor- 
porate boards created by overlapping memberships shows that spatially 
proximate companies are more likely to share directors (Mintz and 
Schwartz 1985; Kono, Palmer, Friedland, and Zafonte 1998). Studies of 
communication patterns within organizations observe that employees in- 
teract more frequently with coworkers in nearby offices (Allen 1977). 
Baker’s (1984) study of price volatility in an options exchange importantly 
shows that the structure of interaction influences the transmission of in- 
formation in a market context even in a very small physical space: the 
trading floor of a commodities exchange. Quoting an options trader, Baker
(1984, p. 783) states, “Noise, static. The errors increase as an inverse square
of the distance between brokers. . . . You trade with people in close
proximity to reduce this risk.” A growing empirical literature therefore 
establishes that individuals activate geographically localized social net- 
works when they engage in economic exchange. Assuming this holds in 
the VC industry, geographic propinquity should facilitate the first task of 
the venture capitalist: learning about private investment opportunities.

A parallel line of research, beginning with Lazarsfeld and Merton (1954)
but most extensively developed by Blau (1977), shows that social space 
also structures the likelihood of interaction. In social space, differentiation 
along sociodemographic dimensions operates as the equivalent to distance 
in geographic space (Milgram 1967). Relations cluster among similar peo- 
ple because individuals’ sociodemographic attributes determine what they 
do and where they do it. Sociodemographic dimensions act so strongly 
on the structure of interaction that homophily explains most of the social 
structural variation in the 1985 General Social Survey (Marsden 1988; 
Yamaguchi 1990). Many other studies also find proximity in social space 
a salient factor in explaining who interacts with whom (for a review, see 
McPherson and Smith-Lovin [1987]). In our analysis of relationships and
the spatial pattern of economic exchange in venture capital, the social 
space argument suggests that a venture capitalist’s prior experience in a 
particular industry should affect the extensiveness of the venture capi- 
talist’s personal contact network among entrepreneurs and other investors 
in that industry. Having many contacts in turn facilitates the identification 
of new investment opportunities. Thus, experience in an industry may 
lead to specialization among venture capitalists along this dimension.²

In addition to identifying investment opportunities, venture capitalists
with deep contact networks in an industry or a geographic area can often 
better assess the veracity of the information they receive about the quality 
of an investment opportunity. As the economics and practitioner litera- 
tures on venture capital often note, information asymmetries make op- 
portunity appraisal essential in this context. Information asymmetries exist 
because entrepreneurs know more than venture capitalists about the op- 
portunities they seek funding to pursue. Moreover, venture capitalists 
cannot simply rely upon entrepreneurs for accurate information about the 
quality of their business plans; this information may be tainted because 


2 The value of industry-specific knowledge in evaluating investment opportunities of-
fers an obvious alternative explanation for industry specialization among VC firms.
Because our primary interest lies in the factors that differentiate the degree of industrial 
specialization across firms, we make no attempt to discriminate among competing 
explanations for the “main effect” of industry specialization. 
entrepreneurs sometimes overstate the attractiveness of their proposals to
secure funding and to obtain high valuations for their incipient companies.
Thus, venture capitalists must guard against the hazard that entrepre-
neurs might try to sell them a “lemon” (Amit, Glosten, and Muller 1990).³

At least two dimensions of a venture capitalist’s contact network con- 
tribute to the localization of investments by influencing the venture cap- 
italist’s ability to appraise investment opportunities under asymmetric 
information. First, individuals have greater confidence in information 
collected from trusted parties. Consistent with this disposition, reports on 
the VC industry indicate that venture capitalists repeatedly finance in- 
vestments that they learn about through referrals from close contacts, 
including entrepreneurs that the capitalist previously financed, fellow ven-
ture capitalists, family members, or friends (Fried and Hisrich 1994).
These individuals have an interest in conveying accurate information and 
bringing high-quality investment opportunities to the attention of the 
venture capitalist because they typically wish to maintain an ongoing 
relationship with the venture capitalist.⁴ Second, lacking a strong tie,
multiple and corresponding sources of information might offer the venture 
capitalist some assurance regarding the quality of a potential investment. 

The density of strong and redundant ties likely declines particularly 
sharply in distance. Almost by definition, maintaining strong ties requires 
frequent interaction. The likelihood of this interaction decreases rapidly 
with distance—indeed, much more rapidly than the dissipation of weak 
ties that require less time and energy to maintain. Similarly, redundant 
ties imply the existence of multiple and duplicate information sources. 
Assuming, as we do, spatial constraints on tie formation and maintenance, 
the likelihood of receiving information about a potential entrepreneur 
from two or more sources will decrease even more rapidly in distance 
than the likelihood of having a single information path. 
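
The claim that redundant information paths thin out faster than single paths can be illustrated with a simple probability sketch. The assumptions here are ours and purely illustrative: each of n potential contacts independently links investor and target with probability exp(-distance/scale), and the scale, the number of contacts, and the function names are all hypothetical.

```python
import math


def tie_probability(distance, scale=100.0):
    """Hypothetical chance that one given contact links investor and target."""
    return math.exp(-distance / scale)


def at_least(k, n, p):
    """P(at least k of n independent contacts are active), from the binomial."""
    return sum(
        math.comb(n, m) * p**m * (1 - p) ** (n - m) for m in range(k, n + 1)
    )


n_contacts = 5
for d in (10, 100, 300):
    p = tie_probability(d)
    one = at_least(1, n_contacts, p)   # at least one information path
    two = at_least(2, n_contacts, p)   # at least two corroborating paths
    print(f"distance {d}: P(>=1) = {one:.3f}, P(>=2) = {two:.3f}, "
          f"ratio = {two / one:.3f}")
```

In this setup the ratio of the two probabilities falls as distance grows, so corroborated information becomes scarce more quickly than information of any kind.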


The Postinvestment Role: Monitoring and Advising New Ventures 


After investing in a startup, venture capitalists perform two important 
functions. First, they monitor their investments. Because venture capi- 
talists make substantial investments in young companies with managers 


3 Akerlof (1970) first posed the lemons problem using the used car market as an example. 
Amit et al. (1990) extend this logic to venture capital investing. 

4 Among others, Blau (1964), Coleman (1990), and Burt (1992) offer general discussions
of the social structural underpinnings of referral processes. Coleman (1990, pp. 180-188),
e.g., describes “Intermediaries in trust” as individuals who certify the trustworthiness
of one individual to another. Accordingly, “The advisor’s only stock of trade is the 
credibility of his advice, and if his advice proves incorrect, his loss is in the trust- 
worthiness of his judgment in the eyes of those he has advised” (p. 181). 
whose interests may conflict with the venture capitalists’ objectives, ven-
ture capitalists actively monitor their investments to mitigate agency prob- 
lems. Meanwhile, the survey-based and ethnographic accounts of entre- 
preneurship scholars, as well as the business press and writings of 
practitioners, emphasize the value that venture capitalists add to early- 
stage companies by providing expertise and social capital. 

A substantial body of (primarily theoretical) work in corporate finance 
concerns the optimal design of contracts between venture capitalists and 
target companies to attenuate the agency problems inherent in providing 
capital to new ventures (for a review, see Kaplan and Strömberg [1999]).
Venture capitalists must contend with the possibility that entrepreneurs 
might pursue their own interests or reduce effort once they receive cash 
from their financiers (Jensen 1986). Several aspects of the contracts be- 
tween venture capitalists and the entrepreneurs they fund, such as staged 
financing (Amit et al. 1990; Gompers 1995; Bergemann and Hege 1998) 
and the allocation of control rights (Hellman 1998),⁵ help mitigate this
concern. Although these contracts reduce the need for monitoring, they 
do not eliminate it. Thus, monitoring the managers of their portfolio 
companies remains an important postinvestment activity for the venture 
capitalist. 

Practitioner accounts tend to emphasize the services venture capitalists 
provide to the companies they support. According to an oft-repeated in-
dustry adage: “It isn’t getting the money, it’s who the money comes from.”
This statement refers to the value-added services that venture capitalists 
offer and the endorsement value derived from having high status financial 
backers. Obviously, VC firms provide financial expertise; they also rou- 
tinely act as general management consultants, providing advice on stra- 
tegic and operational issues (Bygrave and Timmons 1992). Many venture 
capitalists successfully started and ran their own companies before en- 
tering the industry. Thus, they have experience in many of the manage- 
ment and strategic issues confronting the entrepreneurs they back. In 
addition, because of their present and past financing activities and prior 
business experiences, venture capitalists hold abundant social capital that 
they can make available to the companies in which they invest (Stuart, 
Hoang, and Hybels 1999). 

Spatial proximity to the target facilitates the execution of both of the 
venture capitalists’ postinvestment roles. Monitoring requires frequent 
visits to company operations. As Gorman and Sahlman (1989) report, 


5 Venture capitalists secure several contractually specified control rights when they
invest in early-stage companies. For example, they typically receive board seats and 
the power to replace the venture's management team if it fails to meet financial or 
other performance-based milestones. 



Distribution of Venture Capital 


venture capitalists spend an average of four to five hours per month on 
site at each of the companies in which they play a lead role. Even when 
another VC firm leads the investing, a venture capitalist will still typically 
visit the company at least once per quarter. In total, monitoring and 
advising occupies about half of the venture capitalist’s time (Gorman and 
Sahlman 1989). Obviously, time spent in transit reduces the number of 
companies that an individual can monitor; thus, geographic proximity
reduces the time costs of monitoring. Venture capitalists with limited past 
experience in the target’s industry also find monitoring more challenging. 
Knowledge regarding the target company’s industry allows the venture 
capitalist to oversee investments more efficiently and more effectively, in 
part because industry experience enhances the venture capitalist’s ability 
to recognize signs of trouble at an early stage. 

Similarly, the advisory function becomes more difficult, and may be 
less valuable, when venture capitalists are separated from targets by large 
physical or industry distances. Venture capitalists can offer more assis- 
tance to targets when they interact with startups’ management frequently 
and in person. Familiarity with the business issues confronting new ven- 
tures requires continual interaction, so spatial propinquity also facilitates 
advising. Similarly, the more extensive the venture capitalist’s past ex- 
perience in the target firm’s industry, the more industry-specific expertise 
and the greater the industry-specific social capital the venture capitalist 
can provide. 

Because both the preinvestment activities (opportunity identification 
and appraisal) and the postinvestment roles (monitoring and the provision 
of value-added services) favor local investing, we anticipate finding this 
pattern in the data. Nevertheless, because most industry participants 
would expect to observe this phenomenon and because of the range of 
potential explanations for it, the limited spatial reach of exchange in the 
venture capital investing relation does not, in and of itself, constitute a 
particularly significant finding. This pattern could result from boundaries 
around the flow of information about investment opportunities, from a 
rational response to the costs of monitoring even with perfect information, 
or simply from satisficing behavior that engenders local search. Given the 
obstacles to discriminating among the multiplicity of factors that all pre- 
dict local investing patterns, we focus instead on the factors that influence 
the sensitivity of venture capitalists’ investments to target company prox- 
imity. In other words, we investigate the factors that lead venture capi- 
talists to extend the reach of their investments beyond their surrounding 
geographic and industrial neighborhoods. Fortunately, the determinants 
of the spatial sensitivity of venture capitalists also provide some purchase 
for discriminating between competing explanations for the localization of 
investments. 

Determinants of the Spatial Reach of VC Investments 


Having established the reasons to expect localized investment patterns, 
we now develop specific predictions relating attributes of VC firms, char- 
acteristics of their information network, and features of specific VC-target 
pairings to the probability that a venture capitalist will invest in a geo- 
graphically distant target or one in an industry in which the venture 
capitalist has had little prior investment experience. Although our data 
provide strong evidence of local exchanges, VC firms vary significantly 
in their proclivity to invest in proximate targets. We begin by proposing 
VC age and investment experience as factors explaining this variance, 
but we intend to show that tenure in the industry and experience proxy 
for the expansion of a venture capitalist’s contact network and the growth 
of its reputation, which develop through time and the accrual of an in- 
vestment track record. If our conjecture holds, the effects of tenure and 
experience should attenuate when we account explicitly for this network 
development. 

VC age.—The age of a venture capital organization captures at least 
four dimensions of tenure in the industry. First, as firms age, their members 
probably extend their networks both within the venture capital com- 
munity and among entrepreneurs in a range of industries. Second, even 
without forming new ties, the spatial reach of a venture capitalist’s contact 
network likely increases over time as geographic and social mobility pro- 
duce spatial dispersion among the contacts in their network. Third, as 
they age, venture capitalists also accumulate experience in evaluating 
business proposals and entrepreneurs that could improve the venture cap- 
italist’s ability to perform these tasks at a distance. Fourth, long-tenured 
firms in the industry often become widely known, increasing the likelihood 
that other VC firms will bring good investment opportunities to them. As 
our argument develops, we hope to disentangle the effects of networks 
and experience captured by firm age, but age might pick up residual 
effects from these processes. Thus, we anticipate, older VC firms will fund 
distant targets more frequently than young VC firms. 

VC experience.—As the discussion of VC age implies, age captures the 
combined effects of a number of processes that produce systematic changes 
in organizations as they mature (Hannan 1998). One accompaniment of 
age is the accumulation of experience. Over time, the experience that 
venture capitalists accrue can alter the influence of distance in the in- 
vestment decision in at least three ways. First, experience might reduce 
the costs of monitoring at a distance as venture capitalists become more 
adept at this task. Similarly, as they gain confidence in their ability to 
evaluate investment opportunities and entrepreneurs, they might grow 
less dependent on trustworthy or redundant information sources to appraise 
the quality of investment candidates. Finally and most relevant to 
our argument, in the course of their investments, venture capitalists de- 
velop relationships with other VC firms and with experts and entrepre- 
neurs in the industries in which they repeatedly invest. These networks 
provide privileged access to information about promising investments. 
Let us consider each of these in detail. 

A substantial literature on organizational learning suggests that past 
experience influences organizational behavior (March 1988). As venture 
capitalists gain experience monitoring their investments, they develop 
competence in activities such as writing effective contracts to minimize 
agency problems and recognizing the signs that forewarn of problems at 
the companies in which they invest. Effective monitoring also requires 
insight into the link between effort and outcome, which practice cultivates. 
An improvement in the efficiency of performing these activities reduces 
the time that the venture capitalist must spend monitoring, which should 
in turn enable the venture capitalist to invest in more distant firms because 
of a reduction in the time costs of monitoring at a distance. 

In addition to improving their monitoring ability, experience also hones 
venture capitalists’ ability to appraise potential investments. As they eval- 
uate more business plans and directly observe more early-stage companies, 
venture capitalists may gain a better understanding of the factors that 
lead to success and failure in general and within a particular industry. 
As venture capitalists improve their acumen in evaluating investment 
opportunities, they might become more willing to invest based on information 
acquired from nonredundant and weak ties.⁶ Although weakly 
affiliated actors may lack the incentive to refer only high quality invest- 
ments (Fried and Hisrich 1994), venture capitalists might compensate by 
relying more heavily on their ability to discern quality differences among 
entrepreneurs and their business plans. Since venture capitalists likely 
have only nonredundant and indirect ties to informants on the quality of 
geographically distant targets or those in industries in which they lack 
previous investment experience, this shift toward a reliance on internal 
(rather than network-based) evaluation techniques disproportionately fa- 
vors the consideration of far away targets. Therefore, venture capitalists 
may become less sensitive to distance as they accrue investment 
experience. 

Beyond the changing character and quality of the information venture 


6 The decision-making literature counsels that a change in the willingness to rely on 
one’s own judgment does not necessarily constitute a rational shift in behavior. Research 
shows that individuals become more comfortable engaging in an activity simply 
by doing it (Bandura 1986), especially when the feedback regarding success lies 
chronologically distant from the activity. Thus, venture capitalists might judge themselves 
to be better investors regardless of any real improvement in their selection ability. 
capitalists will consider in evaluating potential investments, the accrual 
of prior experience also expands a venture capitalist’s access to infor- 
mation. For many reasons, each investment extends the venture capi- 
talist’s information network. First, the employees of previously funded 
companies occasionally launch new startups, since entrepreneurs in a 
particular industry tend to emerge from the rosters of organizations al- 
ready operating in that industry (Sorenson and Audia 2000). Also, the 
executives and employees at funded companies may forward potential 
investment opportunities that they learn about through friends, relatives 
and coworkers. To the degree that entrepreneurial activity forms a salient 
dimension in structuring social interaction, entrepreneurs will know other 
entrepreneurs, and thus they will be fecund sources for referrals. The 
contacts emerging from the associations generated by past financing re- 
lationships also offer another potential information advantage to venture 
capitalists: when investing within the same industry, these contacts offer 
privileged access to expert advice. Hence, we expect, prior investment 
experience in an industry increases the geographic reach of investments 
within the industry. 

Although we expect stronger experience effects when venture capitalists 
consider investments within the same industries as their prior investments, 
each of the aforementioned arguments could affect a VC firm’s invest- 
ments even outside of the industries in which it has accumulated expe- 
rience. For example, some aspects of monitoring require knowledge spe- 
cific to a particular industry, but others should apply generically to the 
monitoring of any business venture. Similarly, a portion of the evaluation 
of any entrepreneurial venture involves aspects of the business plan and 
the capabilities of the founding team not specific to any particular industry. 
Moreover, entrepreneurs at the target companies in which a venture cap- 
italist invests may forward referrals that extend beyond the industries in 
which they work. Therefore, we anticipate, venture capitalists with ex- 
tensive investment experience will more likely invest in both distant in- 
dustries and locations. 

Target company stage.—The difficulty of opportunity appraisal and the 
importance of monitoring vary with the target company’s development 
stage. Evaluating extremely early-stage companies proves difficult be- 
cause they lack track records for making informed quality assessments. 
In contrast, venture capitalists can judge the quality of the management 
team in light of its performance on a number of different performance 
metrics in later stage companies. In addition to offering more data to 
inform the due diligence process, later stage companies might also require 
less intensive monitoring. As organizational routines and policies evolve 
and management becomes better established, new ventures typically op- 
erate with greater reliability (Stinchcombe 1965; Hannan and Freeman 
1984). Both of these observations suggest, more mature target companies 
receive investments from geographically or industrially distant VC firms 
more often than early-stage ventures. 

Syndicate networks in venture capital.—New ventures frequently obtain 
funding from syndicates of investors, implying that they receive financing 
from more than one VC firm, often garnering multiple investors 
even in the same financing round.⁷ Many factors justify this practice: 
syndicates diversify risk by enabling venture capitalists to invest smaller 
amounts in a greater number of companies (Wilson 1968), they mitigate 
the information asymmetries between the initial investor and later round 
investors (Admati and Pfleiderer 1994), and they leverage investment eval- 
uation skills across coalitions of firms (Sah and Stiglitz 1986; Lerner 1994). 
Lerner (1994) also suggests that syndication might allow venture capi- 
talists to “window dress” the results they present to their investors, which 
they accomplish by gaining late-stage access to “hot” firms even though 
much of the capital appreciation in these companies has already occurred. 
This allows venture capitalists to report that they funded star companies 
when they attempt to raise future pools of capital. 

Regardless of the particular motives for syndicating investments, the 
frequent reliance on investment coalitions to fund target companies creates 
a dense interfirm network that expands the distance that information 
travels through the venture capital community. Each time a VC firm 
enters a syndicate, it develops new associations with, or strengthens ex- 
isting relations with, other VC organizations. Thus, the practice of syn- 
dication creates a network of relations within the VC community that 
structures the flow of information across market participants. Likewise, 
repeated transactions with previous syndicate partners build trust be- 
tween the focal venture capitalist and other VC organizations. As the 
relations between the reference firm and other members of the VC com- 
munity strengthen and broaden, so too does (a) the chance that the ref- 
erence venture capitalist will be invited to join future syndicates, and (b) 
the trust that it places in the monitoring and due diligence capabilities 
of fellow syndicate members. Thus, these connections enable a focal ven- 
ture capitalist to rely upon the opportunity recognition and monitoring 
capabilities of colleagues. We therefore expect: VC firms will more likely 
fund a spatially distant target if they have previously coinvested with 
another firm that invested in the target company. 

From the venture capitalist’s vantage point, having a previous 


7 Syndicates financed slightly more than two-thirds of the 7,590 VC-backed firms in 
the data we analyze. The average target received investments from 5.3 venture cap- 
italists. Thus, the typical investment leads to an association between a reference venture 
capitalist and 4.3 other VC firms. 
exchange partner in an investment syndicate increases the reference venture 
capitalist’s probability of investing over a significant distance most 
when the exchange partner lies in close spatial proximity to the target. If 
a venture capitalist trusts another member in a syndicate and its well-regarded 
colleague resides near to the target, the venture capitalist can 
rely on the due diligence, monitoring, and advisory capabilities of its 
trusted partner.⁸ This suggests, VC firms will more likely fund a spatially 
distant target when a previous coinvestment partner belongs to the syndicate 
for the target and is located near to the target in geographic or 
industrial space. 

Finally, the centrality of the venture capitalist’s position in the in- 
dustry’s coinvestment network could affect its sensitivity to target firm 
distance for several reasons. First, assuming that information about in- 
vestment opportunities flows through the ties in the syndication network, 
venture capitalists with large numbers of coinvestment relations with 
other organizations will have access to a significant amount of information 
about investment opportunities, particularly if they associate with other 
central actors (Bonacich 1987). This implies that central venture capitalists 
will be most aware of investment opportunities outside of their immediate 
geographic region and industrial foci. Second, centrally positioned actors 
enjoy high status because they occupy prominent structural positions in 
the community’s syndication network (Podolny 1993). High-status venture 
capitalists likely receive many invitations to join investment syndicates 
because of the legitimacy they confer on the VC firms and the targets 
with whom they associate (Freeman 1999). The visibility bestowed by 
high-status venture capitalists through the investment relation particu- 
larly benefits early-stage companies that lack a proven performance track 
record and a strong identity among potential investors (Stuart, Hoang, 
and Hybels 1999). Thus, we posit, VC firms centrally positioned in the 
VC community's coinvestment network will more likely fund spatially 
distant targets. 

To recapitulate, we begin our empirical analyses by establishing that 
VC firms typically invest locally. Then, we show that well-established 
and highly experienced venture capitalists exhibit more dispersed in- 
vestment patterns. Finally, we examine how the position of VC firms in 
the industry’s evolving coinvestment network affects the proclivity to 
invest in spatially distant targets. 


8 The frequent appearance of serial syndicate partners in the data (the same pairs of 
VC firms investing together in multiple transactions) suggests the operation of an active 
norm of reciprocity in the industry, in which venture capitalists build relationships by 
sharing high-quality deals with established exchange partners. It also suggests the need 
for studies that compare network extension strategies against relationship strengthening 
strategies. 

EMPIRICAL STRATEGY 


Our analysis addresses the determinants of financing relations between 
venture capitalists and startup companies. In essence, the empirical com- 
ponent is a network study of tie formation; we model the probability that 
a particular VC firm invests in a given target company. Many studies of 
tie formation analyze every possible dyad and use logistic regression to 
estimate the effects of a covariate vector on the likelihood of a tie (e.g., 
Podolny 1994; Gulati 1995; Stuart 1998). This strategy creates two prob- 
lems. Methodologically, it does not correctly account for nonindependence 
across cases, as each firm enters the analysis many, many times.⁹ The 
large number of repeat occurrences of each firm can lead to systematic 
underestimation of the standard errors for firm attributes that do not 
change from dyad to dyad. Pragmatically, this strategy presents a second 
obstacle; the observation of all possible dyads can prove burdensome 
computationally, especially for large networks. For example, consideration 
of all potential dyads in our data would require us to create a matrix 
with more than 6 million cells. Since many of our variables require 
cell-by-cell calculation, this would result in a nearly hundredfold increase in 
the time required for variable construction. 

Sampling randomly from the set of potential VC-target dyads offers 
one potential solution to these issues. Nevertheless, this approach falls 
short of the ideal because it ignores the fact that the realized ties provide 
most of the information for the estimation of the factors that affect tie 
likelihood (Coslett 1981; Imbens 1992; Lancaster and Imbens 1996). Thus, 
we include all cases of funding relations that actually appear in the data. 
For comparison, we then create a matched sample of potential ties that 
did not occur. To control for temporal variation in the supply of and 
demand for venture capital, we create the matched sample by pairing 
each VC firm that funded a startup in a given quarter of a calendar year 
with a startup funded by a different venture capitalist in that same quarter.¹⁰ 
This approach substantially reduces the problems associated with 
multiple firm observations; the average VC firm now enters the analyzed 
data matrix 78 times, as opposed to more than 6,000 times in the matrix 
of all potential funding relations. To address the fact that venture capi- 
talists enter the data more than once, we report robust standard errors 


9 Doreian (1981), Lincoln (1984), Krackhardt (1988), and Mizruchi (1989) discuss various 
approaches for addressing the nonindependence problem. 

10 Using target firms that received funding from different venture capital organizations 
as the sampling frame for the comparison set eliminates the issue of potential quality 
differences between the funding relationships that exist and those that never materialize, 
as all target companies in the data (by definition of the sampling criterion) pass 
the quality threshold necessary to receive venture funding. 
estimated without the assumption of independence across observations 
on the same VC firm. 

The use of a matched sample introduces one new problem. Logistic 
regression can yield biased estimates when the proportion of positive 
outcomes in the sample does not match the proportion of positive out- 
comes in the population. Because logistic regression is a multiplicative 
model, this bias does not simply affect the intercept term. Rather, bias 
can affect all coefficient estimates. In particular, uncorrected logistic re- 
gression using a matched sample tends to produce underestimates of the 
factors that predict a positive outcome (King and Zeng 2001). Large sam- 
ples do not necessarily alleviate this problem. To correct for this potential 
bias, we adjust the coefficient estimates using the method proposed by 
King and Zeng (2001) for the logistic regression of rare events. 

The traditional logistic regression model considers the dichotomous outcome 
variable to be a Bernoulli probability function that takes a value 
“1” with the probability $\pi_i$: 

$$\pi_i = \frac{1}{1 + e^{-X_i \beta}},$$ 

where $X$ is a vector of covariates and $\beta$ is a vector of parameters. Researchers 
typically use maximum-likelihood methods to estimate $\beta$. King 
and Zeng (2001) prove that the following weighted least squares expression 
estimates the bias in $\hat{\beta}$ generated by oversampling rare events: 


$$\mathrm{bias}(\hat{\beta}) = (X'WX)^{-1} X'W\xi,$$ 

where $\xi_i = 0.5\,Q_{ii}[(1 + w_1)\hat{\pi}_i - w_1]$, the $Q_{ii}$ are the diagonal elements of 
$Q = X(X'WX)^{-1}X'$, $W = \mathrm{diag}\{\hat{\pi}_i(1 - \hat{\pi}_i)w_i\}$, and $w_1$ represents the fraction 
of ones (events) in the sample relative to the fraction in the population. 
Essentially, the user regresses the independent variables on the residuals 
using $W$ as the weighting factor. Tomz implements this method in the 
relogit Stata procedure.¹¹ 
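
A worked special case may help fix the notation of the bias expression, $\mathrm{bias}(\hat{\beta}) = (X'WX)^{-1}X'W\xi$ with $\xi_i = 0.5\,Q_{ii}[(1 + w_1)\hat{\pi}_i - w_1]$. It is an illustration under simplifying assumptions that do not come from our analysis: an intercept-only model (so $X$ is a column of $n$ ones), a common fitted probability $\hat{\pi}$ for every case, and all weights set to one ($w_i = w_1 = 1$). 

```latex
\begin{align*}
X'WX &= n\,\hat{\pi}(1-\hat{\pi}), \\
Q_{ii} &= \frac{1}{n\,\hat{\pi}(1-\hat{\pi})}, \\
\xi_i &= 0.5\,Q_{ii}\,(2\hat{\pi}-1)
       = \frac{\hat{\pi}-0.5}{n\,\hat{\pi}(1-\hat{\pi})}, \\
\mathrm{bias}(\hat{\beta}_0) &= (X'WX)^{-1}X'W\xi
       = \frac{\hat{\pi}-0.5}{n\,\hat{\pi}(1-\hat{\pi})}.
\end{align*}
```

With $n = 100$ and $\hat{\pi} = 0.2$, for instance, the expression evaluates to $-0.3/16 \approx -0.019$. In this simplified setting the bias is negative whenever $\hat{\pi} < 0.5$, that is, whenever events are the less common outcome. 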


Data Sources 


Using the Venture Economics database from the Securities Data Cor- 
poration (SDC), we developed a comprehensive data set of venture capital 


11 See Michael Tomz’s “Relogit: Rare Events Logistic Regression” for the Stata ado 
file. Posted October 1999 at http://gking.harvard.edu/stats.html. 
investments from 1986 to 1998.¹² This database purports to record all VC 
firm investments in domestic, private companies. Because each venture 
capitalist’s investment in a target generates a unique record in the da- 
tabase, we know the full history of VC investments in each target. Using 
this information, we create a matched sample by randomly pairing each 
VC firm that made an investment in a particular quarter with a target 
company that it did not fund (i.e., we create a potential but unrealized 
dyad). The matched sample includes 80,406 cases involving 1,025 VC 
firms and 7,590 target companies. Although exactly 50% of the cases form 
investment ties, the population as a whole actually realizes ties in 0.72% 
of the potential dyads. 
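
The matching procedure can be sketched as follows. This is a minimal illustration under stated assumptions, not the code used to build our data: the function name `build_matched_sample`, the `(vc, target, quarter)` tuple layout, and the rule that a candidate must never have been funded by the focal VC firm in any quarter are all hypothetical choices made for the example. 

```python
import random
from collections import defaultdict


def build_matched_sample(investments, seed=0):
    """Pair each realized tie with one unrealized tie from the same quarter.

    investments: list of (vc, target, quarter) tuples for realized ties.
    Returns a list of (vc, target, quarter, tie) rows, where tie is 1 for a
    realized investment and 0 for a sampled, unrealized dyad.
    """
    rng = random.Random(seed)
    targets_by_quarter = defaultdict(set)  # targets funded in each quarter
    funded_by_vc = defaultdict(set)        # every target a VC firm has funded
    for vc, target, quarter in investments:
        targets_by_quarter[quarter].add(target)
        funded_by_vc[vc].add(target)

    rows = []
    for vc, target, quarter in investments:
        rows.append((vc, target, quarter, 1))
        # Candidates: targets funded that quarter by someone else
        # (assumption: exclude any target this VC firm ever funded).
        candidates = sorted(targets_by_quarter[quarter] - funded_by_vc[vc])
        if candidates:  # a quarter with no eligible target yields no match
            rows.append((vc, rng.choice(candidates), quarter, 0))
    return rows


example = [("A", "t1", "1990Q1"), ("B", "t2", "1990Q1"), ("C", "t3", "1990Q2")]
sample = build_matched_sample(example)
```

Because every comparison target received funding from some venture capitalist in the same quarter, the unrealized dyads share the time period of the realized ties they are matched to. 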

From these numbers, one can readily see that each target company 
enters the data more than once. Three factors explain this fact. First, the 
method used for creating the matched sample implies that, on average, 
a target enters the data twice for each VC investment it actually receives. 
Second, most target companies experience more than one financing event 
because venture-backed companies typically receive funding in a sequence 
of capital infusions, known as “rounds.” Each time a company undergoes 
a financing round, venture capitalists subject it to a thorough evaluation. 
Typically, venture capitalists provide small amounts of funding in early 
rounds because they prefer not to make large financial commitments to 
young companies about which they lack a sufficient understanding of 
quality (Gompers 1995). As they learn more about the company, venture 
capitalists then decide to terminate, maintain, or increase their funding 
level. Finally, for the reasons noted above, many startups receive financing 
through syndicates. As a result, each target enters the data an average of 
10.6 times. 


Independent Variables 


Geographic distance.—We measure geographic distance by calculating 
the number of miles between the venture capitalist’s main office and the 
location of the target company.¹³ Practically, we calculate this measure by 


12 SDC's new ventures database reports venture capital investments dating back to 
the early 1970s. However, we must allow for the passage of time to observe the buildup 
of the syndication networks. Thus, we restrict the cases analyzed to the period from 
1986 to 1998, though we use all information in the database to construct the inde- 
pendent variables. 

13 Unfortunately, our data sources do not include information on when or if VC firms 
open satellite offices. Therefore, in some instances (when the VC firm has a second 
office nearer to the target than its head office), the distance that we compute 
overestimates the actual distance between the VC firm and target. The exclusion of this 
information should add measurement error, making our tests of geographic effects less 
precise. 
Fig. 1.—Geographic distance spline: the dotted line shows a 20-piece linear spline of the 
likelihood of investment, while the solid line displays the implied relationship of the logged 
functional form. 


assigning the longitude and latitude at the center of the zip code in which 
they reside to both venture capitalists and targets. Using spherical ge- 
ometry, we calculate the distance between the two points, i and j, as 


$$d_{ij} = C \{\arccos[\sin(\mathrm{lat}_i)\sin(\mathrm{lat}_j) + \cos(\mathrm{lat}_i)\cos(\mathrm{lat}_j)\cos(|\mathrm{long}_i - \mathrm{long}_j|)]\},$$ 


where latitude (lat) and longitude (long) are measured in radians and C 
represents a constant based on the radius of the sphere that converts the 
result into linear units of measure. To convert the result to miles on the 
surface of the Earth, we use C = 3,437.¹⁴ 
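
The distance calculation can be illustrated with a short sketch. The function name `great_circle_distance` and the convention of passing coordinates in degrees are assumptions made for the example; the default C = 3,437 is the value given in the text. That figure is close to the Earth's radius in nautical miles (a statute-mile radius would be roughly 3,959), so the sketch leaves C as a parameter. 

```python
import math


def great_circle_distance(lat_i, long_i, lat_j, long_j, C=3437.0):
    """Spherical law of cosines distance between points i and j.

    Coordinates are given in degrees (an assumption of this sketch) and
    converted to radians, as the formula requires. C scales the central
    angle into linear units.
    """
    lat_i, long_i, lat_j, long_j = map(
        math.radians, (lat_i, long_i, lat_j, long_j)
    )
    cos_angle = (
        math.sin(lat_i) * math.sin(lat_j)
        + math.cos(lat_i) * math.cos(lat_j) * math.cos(abs(long_i - long_j))
    )
    # Guard against rounding pushing the value just outside [-1, 1].
    cos_angle = max(-1.0, min(1.0, cos_angle))
    return C * math.acos(cos_angle)


# Two points on the equator a quarter of the way around the sphere.
quarter_circle = great_circle_distance(0.0, 0.0, 0.0, 90.0)
```

In practice each zip code centroid would supply the latitude and longitude for a VC firm or a target. 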

In the models, we log geographic distance to account for the fact that 
transportation costs, both in terms of time and money, do not increase 
linearly over geographic space. Rather, as distance increases, actors sub- 
stitute technologies to improve the efficiency of transportation and com- 
munication. For example, a person will drive to visit someone 30 miles 
away, but he or she will fly to see a contact 3,000 miles away. Since this 
specification imposes a strict functional form on the relationship between 
distance and tie formation, we fit a spline to the data to check the validity 
of this assumption. Figure 1 presents the predicted relationship between 


14 The scaling (i.e., miles vs. tens of miles vs. kilometers) matters only to calibrate the 
relative importance of being located within the same zip code vs. being located in a 
neighboring zip code (Sorenson and Audia 2000). 
geographic distance and tie likelihood using both the natural log of distance 
and a 20-piece linear spline.¹⁵ The logged functional form fits the 
relationship between distance and investment likelihood quite well. Nev- 
ertheless, the graph appears to show two deviations between the para- 
metric and nonparametric estimates. First, the logged functional form 
declines somewhat more rapidly over the first 60 miles than the actual 
relationship between distance and financing. Presumably, at such short 
distances, little opportunity exists for technology substitution. Thus, trans- 
port costs should relate linearly to distance over this short span. Second, 
the data show a slight bicoastal effect. We believe that the distribution 
of VC firms explains this effect and will elaborate on this contention when 
discussing the results. 

Industry distance.—In addition to the factors that impact VC firms’ 
investments in geographically distant targets, we also explore the deter- 
minants of the “industry distance” between venture capitalists and targets. 
Because entrepreneurs at previously funded startups and contacts generated 
from working in an industry represent important sources for the 
identification of new investment opportunities, industry represents an- 
other dimension that delimits the flow of information. Moreover, venture 
capitalists typically perform due diligence and monitoring more expedi- 
tiously and successfully when they have prior investment experience in 
the target’s industry. To minimize assumptions, we define industry distance 
as the dissimilarity between the industry profile of the venture capitalist’s 
prior investments and the industry of the target firm. Ideally, this 
measure would weight actual differences in the production processes and 
market dynamics of different industries to capture the transferability of 
knowledge across domains, but the construction of any such measure 
would require a host of arbitrary assumptions. To avoid such suppositions, 
we opt to define industry distance as the percentage of previous invest- 
ments that the venture capitalist has made in industries other than the 
one in which the target firm operates: 


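$$\text{industry distance}_{ij} = \frac{\sum_{k \neq m} p_{ik}}{\sum_{k} p_{ik}},$$ 

where i indexes VC firms, m denotes the industry of the startup j, k indexes 
industries, and p represents an array of all prior investments in any industry 
(so that $p_{ik}$ counts firm i's prior investments in industry k). Thus, industry 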
distance ranges from “0,” when all of a venture capitalist’s prior invest- 
ments fall in the target’s industry, to “1,” when the venture capitalist has 
no previous investments in the target’s industry. In assigning firms to 


15 We split the data into 20 equal proportion pieces. Thus, each of the 20 pieces 
represents 5% of the total cases. 
industries, we use the Venture Economics one-digit industry taxonomy 
(this eight-category classification scheme divides firms into the following 
industries: biotechnology, computers, communication, consumer, energy, 
industrial, medical, and other). 
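
As an illustration, industry distance can be computed from a list of the industries of a venture capitalist's prior investments. The function name `industry_distance` and the choice to return `None` for a firm with no prior investments are assumptions of this sketch; the text does not say how such cases are handled. 

```python
def industry_distance(prior_industries, target_industry):
    """Share of a VC firm's prior investments made outside the target's industry.

    prior_industries: one industry label per prior investment.
    Returns a value between 0 and 1, or None when the firm has no prior
    investments (hypothetical handling; the measure is undefined there).
    """
    total = len(prior_industries)
    if total == 0:
        return None
    outside = sum(1 for industry in prior_industries if industry != target_industry)
    return outside / total


# Two of four prior investments fall outside "computers".
history = ["biotechnology", "computers", "computers", "medical"]
distance = industry_distance(history, "computers")
```

The measure equals 0 when every prior investment lies in the target's industry and 1 when none does, matching the range described above. 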

Firm age.—To measure changes in investment patterns as venture cap- 
italists mature, we calculate an age term based on the number of years 
from the VC firm’s founding date to the quarter of the investment. The 
Venture Economics database lacks valid founding dates for VC firms in 
6% of the cases. To avoid dropping these cases from the analysis, we 
assign a founding date to these firms of one year prior to the first investment 
recorded by SDC.¹⁶ 

Experience.—Two variables capture the effects of experience in financing. 
General experience counts the number of companies in which 
the venture capitalist has invested prior to the current quarter. Industry 
experience tallies the number of startups in the same industry as the target 
in which the venture capitalist has invested previously. 

Network position.—We compute three variables to capture different 
aspects of a firm's position in the network linking venture capitalists 
through previous, joint (syndicated) investments in startups. One can 
construe this syndication network as an actor-event network; venture 
capitalists intersect in the target “event” when more than one VC firm 
contributes funding to the same target. The syndicated investment thus 
provides the precipitating event leading to interaction among the venture 
capitalists that jointly finance a target. 

The first variable gauges the degree to which the reference VC organ- 
ization has an established relationship with the other VC firms that have 
also invested in the target. To construct this variable, we first create a 
count of the number of startups in which the focal VC firm has coinvested 
with each of the other VC investors in the target. Then, we average this 
score across all other investors in the target. The following equation defines mean 
affiliation for VC firm i in investment target j: 

$$\text{mean affiliation}_{ij} = \frac{\sum_{k} \sum_{p} x_{ip} x_{kp} x_{kj}}{\sum_{k} x_{kj}},$$ 

where k indexes all VC firms other than i, p indexes all previous investment 
rounds in all target companies, and x takes the value of one when 
an investing relation exists and zero when it does not. The resulting 


16 An analysis of all firms with valid founding dates revealed that the first investment 
occurred 366 days after the founding of the VC firm, on average. We also ran all 
models excluding cases that lack valid founding dates. Excluding these cases does not 
qualitatively change the results. 
quantity captures the average strength of relations between the focal venture 
capitalist and the other members of the investment syndicate for a 
particular target.¹⁷ 
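
A small sketch may clarify the construction of mean affiliation. It assumes, hypothetically, that each VC firm's history is stored as a set of previously funded targets in a dictionary called `prior_targets`, and that a target with no other investors receives a score of zero; neither detail comes from the text. 

```python
def mean_affiliation(focal, other_investors, prior_targets):
    """Average number of prior coinvestments between the focal VC firm and
    each other investor in the target.

    focal: identifier of the reference VC firm.
    other_investors: identifiers of the VC firms invested in the target.
    prior_targets: dict mapping each VC firm to the set of targets it
        funded previously (hypothetical data layout).
    """
    others = [k for k in other_investors if k != focal]
    if not others:
        return 0.0  # assumption: no syndicate partners means no affiliation
    focal_history = prior_targets.get(focal, set())
    # Count of startups in which the focal firm and firm k both invested.
    counts = [len(focal_history & prior_targets.get(k, set())) for k in others]
    return sum(counts) / len(others)


histories = {"A": {"t1", "t2", "t3"}, "B": {"t1", "t2"}, "C": {"t9"}}
score = mean_affiliation("A", ["B", "C"], histories)  # (2 + 0) / 2
```

Replacing the average with a sum or a maximum over the counts would give the alternative specifications mentioned in the accompanying note. 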

The second variable, affiliate distance, captures the possibility that ven- 
ture capitalists might feel that their trusted colleagues can successfully 
monitor and evaluate investments only when they lie close to the target. 
We create two variables to capture the distance between the target firm
and the closest trusted affiliate in the investment syndicate. First, we create
a variable that records the geographic distance of the closest syndicate
partner to the target with whom the reference VC firm has previously
coinvested. Using the inverse of this distance measure allows us to assign
a “0” to investment options where the venture capitalist does not have a
prior affiliate invested in the target. For firm i’s consideration of target
j, this measure can be defined as follows:

$$\text{affiliate distance}_{ij} = \frac{1}{\min_{k}(d_{kj})},$$

where d is the distance metric between venture capitalist k and the target
j when $x_{kj} = 1$ and $\sum_{p} x_{ip} x_{kp} > 0$, where x again denotes the existence
of a relationship between two actors. Second, we generate a parallel measure
to capture the industry distance of the closest prior affiliate in the
syndicate. This measure has a natural upper bound of one, so we do not
need to invert it. For firm i’s consideration of target j, this measure can
be defined as follows:

$$\text{affiliate distance}_{ij} = \min_{k}(d_{kj}),$$

where d is the distance metric between venture capitalist k and the target
j when $x_{kj} = 1$ and $\sum_{p} x_{ip} x_{kp} > 0$, where x again denotes the existence
of a relationship between two actors.
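The two affiliate distance measures can be sketched as follows. The function name and data layout are hypothetical. Two choices in the code are assumptions rather than statements from the text: the industry measure is set to its upper bound of one when no prior affiliate is present, and geographic distances are required to be strictly positive so that the inverse is defined.

```python
def affiliate_distances(focal, syndicate, prior_rounds, geo_dist, ind_dist):
    """Return (inverse geographic distance, industry distance) of the closest prior affiliate.

    geo_dist, ind_dist -- dicts mapping each syndicate member k to its distance from the target
    prior_rounds       -- iterable of sets of VC firms that invested together in earlier rounds
    """
    # A prior affiliate is a syndicate member that has coinvested with the focal firm before.
    affiliates = [
        k for k in syndicate
        if k != focal and any(focal in r and k in r for r in prior_rounds)
    ]
    if not affiliates:
        # Inverse distance of 0 as in the text; industry distance of 1.0 is an assumption.
        return 0.0, 1.0

    nearest_geo = min(geo_dist[k] for k in affiliates)
    if nearest_geo <= 0:
        raise ValueError("geographic distances must be strictly positive")  # assumption
    nearest_ind = min(ind_dist[k] for k in affiliates)
    return 1.0 / nearest_geo, nearest_ind


geo = {"B": 50.0, "C": 5.0}
ind = {"B": 0.4, "C": 0.1}
print(affiliate_distances("A", {"A", "B", "C"}, [{"A", "B"}], geo, ind))
```

In the usage line only B qualifies as a prior affiliate of A, so C's shorter distances are ignored.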


1 This measure assumes the average strength of the relation between a reference 
venture capitalist and the other syndicate members to be important, but several rea- 
sonable variants of this measure exist. For example, the sum of prior coinvestment 
events captures the degree to which having multiple trusted partners matters. Likewise, 
the maximum number of times the reference venture capitalist has coinvested with 
another member of the syndicate measures the degree to which the one most trusted 
partner influences investment patterns. We constructed these alternative specifications.
Since the variables correlate highly and all produce the same pattern of effects in the 
models, we only report results using average relationship strength. 

1% Without inverting the affiliate distance measure, nonexistent affiliates (i.e., when
there is no k in the syndicate for target j with whom VC firm i has a previous
relationship) should lie infinitely distant from the target. Practically, this would require
us to assign some arbitrary large number to these cases. Thus, we invert the measure
to eliminate the possibility that an arbitrary assumption in the coding of these cases
generates the effects.

The third variable, centrality, captures the reach of a focal venture
capitalist’s information network. We compute the centrality of each ven- 
ture capitalist in the syndication network as a measure of the information 
gathering capability of the focal venture capitalist. Because ties to actors 
who occupy central positions themselves increase the reach of a venture 
capitalist’s network more than ties to peripheral players, we compute 
Bonacich’s (1987) two-parameter centrality measure: 


$$c_i(\alpha, \beta) = \sum_{k} (\alpha + \beta c_k)\, r_{ik},$$

where c is the centrality score, r indicates a relationship (coinvestment)
between i and k, α scales the measure, and β weights alters’ centralities.
When the parameter β takes a positive value (as in the present case), each
venture capitalist’s centrality increases as a function of the centralities of
the VC firms with which it shares coinvestment ties (when β = 0, Bonacich’s
centrality measure correlates perfectly with the variable we label
“general experience”). We assume that information travels between VC
firms and, therefore, the amount of information available to the reference
firm increases as a function of the centralities of those with whom it shares
relations. In other words, ties to central firms increase the radius of the
reference firm’s information network. Practically, one solves for centrality
scores by taking the eigenvector associated with the largest eigenvalue of
the relationship matrix. Since the designation of β is arbitrary, we follow
the example of earlier researchers and set it equal to three-quarters of the
reciprocal of the largest eigenvalue (e.g., Podolny 1993). To allow comparison
across networks of different size, we set α such that the mean
centrality score in each period equals “1” (Bonacich 1987).
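A minimal sketch of the two-parameter measure, using only the Python standard library, is given below. It is an illustration under stated assumptions rather than the procedure used in the study: the defining equation is solved by fixed-point iteration (which converges because β is below the reciprocal of the largest eigenvalue), the largest eigenvalue is found by shifted power iteration, and the relationship matrix is assumed to be symmetric and nonnegative. The function names are hypothetical.

```python
def largest_eigenvalue(R, iters=10_000, tol=1e-12):
    """Dominant eigenvalue of a symmetric, nonnegative matrix via power iteration on R + I."""
    n = len(R)
    v = [1.0 / n ** 0.5] * n
    lam = 0.0
    for _ in range(iters):
        # Multiply by (R + I); the shift prevents oscillation on bipartite networks.
        w = [v[i] + sum(R[i][k] * v[k] for k in range(n)) for i in range(n)]
        norm = sum(x * x for x in w) ** 0.5
        v = [x / norm for x in w]
        Rv = [sum(R[i][k] * v[k] for k in range(n)) for i in range(n)]
        new_lam = sum(v[i] * Rv[i] for i in range(n))  # Rayleigh quotient for R itself
        if abs(new_lam - lam) < tol:
            return new_lam
        lam = new_lam
    return lam


def bonacich_centrality(R, beta_fraction=0.75, iters=10_000, tol=1e-12):
    """Solve c_i = sum_k (alpha + beta * c_k) * r_ik, then rescale so the mean score is 1."""
    n = len(R)
    lam = largest_eigenvalue(R)
    beta = beta_fraction / lam if lam > 0 else 0.0
    degree = [sum(row) for row in R]
    c = degree[:]  # exact solution when alpha = 1 and beta = 0
    for _ in range(iters):
        new_c = [degree[i] + beta * sum(R[i][k] * c[k] for k in range(n)) for i in range(n)]
        converged = max(abs(a - b) for a, b in zip(new_c, c)) < tol
        c = new_c
        if converged:
            break
    mean = sum(c) / n
    # The scores are linear in alpha, so dividing by the mean is equivalent to choosing alpha.
    return [x / mean for x in c] if mean > 0 else c


# Star network: firm 0 has coinvested with firms 1, 2, and 3.
star = [[0, 1, 1, 1], [1, 0, 0, 0], [1, 0, 0, 0], [1, 0, 0, 0]]
print(bonacich_centrality(star))
```

Setting `beta_fraction=0` reduces the scores to rescaled tie counts, which is the sense in which the measure then tracks a simple count of prior deals.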

Because we wish to document the factors that alter the spatial range 
of investments, we interact the measures of VC age, experience, and net- 
work position with the industry and geographic distance between VC firm 
and target in the models of the probability of an investment. Although 
our primary interest lies in the direction of the effects on the interaction 
terms, we must include the main effects of all the variables to adjust for 
the impact those factors might have on the intercept (Greene 1993). 


1 The parameter α is set so that the mean centrality score across all venture capitalists
is “1,” but the mean value of the centrality measure in the analyzed dataset exceeds
one because the highly central venture capitalists do more deals on average than do
the less central players. Also, we use all coinvestment relations in the preceding five
years to form the relationship matrix used to compute Bonacich centrality. By only
using five years of coinvestment data, we allow for the fact that relationships erode
over time. If two firms have not coinvested within a five-year period, it seems unlikely
that their members remain close confidants.

Control Variables


Prior investment.—For a variety of reasons, venture capitalists tend
to continue to invest in companies in which they already own equity. 
Bergemann and Hege (1998) argue that maintaining a constant proportion 
of equity provides an optimal solution to the agency problems in the VC- 
target relationship. Thus, one might expect rational firms to participate 
in all subsequent financing rounds to preserve their investment level. 
Psychological forces could also compel venture capitalists to invest re- 
peatedly. Prior investments represent a commitment to the target com- 
pany, and as Staw (1976) shows in a similar context, an unwillingness to 
act upon negative information can result in an escalation of the VC firm’s 
commitment to the target. To control for these factors, we include a 
dummy variable that indicates whether the venture capitalist has previously
invested in the target.

Investment stage.—The forces that drive venture capitalists to make
local investments may operate most strongly in early investment rounds. 
The earlier the developmental stage of the company, the shorter the track 
record of the new venture and the less information available to potential 
investors to assess the startup’s quality. Thus, monitoring and network- 
based evaluation strategies might be more important early in the target’s 
life. We create a dummy variable that indicates whether Venture Economics
classified the investment as early stage.

Supply of active venture capitalists.—Figures 2 and 3 display the location
of all active VC firms and active targets in the United States in
1998. Since neither VC firms nor targets distribute evenly across regions 
or industries in the United States, the local supply of potential investors 
and investment opportunities varies tremendously both by region and 
market. We construct two control variables to control for this heterogeneity.
The first variable measures the richness of the local supply of venture
capital with respect to a focal target firm. We create a localized
venture capital density, vcdens, for target j at time t, which we calculate
using the following equation:

$$\text{vcdens}_{jt} = \sum_{i} \frac{vc_{it}}{1 + d_{ij}},$$


” The social psychological reasons for continuing investment seem to imply that the 
likelihood of future investment should increase with the number of prior investment 
rounds. To test this possibility, we also tried including a second term for the number 
of prior investments beyond the first. This count did not significantly affect the like- 
lihood of investment in any of the models. 

11 We defined the following financing types as early-stage investments: early stage, first 
stage, other early, R&D early stage, seed, and startup. We also generated a second 
dummy variable that indicated whether the investment was the first round of funding 
for a particular company recorded by SDC. The results behave similarly, so we only 
report the models using the early-stage dummy variable.

Fig. 2.—Geographic distribution of venture capital investments. Larger circles (proportionate
to the square root of the number of target companies in a zip code) indicate the
presence of more targets.


where i indexes all venture capital organizations, $vc_{it}$ is the number of
investments that firm i made in period t, and $d_{ij}$ is the distance between
target j and venture capitalist i. This variable sums the inverse distances
between target j and all active VC firms (see Sorenson and Audia [2000]
for a lengthier discussion of geographically weighted density variables).
The analogous variable for the industry-specific supply of venture
capital simply sums the proportion of VC investments in the target’s
industry prior to the current period:

$$\text{vcinddens}_{j} = \sum_{i} (1 - \text{industry distance}_{ij}),$$

where i indexes all venture capital organizations.
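The two supply measures can be sketched in a few lines. The geographic measure is described as a sum of inverse distances weighted by each firm's activity. The 1/(1 + d) form used here is an assumption borrowed from the parallel target density measure, and the function names are hypothetical.

```python
def weighted_density(activity, distances):
    """Sum of activity / (1 + distance) over all counterparties.

    activity  -- e.g., number of investments each VC firm made in the period
    distances -- distance from each of those firms to the focal target, same order
    """
    return sum(a / (1.0 + d) for a, d in zip(activity, distances))


def industry_supply(industry_distances):
    """Sum of (1 - industry distance) over all VC firms, for one target."""
    return sum(1.0 - d for d in industry_distances)


# Two active VC firms: one at the target's location with 2 deals, one 9 units away with 1 deal.
print(weighted_density([2, 1], [0.0, 9.0]))
print(industry_supply([0.2, 1.0, 0.5]))
```

Adding one to the distance keeps the weight finite when a firm and a target share a location, so a co-located firm contributes its full activity count.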

Supply of active targets.—We construct parallel measures to capture
heterogeneity in the demand for venture capital across regions and industries.
We calculate a localized target density, tdens, around any VC
firm i at time t using the following equation:

$$\text{tdens}_{it} = \sum_{j} \frac{f_{jt}}{1 + d_{ij}},$$

where j indexes all targets, f takes a value of “1” if firm j received financing
in period t or “0” otherwise, and $d_{ij}$ denotes the distance between the
venture capitalist i and the target j. This variable sums the inverse distances
between VC firm i and all active targets. The complementary
measure for variation in industry-specific demand for venture capital

Fig. 3.—Geographic distribution of VC firms. Larger diamonds (proportionate to the
square root of the number of venture capitalists in a zip code) indicate the presence of a
larger number of venture capitalists.


weights, from the standpoint of a focal VC firm i, the number of active 
targets in each industry according to i’s past industry investment profile: 


$$\text{tinddens}_{i} = \sum_{m} p_{im} N_{m},$$

where p denotes the proportion of venture capitalist i’s prior investments
in industry m and N represents a count of targets in the current quarter
in industry m.
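The industry-weighted demand measure amounts to a dot product between a firm's past industry profile and the current count of active targets by industry. The sketch below illustrates this; the function names, the industry labels, and the counts are all hypothetical.

```python
from collections import Counter


def industry_profile(past_industries):
    """Proportion of a VC firm's prior investments falling in each industry."""
    counts = Counter(past_industries)
    total = sum(counts.values())
    if total == 0:
        return {}  # assumption: a firm with no history has an empty profile
    return {m: n / total for m, n in counts.items()}


def target_industry_density(profile, active_targets_by_industry):
    """Sum over industries m of p_im * N_m."""
    return sum(p * active_targets_by_industry.get(m, 0) for m, p in profile.items())


# Hypothetical firm with three biotech deals and one software deal in its history.
profile = industry_profile(["bio", "bio", "software", "bio"])
active = {"bio": 40, "software": 100, "energy": 10}
print(target_industry_density(profile, active))
```

Industries in which the firm has never invested, such as "energy" in the usage lines, receive zero weight and do not contribute to the measure.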

Prior state experience.—Venture capitalists may develop new relation- 
ships in a geographic region when they invest in companies in it. If this 
occurs, then an initial investment in a locale should increase the likelihood 
of subsequent investment. We operationalize this idea by including a 
variable that indicates whether the venture capitalist has previously in- 
vested in the same state as the target. To avoid simply picking up prox- 
imity to the venture capitalist’s office, we also include a dummy variable 
that marks whether the target lies in the same state as the venture cap- 
italist. The parallel term for industry distance would essentially dichot- 
omize the industry experience variable, so we do not need to include an 
additional measure in the industry distance models to capture this effect. 
(Table 1 provides the means and standard deviations of all variables used 
in the analysis.) 


RESULTS 


Table 2 reports the results of the rare events logit models for geographic 
distance. Model 1 provides baseline estimates of the probability of a tie

TABLE 1
DESCRIPTIVE STATISTICS

Variable                                   Mean       SD
VC density                                  .58      .23
Target density                              .81      .35
VC industry density                       36.19    29.57
Target industry density                   61.81    38.93
Early stage                                 .15      .35
Prior investment                            .28      .45
ln (distance)                              5.75     2.06
Industry distance                           .77      .24
VC age                                    12.64     9.21
General experience                       162.53   211.24
Industry experience                       34.77    63.36
Mean affiliation                           4.89     6.35
Bonacich power                             1.28     1.83
min (affiliate geographical distance)       .37      .42
min (affiliate industry distance)           .37      .34
Same state                                  .28      .45
Prior state experience dummy                .75      .44





using the local density of VC firms around a target (VC density), the local 
density of active targets around a venture capitalist (target density), 
dummy variables indicating whether the target represents an early-stage 
investment and whether the venture capitalist has made a prior invest- 
ment in the target, the measures of the geographic and industry distance 
separating the venture capitalist from the target, and an interaction be- 
tween the early-stage dummy and the distance of the venture capitalist 
from the target. Consistent with the baseline expectation, the coefficients 
on the two distance variables indicate sharply lower investment proba- 
bilities as the distance between the VC firm and target increases; the 
likelihood that a venture capitalist invests in a target drops sharply as 
the target recedes from the venture capitalist in either geographic or in- 
dustrial space. Thus, the main effect of distance operates as expected. 
The coefficient on the investment stage × distance interaction term suggests
that, as we anticipated, geographic distance reduces the likelihood
of investment even more precipitously if the focal target is at the seed or 
startup stage. Not surprisingly, the baseline also shows that the probability 
of an investment increases substantially if a venture capitalist has invested 
in the target in a previous round. 

Model 2 adds the age of the venture capitalist and an interaction term 
between VC age and the geographic distance between venture capitalist
and target. The positive and significant coefficient on the age × distance
interaction term suggests that older venture capitalists more frequently
make investments in geographically distant targets; the magnitude of the 
negative effect of distance on the probability of investment declines with 
the age of the venture capitalist. Thus, the geographic scope of investment 
activities increases with tenure in the industry. In model 3, we begin to 
disentangle the concurrent processes that lie behind the age-distance in- 
teraction by adding the experience variables. The interactions between 
the two VC firm experience variables and the geographic distance between 
the target and venture capitalist both increase tie likelihood, demonstrat- 
ing that the negative effect of distance on the probability of an investment 
attenuates with experience. Model 3 shows that general investing expe- 
rience captures a larger share of the influence of the maturation process 
on the spatial reach of investments than does industry-specific experience. 
Contrary to our expectation, general investment experience extends the 
geographic scope of investment more substantially than past investment 
experience in the target’s industry.

Model 4 incorporates the network-based measures derived from the 
quarterly coinvestment matrices. After the regressions account for inter- 
actions between distance with (a) the strength of the relationship between 
a reference VC firm and the other members of the investment syndicate 
in a target and (b) the centrality of the VC firm in the coinvestment 
network, the interactions between distance and the age and experience 
variables no longer significantly affect the likelihood of investment. Thus, 
experience primarily influences the geographic scope of investing through 
the development of one’s network through syndication; general experience 
proxies well for the development of a venture capitalist’s network, but 


4 Because of the matched sample methodology, caution should be exercised when 
interpreting the main effects of VC firm attributes (as opposed to dyadic properties) 
on the probability of a tie. For example, it appears in model 3 that the probability
that a venture capitalist will make an investment in a target declines with experience. 
Since the probability of an investment remains relatively constant in each year of the 
data, variables such as age and experience that increase over time will appear to have 
negative effects on the probability of a tie. These terms should primarily be understood
as intercepts for the interaction effects. In fact, in unreported models in which we 
include dummy variables for every calendar quarter, the reported results remain un- 
changed and most of the main effects fall to zero. 

™ The strong general experience interaction might stem from its relationship to the 
size of the VC firm (past deal count, which is the measure of general experience, also
happens to provide the best measure of VC firm size available to us). As we noted
above, in some instances—when the VC firm has a second office nearer to the target 
than its head office—our measure of geographic distance underestimates the actual 
distance between the VC firm and target since we do not know the location(s) of 
satellite offices. Assuming that larger firms more likely have satellite offices, this might 
explain the greater tendency of more experienced firms to invest in distant targets.

Fig. 4.—Investment likelihood by centrality and distance. The solid line indicates the
impact of distance on the probability of investment for a venture capitalist of average
centrality, whereas the dotted lines depict the importance of distance for venture capitalists
1 SD above and below the mean centrality level.


has no net effect when we directly include the network-based variables 
in the models. The interactions in model 4 indicate that a venture capitalist
is more likely to invest in a far-off target when it has previously coinvested
with other members of the syndicate financing that target. At a minimum, 
having other trusted partners in a syndicate augments a venture capi- 
talist’s confidence in the quality of a deal. 

The coefficient on the Bonacich centrality x distance interaction in 
model 4 demonstrates a substantial increase in the geographic range of 
investments among central VC firms. Thus, model 4 reveals that increas- 
ing geographic distance has less of a negative effect on the probability of 
a tie for highly central VC firms. Figure 4 depicts the relationship between 
centrality, geographic distance, and the likelihood that a venture capitalist 
invests in a target. The figure clearly illustrates that the decline in the 
probability of an investment as geographic distance increases is sharpest 
among peripheral (i.e., noncentral) firms. As we discuss in more detail 
below, this finding may stem from either central VC firms’ ability to learn 
about distant targets from their relatively dispersed network or the higher 
status and visibility of central VC firms, leading other venture capitalists 
and entrepreneurs with attractive business plans to seek these same firms 
out for investments. 
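The logic of a distance × centrality interaction in a logit model can be made concrete with a short sketch. All coefficient values below are invented purely for illustration and are not the estimates from table 2; the function names are likewise hypothetical. The point is only that a positive interaction coefficient makes the distance slope less negative as centrality rises.

```python
import math


def investment_probability(ln_distance, centrality,
                           b0=-2.0, b_dist=-0.5, b_cent=-0.1, b_inter=0.05):
    """Logit probability with a distance x centrality interaction (made-up coefficients)."""
    z = (b0 + b_dist * ln_distance + b_cent * centrality
         + b_inter * ln_distance * centrality)
    return 1.0 / (1.0 + math.exp(-z))


def distance_slope(centrality, b_dist=-0.5, b_inter=0.05):
    """Change in log-odds per unit of ln(distance) at a given centrality level."""
    return b_dist + b_inter * centrality


for c in (0.0, 3.0):
    near = investment_probability(3.0, c)
    far = investment_probability(6.0, c)
    print(f"centrality={c}: slope={distance_slope(c):.2f}, near/far ratio={near / far:.2f}")
```

Under these illustrative values, the ratio of near to far investment probabilities is larger for the peripheral firm than for the central one, which is the qualitative pattern the figure describes.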

Model 5 further demonstrates the importance of the composition of the
syndicate for explaining spatial investment patterns. This model includes
a separate term that represents the inverse of the geographic distance
between a “trusted” member of a syndicate and the target, where a previous
coinvestment relation between the venture capitalist in a dyad and
another member of the syndicate indicates trust. Thus, this variable measures
the distance between the target and the closest member of the syndicate
with whom the reference venture capitalist has an established relationship.
The results demonstrate that the probability of an investment
decreases sharply when the closest member of the syndicate with whom 
the venture capitalist has prior experience recedes from the target (recall 
that this variable codes the inverse distance, so an increase in the variable 
implies a decline in the distance between the trusted syndicate member 
and the target). Thus, the likelihood of a reference venture capitalist 
entering a deal increases considerably when it has a trusted colleague both 
in the syndicate and geographically near to the target. We believe that 
venture capitalists’ willingness to rely on the monitoring and advisory 
capabilities of other VC firms when they know them explains this result. 
Hence, VC firms appear to utilize trusted colleagues to identify, screen, 
and monitor potential investments in distant locations. 

The final model in table 2—model 6—introduces two variables: a prior 
state experience dummy (an indicator of whether VC firm i has previously 
invested in the same state as target j) and a dummy indicating whether 
the VC firm and target are located in the same state. We include these 
measures to capture unobserved heterogeneity in VC firms’ geographic 
investment patterns. Not surprisingly, the results show that the likelihood 
of a venture capitalist’s funding a company in a state increases when the 
venture capitalist has previously funded a firm in the state. As the inclu- 
sion of the same state variable shows, location in the same state as the 
venture capitalist does not improve the likelihood of investment beyond 
the effects of geographic distance, thereby increasing our confidence in 
the specification of distance. Moreover, the inclusion of these control var- 
iables does not affect any of the coefficients of substantive interest. 

Table 3 reports the estimates for the models using “industry distance” 
as the measure of distance between the VC firm and target in each dyad. 
Here, we report the same series of distance × age, experience, and network
position interactions, but define distance in terms of the dissimilarity
of the venture capitalist’s industry investment profile from the industry 
of the target. Model 7 in this table forms the baseline and differs from 
model 1 in table 2 only in that it controls for heterogeneity in the supply 
of venture capital and of targets by industry, rather than by geography. 
Model 8 enters the VC age variable and an interaction between age and 
industry distance. Once again, the positive coefficient on the age × distance
interaction in the regressions shows that the negative effect of industry
distance on the probability of an investment attenuates for older
VC firms. In model 9, we include the terms for investing experience. As 
we found in the geography models, the previous investment experience 
variables appear to explain much of the variation in the maturation pro- 
cess with respect to the moderation of distance on the probability of an 
investment: the effect of the age x industry distance variable dissipates 
when we include the investment experience x industry distance 
interactions. 

Model 10 includes the VC network variables and their interactions with 
industry distance, which again behave as anticipated. First, highly central 
VC firms and those that have previous investment experience with other 
members of a syndicate invest in targets at a greater industry distance. 
These results again coincide with the claim that venture capitalists exploit 
their contact network to gain access to deals in new areas. Past syndicate 
partners provide information about new deals and also augment a focal 
venture capitalist’s confidence in the quality of investment candidates. 
However, unlike in the geography models, the experience terms remain 
significant predictors of the industry distance of investments. In model 
11, the addition of a term for the industry distance between the target 
company and the nearest affiliated syndicate partner allows us to inves- 
tigate the importance of syndication in more detail. Again, following the 
same pattern as the geography analysis, the likelihood of investment increases
when a syndicate partner with whom the reference venture capitalist
has prior experience lies near the target in industry space (i.e.,
specializes in the target’s industry). Thus, the findings on how industrial
distance influences the probability that a venture capitalist invests in a 
target almost perfectly parallel the effects in the models that explore the 
influence of geographic distance. 

The control variables generally behave as expected. Prior investment 
in a target has a very large effect on the likelihood of subsequent in- 
vestment. In the industry distance models, a prior investment increases 
the likelihood of future investment in a target by a factor of 50, while in 
the geographic distance models a prior investment increases the proba- 
bility by a factor of 75. The supply and demand variables have incon- 
sistent and generally weak effects, but this should not surprise us because 
the matched sampling design itself substantially controls for these factors. 
The fact that ties appear more likely in early-stage investments merely 
reflects a somewhat lower draw of early-stage investments in the matched 
sample. 


H Given the strength of this effect, we also estimated the models only using the first 
financing round of each target in the data. Qualitatively, the results of models limited
to the initial formation of a tie match the final models presented here identically. 




Distribution of Venture Capital 


All models show that VC firms invest locally both in geographic and 
industrial space. Figure 5 depicts the likelihood of a venture capitalist 
investing in a target company as a function of both distance measures 
(using coefficients from model 6). One can readily observe that the like- 
lihood of an investment decreases rapidly. For example, venture capitalists 
invest in companies 10 miles from their offices at twice the rate of ones 
situated 100 miles away. Similarly, the probability of a financing rela- 
tionship declines quickly with industry distance. A venture capitalist that 
specializes exclusively in the same industry in which the target company 
operates (i.e., it has no prior investments outside the target’s industry) is
nearly six times more likely to invest in that target than a VC firm that 
has never before invested in the target’s industry. 

Because a variety of factors can explain the locality of investment, these 
results come as no surprise (other than perhaps the substantial magnitudes 
of the effects). Yet, the results also show that certain positional charac- 
teristics clearly affect venture capitalists’ likelihood of investing outside 
of their local industrial and geographic neighborhoods. The Bonacich 
centrality measure has the strongest effect in the models. Given the in- 
herent covariation between network centrality and actor prestige in com- 
munications networks, two subtly different processes probably combine 
to produce the overall result. On the one hand, central VC firms can 
deploy the extensive reach of their networks to identify and evaluate 
distant investment opportunities. Thus, highly central venture capitalists 
have the capacity to activate the search and screening process over large 
distances. On the other hand, other VC firms and entrepreneurs, who 
hope to build relations with prestigious venture capitalists, solicit the 
participation of high status VC firms in investment syndicates. Podolny 
(1993), Stuart (1998), and others discuss at length the value implicit in 
connections to highly central actors that, in this context, include enhanced 
legitimacy, quality certification, and access to the extensive social capital 
of central players.

Venture capitalists also look farther afield geographically and in in- 
dustry space when they have prior experience investing with the other 


H As one example, prominent VC firms typically have ongoing relations with presti- 
gious investment banks. As several studies in finance document, “underpricing” in 
initial public offerings (the amount by which the closing price of a newly issued stock
on the first day of public trading exceeds the price at which institutional investors buy
the stock) typically declines when prestigious investment banks underwrite the IPO.
Since underpricing lowers the amount of money raised in the IPO, the new venture 
typically benefits when prestigious investment banks underwrite its IPO. In addition, 
Stuart et al. (1999) show that startups with ties to high-status firms experience higher
rates of IPO. Likewise, Podolny and Castellucci (1999) show that high-status VC firms
can negotiate privileged access to investing in the later rounds of the best deals because
of the many benefits they bring to the syndicate.

Fig. 5.—Investment likelihood by geographic and industry distance. The lines show the
importance of geographic distance from the target on tie likelihood at different levels of
industry distance, from an industry specialist (thin solid line) to an industry novice (thick
solid line).


members of the syndicate. Two related factors—trust and reciproc- 
ity—account for this effect. First, conditional on a willingness to invest 
together in the future, prior coinvesting experience indicates both per- 
ceptions of trust and high competence between two VC organizations. 
Trust plays an important role in loosening localization constraints because 
it allows a venture capitalist firm to rely on the evaluations of another 
investor closer to the target in industrial or physical space. Second, the 
frequent appearance of repeated investments among the same sets of VC 
firms probably reflects an active norm of reciprocity in the VC community. 
VC firms build relationships with one another and then routinely “invite” 
trusted colleagues into new deals. Before the investing firm establishes 
itself in the venture capital community at large, however, it will probably 
not receive invitations from distant venture capitalists unaware of its 
existence or unsure about its competence. Only when a newly established 
venture capitalist identifies promising deals in its local community can it 
begin to establish a position in the industry network by inviting coin- 
vestors into the deal, assuming that it can convince potential investors of 
the investment’s attractiveness. 

The results also establish that venture capitalists care not simply 
whether their coinvestors include trusted associates, but also that a trusted 




[Figure 6 plot: multiplier of tie likelihood (vertical axis) against affiliate distance in miles (horizontal axis, 200 to 2,000).]


Fig. 6.—Investment likelihood by affiliate location. The line depicts the multiplier of tie
likelihood given the affiliate’s geographic distance (in miles) from the prospective target.


associate lies in close geographic proximity to the target company. Figure 
6 shows the relationship between the probability of an investment and 
the distance of a target from the venture capitalist and the nearest syn- 
dicate partner with whom the venture capitalist has experience. From the 
figure, one can clearly observe the preference of venture capitalists to 
enter deals when a trusted party can conveniently monitor the target. 
From a policy perspective, this finding suggests that regions that lack a 
VC community might have difficulty accessing venture capital. Although 
venture capitalists expand the radii of their active investment spaces over 
time, this expansion appears to occur primarily through joining syndicates 
with lead venture capitalists in distant communities. This links the ven- 
ture capital communities within the United States and possibly diversifies 
the risk specific to regional economic cycles.” Nevertheless, it does not 
open access to venture funding to those communities and potential en- 
trepreneurs that lie distant from venture capitalists’ offices. 


% This factor probably explains the bicoastal effect we observed in fig. 1 (a spike in
the probability of investment in the 2,200-2,400 mile range). In the United States, the
heaviest concentrations of VC firms lie in the coastal regions (in particular, Silicon
Valley, San Francisco, New York, and Boston). The linkages among these firms created
by coinvestment facilitate joint investments by East and West Coast VC firms. Our
findings suggest that syndicates of East and West Coast firms occur most often when
one of the members lies in close proximity to the target and has had previous dealings
with its counterparts on the opposite coast.

CONCLUSION


Observers of economic geography often note that advances in commu- 
nications media and methods of transportation paradoxically have done 
little to efface the spatial concentration of many industries: from film 
production and textiles to semiconductor chips and biotechnology, pro- 
duction concentrates in particular regions. Despite these communication 
advances, we believe that inherent boundaries around the flow of timely, 
reliable, and high-quality information produce localized patterns of 
exchange. These boundaries exist because interpersonal social relations 
concentrate within industries and regions more often than they bridge 
industrial and regional boundaries. This observation reflects the simple 
fact that people converge in space and time more frequently when they 
live near one another and have occasion to meet in the course of work 
and play. Although the mass media and weak interpersonal ties routinely 
carry information across regional and community boundaries, we believe 
that the VC relation and other forms of exchange critical to the entre- 
preneurial process depend upon strong and embedded relations among 
the relevant actors. Because high uncertainty and significant information 
imbalances between market participants characterize this domain of ac- 
tivity, close and dense relationships among the relevant parties provide 
a critical catalyst to the process of mobilizing resources to build 
organizations. 

Whenever personal and professional networks play a central role in 
economic activity, we will likely observe spatial patterns in the unfolding 
of that activity. We have demonstrated the existence of these spatial pat- 
terns in the investments venture capitalists make. We have also shown 
that the evolution of interfirm relationships in the VC community appears 
to provide the mechanism for the erosion of geographic and industrial 
boundaries in the dispensation of a venture capitalist’s funds. VC firms 
with a history of provincial investment patterns and those without central 
positions in the industry’s coinvestment network tend to invest locally; 
those who have established many and dispersed relationships with other 
VC firms invest across geographic and industrial spaces more frequently. 
More generally, we believe that institutions supported by broad partici- 
pation among market incumbents must precede the expansion of the spa- 
tial range of exchange in markets that rely on private information or 
require a high degree of trust for transactions to occur. In venture capital, 
the industrywide coinvestment network provides this institutional 
infrastructure. 

We believe that the exploration of when and how the local character 
of relationships intervenes in the functioning of markets offers an im- 
portant avenue for further research. For example, anecdotal evidence 
suggests that the founders of early-stage companies employ network-based
recruiting strategies to staff their fledgling enterprises. Indeed, one need 
look no further than the semiconductor industry in Silicon Valley, in which 
dozens of firms including Intel can trace their origins to the departure of 
key technologists from an established company (Fairchild Semiconductor) 
and whose founders then recruited friends and former colleagues to staff 
their new ventures. If founders’ networks concentrate in space, then the 
reliance on network-based recruiting strategies may partially account for 
the geographic clustering of high-technology industries. 

Before concluding, we wish to highlight again that the positional char- 
acteristics have virtually identical effects in moderating the influence of 
the two measures of distance—geographic and industrial—on the prob- 
ability of an investment in a VC firm-target dyad. The similarity of the 
findings stems from the underlying correspondence of the two measures 
of distance: both influence the likelihood that the two parties are embed- 
ded in a network of secondary relationships. Although a variety of factors 
can explain homophilous patterns of interaction, we believe that our find- 
ings provide substantial support for an opportunity-based theory of mar- 
ket exchange. Accounts that would attribute localized interaction patterns 
to preferences would suggest that the spatial range of investments should 
not expand over time unless firm preferences evolve. A change in the cost 
of investing at a distance as an investor gains experience seems to offer 
the most likely cause of such a preference shift. Although we do in fact 
show that the spatial reach of VC firms expands as they mature, we also 
decompose this effect by demonstrating that the development of a set of 
trusted colleagues throughout the investment community drives this ex- 
pansion. Though we suspect that rational preferences for localized in- 
vesting continue to operate, deviations in the salience of geographic and 
industrial proximity appear to stem from the underlying social structure 
of the investing community, rather than from shifts in the preference 
distribution for propinquity and homophily in exchange. 

This article examines the rise of diversity rhetoric in U.S. manage-
ment and how that rhetoric reframes ideas inherent in civil rights 
law. Quantitative and qualitative content analyses of the profes- 
sional management literature (mid-1980s—mid-1990s) illustrate a 
managerialization of law, a process by which legal ideas are refigured 
by managerial ways of thinking as they flow across the boundaries 
of legal fields and into managerial and organizational fields. The 
managerial conception of diversity adds a variety of nonlegal di- 
mensions of diversity (e.g., personality traits) to the legally protected 
categories like race and sex, and it disassociates diversity from civil 
rights law. 


INTRODUCTION 


Diversity rhetoric is quite common in contemporary organizational man- 
agement. Talk of diversity pervades management periodicals, human re- 
source networks, and business education. In each of these settings, di-
versity rhetoric extols the virtues of a diverse workforce and advocates 
“managing diversity” and “valuing diversity.” The message of manage- 
ment professionals is that organizations must embrace diversity in order 
to thrive in the modern world; this requires a new management style that 
is respectful of the varying cultural styles and backgrounds and the diverse 
abilities, aspirations, and attitudes of a modern workforce. In contrast to 
old management styles, which were developed to deal with a homogenous 
workforce, new management styles must be flexible, willing to defer to 
the interests of individuals, able to resolve new types of conflicts that 
arise from varying cultural backgrounds, supportive of varying lifestyles, 
able to match different types of people to appropriate jobs, and accom- 
modating of different methods of getting work done and different ways 
of evaluating people. 

Organizations appear to be embracing the message: a 1991 survey of 
406 organizations showed that 63% of those organizations provided di- 
versity training for managers; 39% provided diversity training for em- 
ployees; 50% provided a statement on diversity from top management; 
and 31% had a diversity task force (Winterle 1992). Another study found 
diversity initiatives such as diversity action plans, endorsements of di- 
versity in mission statements, and diversity career development and plan- 
ning to be quite common (Wheeler 1995). As diversity rhetoric and action 
become more common in American management, a word that once had 
connotations of stock portfolios now conjures up pictures of a colorful 
and varied workforce and of civil rights goals achieved. 

The rise in managerial rhetoric about diversity is particularly interesting 
because it occurs at a time when support for civil rights appears to be 
waning and rules banning affirmative action are gaining legitimacy.’ Like 
the rhetoric and ideals of the civil rights movement, diversity rhetoric 
decries discrimination and extols inclusiveness. It condemns arbitrariness 
and supports fairness, discredits segregation and celebrates integration. 
Indeed, Kelly and Dobbin (1998) argue that diversity is simply “old wine 
in new wineskins”; in other words, that diversity practices are essentially 
the same as affirmative action practices. 

There is a curious paradox, however, in how diversity rhetoric in man-
agement carries forth the civil rights legacy: diversity rhetoric in fact
expands the conception of diversity so that it includes a wide array of
characteristics not explicitly covered by any law. Diversity of thought,
lifestyle, culture, dress, and numerous other attributes appear on a par
with diversity of sex and race. Further, key proponents of the new man-
agerial model—managers and management consultants—explicitly dis-
associate their efforts from civil rights law, arguing that diversity is di-
rectly valuable to organizational efficiency and important in its own right
rather than because it might promote legal ideals.


* Examples of anti-civil rights action include Proposition 209, passed by California
voters in 1996; Initiative 200, passed by Washington state voters in 1998; and the
Texas v. Hopwood case (78 F.3d 932), issued by the Fifth Circuit Court of Appeals in
1996.

This paradox suggests that managerial rhetorics, especially when they 
concern law or issues central to law, may have the potential to transform 
how managers think about law and ultimately how law is implemented 
in organizational settings. Managerial constructions of law may, for ex- 
ample, affect the representation and distribution of women, minorities, 
and other legally protected groups; thus managerial rhetoric about di- 
versity can have critical implications for the realization of civil rights in 
the employment context. Scholars have focused on the import of mana- 
gerial rhetorics for shaping management styles generally (Bendix 1956; 
Barley and Kunda 1992; Guillen 1994; Abrahamson 1991, 1996, 1997) 
but there has been virtually no attention to how managerial rhetorics may 
reframe understandings of law. 

This article examines the managerial rhetoric of “diversity” as a means 
of determining how managerial rhetorics may appropriate and transform 
legal ideals. Of course, diversity rhetoric represents only a portion of all 
managerial talk about civil rights; there is, after all, much discussion and 
writing about civil rights generally, and in particular about affirmative 
action (AA) and equal employment opportunity (EEO). However, whereas 
those earlier writings were explicitly about law, compliance, or the prob- 
lems associated with legal compliance, diversity rhetoric—which begins 
to appear only in the mid-1980s—portrays itself as quite distinct from 
civil rights law. Rather, diversity rhetoric envisions a style of management 
that will promote harmony and productivity in a changing world. We 
assert, however, that in spite of rhetorical claims to the contrary, both 
diversity and civil rights concern an important legal principle: the fair 
treatment of employees with different attributes. 

That managerial rhetorics may have the potential to transform legal 
ideals even when they explicitly disassociate themselves from the law is 
precisely what makes the study of those rhetorics sociologically interesting. 
By studying how diversity rhetoric constructs and transforms the legal 
ideals underlying civil rights law, we can begin to formulate a general 
theory of how managerial rhetorics influence the construction and im- 
plementation of law within organizations. Thus, we focus on rhetoric 
specifically about diversity in management and seek to understand how, 
and to what extent, diversity rhetoric embraces and yet transfigures the 
legal ideals embodied in civil rights law. After discussing the rise of di- 
versity rhetoric, the key question we address concerns how managerial 
rhetoric about diversity constructs both the notion of diversity itself and
its relation to civil rights law. Specifically, we examine the extent to
which—and how—diversity rhetoric conceptualizes diversity as a managerial
as opposed to a legal issue. We also examine how constructions
of diversity change over time within diversity rhetoric, and we assess 
what factors predict managerial (as opposed to legal) conceptions of 
diversity. 

We use the example of diversity rhetoric to articulate a theory of the 
managerialization of law, or the process by which conceptions of law may 
become progressively infused with managerial values as legal ideas move 
into managerial and organizational arenas. The managerialization of law 
occurs as management consultants rhetorically refashion legal ideas that 
challenge traditional managerial prerogatives and suggest new ideas, 
which they claim are more innovative, more rational, and more progres- 
sive (cf. Abrahamson 1996; Abrahamson and Fairchild 1999). We suggest 
that the managerialization of law has the potential to undermine legal 
ideals as managers shift the focus of attention from law to management. 
But at the same time, the managerialization of law has the potential to 
legalize organizations by reframing the law in ways that make it appear 
more consistent with traditional managerial prerogatives. 

Because we are interested in the production and development of man- 
agerial ideas about law, we focus on how diversity is conceptualized in 
the professional management literature rather than on how it is imple- 
mented in organizations. Managerial rhetorics are important because they 
provide both the locale and rationale for trends in organizational man- 
agement; evidence suggests that managerial rhetorics usually precede or 
develop along with the implementation of ideas by firms (Guillen 1994; 
Abrahamson 1997). 

In the next section, we discuss two literatures that help to frame our 
inquiry into diversity rhetoric: the managerial models literature and the 
institutional law and organizations literature. Following that, we offer a 
theoretical model of the managerialization of law, which draws upon both 
these literatures and goes significantly beyond them. Our objective is not 
to test these theories; rather we wish to show how the two may be prof- 
itably combined to offer a more elegant explanation of how managerial 
rhetorics reconceptualize law. Next, turning to the diversity example, we 
discuss our methodology for studying diversity rhetoric. Our results are 
presented in three sections: (1) a description of the rise and progression 
of diversity rhetoric and its relation to managerial rhetoric about civil 
rights generally; (2) a qualitative analysis of the content of diversity rhet- 
oric; and (3) a quantitative analysis of factors that predict the manager- 
ialization of law. Finally, we draw on the diversity example to offer in- 
sights into the managerialization of law more generally, arguing that 
managerial rhetorics about law may arise as ideas flow across the inter-
stitial areas where legal, managerial, and organizational fields overlap. 


TWO SOCIOLOGICAL FRAMES 


Two diverse sociological literatures inform our analysis of managerial 
attention to diversity. These literatures offer complementary explanations 
for change in managerial patterns and rhetoric that, when taken together, 
suggest a new theoretical perspective on how law may be transformed in 
organizational settings. First, the literature on managerial models ad- 
dresses questions about why and under what conditions managers pro- 
mote new sets of ideas (Bendix 1956; Barley and Kunda 1992; Guillen 
1994; Abrahamson 1991, 1996, 1997) and thus provides a context for 
understanding the rise of managerial attention to diversity. Second, the- 
ories of law and organizations, which derive from neoinstitutional or- 
ganization theory, more explicitly address the role of the professions in 
diffusing and institutionalizing responses to law (Edelman 1990, 1992; 
Edelman, Abraham, and Erlanger 1992; Edelman, Erlanger, and Lande 
1993; Dobbin, Sutton, Meyer, and Scott 1993; Sutton, Dobbin, Meyer, and 
Scott 1994; Edelman, Uggen, and Erlanger 1999; for neoinstitutional theory,
see Meyer and Rowan [1977], DiMaggio and Powell [1983], and Powell
and DiMaggio [1991]). We discuss each of these literatures below.


Managerial Models 


Various terms are used to describe managerial models: Bendix (1956) 
discusses “managerial ideologies”; Barley and Kunda (1992) analyze “man- 
agerial rhetorics”; Guillen (1994) discusses “managerial paradigms” and 
Abrahamson (1991, 1996, 1997) writes about “managerial fashions” and 
“managerial rhetorics.” Although there are minor differences in conno- 
tations, all of these authors are referring to models of management such 
as scientific management, human relations, and more recently, total qual- 
ity management. We use the term “managerial models” to refer broadly 
to both managerial practices and ideas (thus, this term would encompass 
ideologies, paradigms, fashions, and rhetorics). Following Barley and 
Kunda (1992, p. 363), we define “managerial rhetorics” as “a stream of 
discourse that promulgates, however unwittingly, a set of assumptions 
about [managerial practices].” Thus, managerial rhetorics would include
spoken and written ideas that describe, promote, and rationalize mana-
gerial practices.


? Abrahamson (1991, p. 259), citing Barley and Kunda (1992), defines rhetoric as a
“spoken and written discourse that justifies the use of a set of techniques for managing
organizations or their employees.”

Theorists have advanced a variety of explanations for the rise of new 
models of management. Some look to specific institutional contexts, such 
as levels of labor activity (Jacoby 1985) or “performance gaps”—that is,
significant discrepancies between managerial aspirations and achieve- 
ments (Abrahamson 1997; Abrahamson and Fairchild 1999). Abrahamson 
and Fairchild (1999) point to the conjunction of three conditions that give 
rise to new managerial models: the collapse of a previous fashion, a wide- 
spread performance gap, and new rhetoric that emphasizes the perform- 
ance gap. Others point to a progression of phases from more coercive to 
more bureaucratic designed to legitimate shop-floor practices (Bendix 
1956; Edwards 1979). And still others posit a cyclical pattern between 
more rational and more normative rhetorics (Barley and Kunda 1992; 
Guillen 1994). Abrahamson (1997) shows that both the cyclical and the 
performance gap theories have validity and in fact work in concert. The- 
orists broadly agree, however, that specific managerial rhetorics generally 
arise in response to identifiable ideological or technical management prob- 
lems (ranging from labor union activity to poor performance to the in- 
ternational competitive environment). 

The literature on managerial models emphasizes the role of the man- 
agement profession, and in particular management consultants, in con- 
structing new managerial rhetorics. Abrahamson (1996) argues that man- 
agement consultants act as “fashion setters,” who actively shape beliefs 
within the management community about the value of new managerial 
models. Drawing on the institutional organizations literature, Abraham- 
son suggests that management consultants, business schools, and business- 
press organizations are particularly important fashion setters; these actors 
both supply and actively promote new rhetorics that rationalize and in- 
stitutionalize new managerial practices. 

Managerial rhetorics thrive where they offer models that appear to 
enhance organizational rationality and further organizational progress 
(Abrahamson 1996; Meyer and Rowan 1977). Thus, fashion setters market 
new ideas through rhetorics that construct the novelty and rationality of 
new models. To construct the novelty of new models, fashion setters may 
emphasize the failures of previous models, pointing to their lack of fit 
with current organizational goals or with societal norms. To construct the 
rationality of new models, fashion setters use rhetorics that underscore 
why it is critical that managers change their procedures and why the 
proffered solution is the most efficient (Abrahamson 1996; cf. Edelman 
et al. 1992). Rhetorics that describe the actions of “successful” companies 
or provide quasi-theoretical statements about how new models will ad- 
dress critical problems are especially common ways of constructing ra- 
tionality (Abrahamson 1996). Academics may participate by conducting
studies that further validate the claims of management consultants and 
others (Edelman et al. 1992; Abrahamson 1996). Managerial rhetorics, 
then, are cultural phenomena: they draw their legitimacy from their cul- 
tural environments and at the same time help to shape and to construct 
their cultural environments. Managerial fashion setters use rhetoric not 
only to construct the cultural problems that require attention but also the 
solutions that will solve those problems. Managerial rhetorics make new 
managerial models seem like the most rational—and natural—way to 
proceed. 

The case that we focus on—managerial rhetoric about diversity—differs 
somewhat from the models addressed in the literature on managerial 
models. Whereas the managerial models literature tends to address models 
that offer systemic approaches to the management of employees, the di- 
versity paradigm is more narrowly related to the management of em- 
ployees from different racial, cultural, and other social groups. Since this 
issue is also central to civil rights law, we turn to theories of law
and organizations for further insights on managerial rhetoric that is related 
to law. 


Theories of Law and Organizations 


Institutional theories of law and organizations emphasize organizational 
responsiveness to the legal environment (Edelman 1990, 1992; Sutton et 
al. 1994; Dobbin et al. 1993). In contrast to earlier rational perspectives 
on law that emphasize organizational responsiveness to internal technical 
demands (Scott 1987), institutional perspectives suggest that organizations 
adopt a variety of legal structures in response to “institutionalized rules” 
or normative ideas or practices that have become widely accepted. These 
institutionalized rules, in turn, flow freely among “organizational fields.” 
Organizational fields refer to the immediate environments of organiza- 
tions, or “those organizations that, in the aggregate, constitute a recognized 
area of institutional life: key suppliers, resource and product consumers, 
regulatory agencies, and other organizations that produce similar services 
or products” (DiMaggio and Powell 1983, p. 143). The connectedness of 
organizational fields allows for the free flow—and hence the institution- 
alization—of ideas about law. These fields thus become the key source of 
ideas about how organizations ought to look, how they ought to behave, 
and even what elements of the environment are important and which 
ones constitute threats. 

Institutional theory posits that the professions are key carriers of ideas 
among and across organizational fields (DiMaggio and Powell 1983) and 
that the personnel, managerial, and legal professions are particularly im- 
portant carriers of ideas about law (Jacoby 1985; Baron, Dobbin, and
Jennings 1986; Edelman et al. 1993; Abzug and Mezias 1993; Edelman 
et al. 1999). Professionals carry ideas as they move among organizations, 
and through participation in professional networks: conferences, work- 
shops, and the professional personnel literature all offer forums for the 
exposition and diffusion of new ideas within professions. Ideas are trans- 
ferred among professions in part through the use of consultants. Both 
legal professionals and management consultants are often hired as con- 
sultants to organizations and thus their worlds intersect with the worlds 
of personnel officials within organizations. Professions, then, are a source 
of “normative isomorphism” within and across organizational fields: cer- 
tain ideas, practices, routines, and scripts become institutionalized as the 
professions offer normative solutions to perceived managerial problems
(DiMaggio and Powell 1983). 

A number of empirical studies have elaborated the role of the mana- 
gerial professions in constructing the meaning of law (Edelman 1990, 1992; 
Edelman, Petterson, Chambliss, and Erlanger 1991; Edelman et al. 1992; 
Edelman et al. 1993; Edelman et al. 1999). These studies suggest that 
legal ambiguity amplifies the opportunities for professionals to identify 
management problems and to propose new ideas to remedy those prob- 
lems. Because law is broad and ambiguous and is rarely read directly by 
employers, most organizational actors rely at least indirectly on the legal 
profession for information about legal requirements. Lawyers often run 
workshops for personnel professionals or write for their professional pe- 
riodicals, and personnel professionals in turn communicate the law to 
their employers and others who make organizational policy. As law is 
communicated by and among professions, it is filtered through a variety 
of lenses, and colored by different professional backgrounds, training, and 
interests. 

For example, an analysis of wrongful discharge law (a common law 
doctrine that allows employees to sue their employers when they are fired, 
under a limited set of conditions) shows that the personnel and legal
professions greatly inflated the threat that the doctrine posed to employers 
(Edelman et al. 1992). By repeatedly citing a few extreme jury verdicts 
in states most favorable to the doctrine, legal and personnel professionals 
created the impression that virtually all cases were won, with large 
awards, by employees. In fact, however, employers won the vast majority 
of these cases. But in part to inflate their own status within organizations 
and to expand the markets for their services, the professions created the 
impression of a much greater threat than wrongful discharge doctrine 
actually posed. 

Another study, which analyzed organizational complaint handlers’ ide- 
ologies about discrimination disputes, showed that complaint handlers 
tended to recast discrimination complaints as managerial or interpersonal
problems, thus deemphasizing the legal aspects of these claims (Edelman 
et al. 1993). In a third study, the personnel profession was shown to have 
promulgated the notion that internal grievance procedures offered
organizations substantial protection from discrimination lawsuits (Edelman
et al. 1999). Even though there is little hard evidence to support the idea 
that grievance procedures offer protection from liability, that idea has 
become widely accepted in organizational fields and has institutionalized 
the internal grievance procedure as a form of compliance with civil rights 
law. 

Theories of law and organizations, then, suggest that professionals play 
an important role in constructing ambiguous legal mandates and, further, 
that professionals tend to interject their own interests and training into 
how they understand law. The precise impact of the professions varies, 
however. For example, whereas the wrongful discharge study found that 
the personnel profession amplified the threat of law, the complaint han- 
dlers’ study found that the personnel profession transformed legal threats 
into nonlegal problems. And the grievance procedure study found that 
personnel professionals constructed the nature of the response rather than 
the legal threat itself. But all the studies suggest that ideas about law and 
compliance that originate with the professions tend to become institu- 
tionalized in organizational fields and, over time, generate a diffusion of 
new organizational practices. Although managerial attention to diversity 
is not explicitly an effort to comply with civil rights law, it does emphasize 
some of the same ideals: in particular, inclusion and fair treatment. Thus, 
theories of law and organizations suggest the importance of understanding 
diversity in the context of managerial efforts to comply with and to in- 
terpret ambiguous law. 


THE MANAGERIALIZATION OF LAW 


We draw on both the managerial models and the law and organizations 
literatures to formulate a theory of how rhetorics may introduce mana- 
gerial ways of thinking into the construction of law and legal ideas, thus 
managerializing the law. In most cases, the perspectives are complemen- 
tary rather than competing; however, we point out one instance in which 
the two lead to slightly different predictions. We elaborate both perspec- 
tives by suggesting that managerial rhetorics have the potential to reframe 
understandings of law and legal constructs even when they claim to dis- 
cuss a phenomenon different from law. Indeed, managerial rhetorics not 
explicitly about law may be especially powerful in shaping conceptions 
of law and the legal environment precisely because their relation to law 
is veiled. This is an issue that has not been explored either by the man-
agerial models literature, which addresses shifts in general styles of man- 
agement, or by the law and organizations literature, which focuses on 
organizations’ direct responses to law or legal environments. 

The potential of managerial rhetorics to reframe legal ideas derives 
from the ambiguity of law (Edelman 1992). This same study has shown 
that civil rights law offers employers only broad and general ideas about 
what employment policies ought to be, leaving wide latitude for managers 
to construct the meaning of law and the forms of compliance. Few em- 
ployers read statute books or judicial interpretations of those statutes. 
Even if they were to spend their time reading law on the books, that law 
is too vague to offer specific directives about what compliance entails. 

Employers, then, depend on a variety of professionals to interpret legal 
directives and to implement them within the firm. Managers (especially 
personnel managers and managers with compliance-related positions such 
as affirmative action officers), management consultants, and legal pro- 
fessionals are the key interpreters of civil rights laws (Edelman et al. 
1992). Lawyers, either within the firm or working on retainer, may also 
write or review policies designed to implement civil rights laws and may 
serve as consultants on issues that are recognized as potentially involving 
legal liability. But managers and management consultants are more in- 
volved in the routine activities that are the object of civil rights laws: 
hiring, firing, promotion, determining the rules of work, setting tasks, and 
so on. Although lawyers may influence what managers know and un- 
derstand about law, law in organizations is ultimately no more and no 
less than managerial understandings of law. 

Understandings of law among management professionals, moreover, 
are likely to be somewhat different from understandings of law among 
legal professionals. In part, this is because law is peripheral rather than 
central to the job of management; in part, these differences stem from 
divergent professional backgrounds and training. Managers are chiefly 
interested in how to process and to govern employees in a way that, in 
the end, will increase organizational efficiency and productivity and, for 
private firms, profit. Law is an important constraint but not necessarily 
an important objective. Managers seek to achieve efficiency in ways that 
will not incur lawsuits or the ire of regulators, but they are not primarily 
concerned with the principles of law. Thus, managers tend to understand 
law—and the legal environment more broadly—as a set of constraints 
upon the range of possible management styles that will achieve
organizational goals. Legal rules tend to be filtered through a set of
managerial lenses chiefly designed to encourage smooth employment
relations and high productivity. Thus, as legal ideas move into managerial
and organizational arenas, law tends to become “managerialized,” or
progressively infused with managerial values.

* Top managers may function more like employers in that they are more likely to
receive interpretations from the personnel managers and compliance officers below
them than to produce those interpretations.

We now formulate four hypotheses about factors that produce a man- 
agerialization of law. Based on the managerial models and law and
organizations literatures, we identify four factors that are important: the
passage of time, the professions of actors who carry rhetorical messages, 
the emphasis on profit as a rationale for the new model (rationality), and 
the disassociation of the new model from law (novelty). With respect to 
the idea of diversity specifically, we will argue that these factors predict 
a construction of diversity that moves away from the ideas of formal civil 
rights law and toward a much broader understanding of diversity, which
conforms to managerial ways of thinking.


Time 


Both the managerial models and the law and organizations literatures 
would suggest that the managerialization of law is an institutional pro- 
cess—a process that occurs over time as new ideas become widely accepted 
and taken for granted (cf. Meyer and Rowan 1977; DiMaggio and Powell 
1983). The managerial models and law and organizations literatures, how- 
ever, suggest slightly different accounts of how this institutionalization 
process may work. 

The managerial models literature does not speak directly to the man- 
agerialization of law (i.e., the infusion of law with managerial values); 
instead, it discusses the typical shape of managerial rhetorics themselves 
(i.e., the rise and fall of discourses about management practices). With
respect to the latter, the managerial models literature suggests that new 
rhetorics become institutionalized as fashion setters establish the advan- 
tages of new managerial models and others jump on the bandwagon. 
After a latency period, there is a sharp rise and then a gradual decline 
(Abrahamson and Fairchild 1999). The managerial models literature sug- 
gests a number of possible reasons for the decline of a rhetoric (and the 
model that rhetoric describes): the crisis that gave rise to the rhetoric may 
be surmounted or may disappear, competing rhetorical claims may appear 
and take precedence, or the rhetoric itself may come under criticism for 
lack of internal consistency or empirical validity (Abrahamson 1991, 1996, 
1997; Abrahamson and Fairchild 1999). However, this literature addresses 
the rise and fall of managerial models and rhetorics themselves, and not 
how those rhetorics construct law and legal ideals. Therefore, it is not 
clear whether managerialized understandings of law will decline along
with the decline of the rhetoric itself. 

The law and organizations literature does address the construction of 
law and legal ideals per se but has not to date focused on the role of 
managerial rhetorics. Most of the research in this area has focused on the 
institutionalization of structural responses to civil rights law. These re- 
sponses tend to be fairly long lasting because structural responses to law 
tend to involve staffs who develop agendas of their own and become 
internal advocates for legal ideals (Edelman 1992; Edelman et al. 1993; 
Chambliss 1996; Edelman and Petterson 1999). This literature suggests 
that once organizational professionals adopt a construction of law, that 
construction is likely to become part of the organizational culture. More- 
over, recent work on the endogeneity of law suggests that lawyers and 
judges may embrace managerial rhetorics, lending them legal legitimacy 
(Edelman et al. 1999; Edelman, in press). 

Because the managerial models literature deals with the rise and fall 
of rhetorical fashions themselves, and the law and organizations literature 
addresses the construction of law outside of managerial rhetorics, neither 
literature provides direct predictions about how the managerialization of 
law may progress over time. However, one may draw slightly different 
inferences from the two literatures. From the managerial models literature, 
if we assume that the managerialization of law follows the same trajectory 
as managerial rhetorics themselves, one may infer that the manageriali- 
zation of law may rise and then decline within the lifespan of a rhetoric. 
Alternatively, since the law and organizations literature suggests that or- 
ganizational responses to law are likely to become institutionalized over 
time, one may infer that managerialized conceptions of law would remain 
prevalent even as the rhetoric begins to decline. The only difference be- 
tween the two predictions, then, is whether or not the managerialized 
construction of law will decline as the rhetoric itself declines. 

We suspect that the managerial rhetorics are simply vehicles that carry 
new constructions of law. Once those new constructions of law are in 
place, they are likely to become institutionalized and to have a lasting 
impact on how managers think about the law. This is especially likely 
where managerial constructions of legal ideas resonate with the broader 
political and legal culture. Thus, following the institutional law and or- 
ganizations literature, we predict that managerialized conceptions of law 
will become institutionalized; that is, that they will remain stable even 
as a managerial rhetoric itself declines. 

HYPOTHESIS 1.—The managerialization of law will increase over time 
and will then level off. 
Managers (especially personnel managers and compliance officers) and
management consultants are likely to be the key promulgators of man- 
agerialized understandings of law. As the key fashion setters and partic- 
ipants in managerial rhetorics generally, managers and management con- 
sultants have the opportunity subtly to shape or reshape ideas about the 
nature of the legal environment and appropriate responses to that envi- 
ronment. When these professionals frame a new managerial model as 
consistent with the law, as different from the law, as irrelevant to the law, 
or as better than the law, they shape public understandings not only of 
how to manage but also about the meaning and requirements of law itself. 
When professionals frame problems as managerial rather than legal—or 
when they frame solutions to those problems as therapeutic or pragmatic 
rather than punitive—they shape understandings of law (Edelman et al. 
1993). Thus, the media of managerial rhetoric, which includes professional 
periodicals, meetings, workshops, and electronic listserves, become pow- 
erful forums for the construction and transformation of ideas about law 
even when their discussions are not explicitly about law (Jacoby 1985; 
Baron et al. 1986; Edelman et al. 1992; Guillen 1994; Abrahamson 1996; 
Edelman et al. 1999).* 

HYPOTHESIS 2.—Managers and managerial consultants will contribute 
to the managerialization of law more than will other professionals. 


Rationality 

Managerial rhetoric about the rationality of a new managerial model will 
contribute to the managerialization of law. As discussed earlier, the man- 
agerial models literature suggests that managerial rhetorics seek to es- 
tablish the rationality of a new managerial model by portraying the model 
as beneficial to organizational goals (Abrahamson 1996). Usually, this 
takes the form of statements about how the new model will increase 
organizational efficiency, and ultimately, profit. We extend this idea by
looking at the effects that rhetorics about the rationality of a new
managerial model may have on the managerialization of law. As the rhetoric
focuses on the benefits of the new managerial model for organizational
profit, it becomes easier for managers to lose sight of the law and to adopt
more managerial constructions of law and legal ideas. The focus on profit
as a key reason for adopting a new managerial model tends to shift
attention away from legal ideals and, therefore, away from the categories of
persons specifically protected by the law. Instead, managers think in terms
of broad managerial goals such as consistent treatment of all employees.
The law is thus reshaped in ways that fit traditional managerial goals
and ways of thinking.

* Although legal professionals are active participants in explicit constructions of law
within organizations and in the construction of forms of compliance, they are far less
likely than management professionals to participate in the managerialization of law.
Legal professionals, even when they work within or around organizations, have
different professional backgrounds and networks and are less susceptible to
managerialized understandings of law (see Edelman et al. [1992], which shows that
lawyers are less likely than personnel professionals to inflate the threat of wrongful
discharge).

HYPOTHESIS 3.—The greater the occurrence of managerial rhetoric 
about the rationality of a new model, the greater the managerialization 
of law. 


Novelty 


Managerial rhetoric about the novelty of a new managerial model will 
contribute to the managerialization of law. The managerial models lit- 
erature asserts that fashion setters construct the novelty of new managerial 
models by differentiating them from previous models and by establishing 
the need for the new model (Abrahamson 1996; Abrahamson and Fair- 
child 1999). To encourage attention to the new models, managerial rhet- 
orics emphasize the failure of previous models. As with rationality, we 
extend the managerial models literature by asserting that rhetoric about 
the novelty of a new managerial model will have effects on the mana- 
gerialization of law. Where previous managerial models are related to law, 
rhetoric about the novelty of new managerial models may delegitimize 
the law by suggesting previous models are no longer necessary or are 
inconsistent with the current political environment. To the extent that 
managerial rhetoric successfully portrays a law-related managerial model 
negatively or as outmoded, it will allow managers to appropriate and 
transform legal ideals in ways that are more compatible with managerial 
interests. 

HYPOTHESIS 4.—The greater the occurrence of managerial rhetoric 
about the novelty of a new model, the greater the managerialization of 
law. 

In sum, legal ambiguity leaves legal constructs open to influence by 
managerial rhetorics. Although managerial rhetorics serve chiefly to pro- 
mote new managerial models, we argue that they also tend to infuse 
conceptions of law with managerial values. 

We now turn to an empirical examination of a particular managerial 
rhetoric—diversity rhetoric—as a means of exploring these hypotheses. 
After discussing our methodology for studying diversity rhetoric, we pre- 
sent our results in three sections. First, we describe the rise and progression 
of diversity rhetoric, showing how it generally fits the pattern of other 
managerial rhetorics. Second, we provide a qualitative analysis of the
content of diversity rhetoric, which illustrates the managerial influence 
on the construction of diversity as well as how managers construct both 
the rationality and novelty of diversity. Finally, we turn to a quantitative 
analysis of factors within diversity rhetoric that engender a managerial- 
ization of law. 


METHODOLOGY 


In order to examine diversity rhetoric, we analyzed the professional man- 
agement literature on diversity. The professional management literature 
provides a rich source of data on both the timing of managerial rhetorics 
and the transformation of ideas over time, and it has provided data for 
numerous studies of management ideology and management rhetorics 
(Edelman et al. 1992; Edelman et al. 1999; Barley and Kunda 1992; Abzug 
and Mezias 1993; Guillen 1994; Abrahamson 1997). Not only does this 
literature reflect managerial ideals; it also constitutes one of the key forums 
through which ideas are introduced and disseminated to other managers 
and organizational actors (DiMaggio and Powell 1983; Jacoby 1985; Baron 
et al. 1986). 

It is important to reiterate that the professional management literature 
captures managerial rhetoric about diversity, but it does not necessarily 
capture what organizations are doing in the diversity realm. There is 
evidence that the two are correlated: Abrahamson (1997) reports that 
surveys carried out by the National Industrial Conference Board (NICB) 
reveal that the creation of personnel departments rises at times when 
there are higher percentages of articles advocating human relations tech- 
niques. Other theorists, however, suggest that managerial rhetorics reflect 
the practices only of the largest or most innovative firms (Brody 1968; 
Edwards 1979). We follow Barley and Kunda (1992) and others (Guillen 
1994, Abrahamson 1996), however, in suggesting that—irrespective of 
their relation to organizational practice—managerial rhetorics are im- 
portant phenomena for study in and of themselves because they represent 
the culture of business and the discourse of management fashion setters. 
Further, our data represent what organizations are hearing about through 
the professional literature on management. Not only are managers and 
human relations professionals the intended audience for these 
publications, but it is our strong impression that these articles reflect the 
content of diversity training seminars that many of these managers attend. 

Sample


To select a sample of professional management periodicals, we began with 
a list of all English-language publications listed under five headings in 
Business Information Sources (Daniells 1993): business and economics, 
management, organizational behavior, human resources and personnel,
and training and development. We excluded publications that were pri- 
marily academic journals. We also excluded those with circulation under 
20,000 or those published outside the United States, except in the case of 
human resources and personnel publications, since those were most central 
to the phenomenon under study. This left us with 18 professional man- 
agement periodicals. 

We used these journals to construct two separate data sets. The first 
data set—the diversity data set—is the focus of our study. This data set 
contains all of the articles that refer to diversity (some of which also refer 
to civil rights, including EEO or AA). This data set allows us to examine 
our key research questions concerning diversity rhetoric and the mana- 
gerialization of law. The second data set—the civil rights data set—allows 
us to place the diversity data set in historical context. The civil rights 
data set comprises articles that address civil rights (including EEO or 
AA) but that do not address diversity. We discuss these two data sets 
separately below. 

To construct the diversity data set, we used the ABI/Inform (1975-96)
database to search the 18 journals from 1975 to 1996 for all articles with 
the keyword “diversity.”* All articles that used the keyword “diversity” 
were included (some of which also discussed civil rights, EEO, or AA). 
We then excluded three types of articles: (1) those that were about types 
of diversity other than workforce diversity (e.g., stock portfolio diversity), 
(2) those that focused on issues exclusively related to globalization and 
multinational corporations in other countries, and (3) those that were 
fewer than 200 words in length. Omitting very short articles makes sense 
in a project that uses content coding, since space limitations rather than 
an author’s perspective may preclude mentioning an issue that the coding 
scheme aims to capture. We also excluded three early articles that used 
the term “diversity” in the lay sense rather than in reference to a style of 
management and that were isolated in time (two in 1976 and one in 1982). 
These three articles precede the apparent inception of diversity rhetoric, 
which (as we argue below) began in earnest in 1987. Our final diversity
data set, then, consists of 286 diversity articles across 17 publications (one
publication contained no articles on workforce diversity), which were
published between December 1987 and December 1996. Appendix table
A1 shows statistics on our sample of periodicals and on the articles on
diversity published during the sample period.

* Keywords in the ABI/Inform database may appear in the title, subject terms, subject
categories, or abstract of a citation.
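
The exclusion rules described above amount to a simple filter over keyword hits. The sketch below is only an illustration of that logic: the `Hit` record and all of its field names are hypothetical, and the single `lay_sense_only` flag stands in for the authors' judgment about the three early, isolated articles.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    """One keyword hit; every field name here is hypothetical."""
    year: int
    words: int
    workforce_diversity: bool   # False for, e.g., stock portfolio diversity
    globalization_only: bool    # exclusively about multinationals abroad
    lay_sense_only: bool        # "diversity" used only in the everyday sense


def keep_for_diversity_set(hit: Hit, min_words: int = 200) -> bool:
    """Return True if the hit survives the exclusion rules described above."""
    if not 1975 <= hit.year <= 1996:   # outside the search window
        return False
    if not hit.workforce_diversity:    # rule 1: other kinds of diversity
        return False
    if hit.globalization_only:         # rule 2: globalization-only pieces
        return False
    if hit.words < min_words:          # rule 3: fewer than 200 words
        return False
    if hit.lay_sense_only:             # early lay-sense uses of the term
        return False
    return True


# Made-up hits, for illustration only.
hits = [
    Hit(1991, 1500, True, False, False),   # kept
    Hit(1992, 150, True, False, False),    # too short
    Hit(1976, 900, True, False, True),     # lay sense
    Hit(1990, 800, False, False, False),   # not workforce diversity
]
kept = [h for h in hits if keep_for_diversity_set(h)]
```

Only the first hit survives; the other three each fail exactly one rule.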

To construct the civil rights data set, we searched the same 18 journals 
for articles that included the keywords “EEO” or “equal employment” or 
“equal employment opportunity” or “civil rights act” or “affirmative ac- 
tion.” For this search, we excluded articles that also included the keyword 
“diversity,” since all articles that discuss diversity (including those that 
also discuss civil rights) are included in the initial diversity data set. We 
use the civil rights data set to document the pattern of broader civil rights 
rhetoric and therefore to place the rise and progression of diversity rhetoric 
in historical context. 


Coding 

We content coded only the diversity data set. We coded only explicit
statements. Our coding scheme was conservative in that coders were
instructed not to extrapolate from suggestive or ambiguous statements.*
The coding scheme was developed by all three authors and revised several
times during the early part of the coding process. After each revision, any
articles that had already been coded were recoded to account for the
revisions. The third author coded 75% of the articles and closely
supervised a graduate student who coded the other 25%. If any questions arose
during the coding, the two coders decided together how to code the article
or, if necessary, brought it to the attention of the other authors.*

* Often, articles contained clear definitions of diversity so that coding the author's
conception of diversity was straightforward. At other times, constructions of diversity
were taken from explicit illustrations peppered throughout an article. But only explicit
statements were coded. Coders were instructed not to infer constructions of diversity
from the overall tone of the article.

* Because the coders worked closely, we are confident that there were no systematic
differences between their coding and therefore did not calculate intercoder reliability.


Measures


Table 1 shows the variables we coded in the diversity data set. In order
to operationalize the managerialization of law, we coded authors’
constructions of diversity. Specifically, we coded the extent to which authors
discussed diversity along dimensions protected by law (race, color, sex,
religion, national origin, disability status, pregnancy, age, veteran status,
and sexual orientation)* and along other, nonlegal, dimensions (such as
culture, geographic location, dress style, lifestyle). Table 2 shows all of
the specific constructions of diversity that we found. The legal and
nonlegal categories shown in table 1 are composites of the constructions shown
in table 2; that is, the nonlegal categories variable is operationalized as
the number of constructions that are not explicitly consistent with law,
and the legal categories variable is operationalized as the number of
constructions that are explicitly consistent with law.* Therefore, we
operationalized the managerialization of law as the number of nonlegal
categories. We also included the number of legal categories as a control
variable, since some authors were more verbose about the construction
of diversity. A related control variable was article length, measured as
the natural log of the number of pages.

* All of these categories except sexual orientation are protected by federal law. Sexual
orientation is protected under some state and local laws.
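
The count-based operationalization described above can be sketched as follows. This is a minimal illustration, not the authors' coding instrument: the two label sets contain only a few of the table 2 rows, and the function and key names are hypothetical.

```python
import math

# Illustrative subsets only; the full lists are the rows of table 2.
LEGAL = {"race or ethnicity", "disability", "national origin"}
NONLEGAL = {"communication style", "prior work experience", "problem-solving style"}


def code_article(constructions: set, pages: float) -> dict:
    """Turn one article's explicit constructions into the count variables."""
    return {
        # Control: number of constructions explicitly consistent with law.
        "legal_categories": len(constructions & LEGAL),
        # Managerialization measure: number of nonlegal constructions.
        "nonlegal_categories": len(constructions & NONLEGAL),
        # Article-length control: natural log of the page count.
        "log_pages": math.log(pages),
    }


# A made-up four-page article that uses three constructions.
example = code_article(
    {"race or ethnicity", "communication style", "prior work experience"},
    pages=4,
)
```

For this made-up article the legal count is 1 and the nonlegal count is 2, so it would score 2 on the managerialization measure.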

In order to analyze time trends in the construction of diversity, we 
coded the date of the article. Time is measured as the number of months 
since the inception of diversity rhetoric, which (as we argue below) was 
December 1987. We also calculated the square of time to check for a 
nonlinear relationship and included a flag for articles published during 
the first year after the inception of diversity rhetoric. 
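
The three time variables can be derived from an article's publication date, as in the sketch below. It assumes that "the first year" means months 0 through 11 after December 1987; the text does not spell out that boundary, and the function and key names are hypothetical.

```python
from datetime import date

INCEPTION = date(1987, 12, 1)  # December 1987, the inception date given in the text


def time_variables(published: date) -> dict:
    """Months since inception, its square, and a first-year flag."""
    months = (published.year - INCEPTION.year) * 12 + (published.month - INCEPTION.month)
    return {
        "time": months,
        "time_sq": months ** 2,            # allows a nonlinear time trend
        "first_year": int(months < 12),    # assumed boundary: months 0-11
    }


mid_1993 = time_variables(date(1993, 6, 1))
early_1988 = time_variables(date(1988, 3, 1))
```

Including the squared term lets a regression capture a trend that rises and then levels off, which is the shape hypothesis 1 predicts.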

To measure the effect of different fashion setters, we coded the pro- 
fession of each author. Where there were multiple authors, we coded the 
profession of the first author. The categories of author were executive or 
manager, management consultant, management academic, editor, staff
writer, freelance writer, and other. Interestingly, only three of the authors 
listed their profession as “lawyer”; thus lawyers are included in the “other” 
category. 

* There is one exception to this categorization. We include “diversity as including
whites or males” in nonlegal constructions even though race and sex are formally
covered by federal law because this construction of diversity tends to be used in a
way that seems inconsistent with the law, that is, to suggest that white males are
excluded by the current political climate.

In order to examine authors’ efforts to construct the rationality of
diversity, we coded each of the reasons that authors gave for promoting
diversity, using a separate dummy variable for each reason. These reasons
fell into five categories: profit, demographics (or demographic change),
law, fairness, and a diverse customer base. We operationalized rationality
with a dummy variable coded “1” if the author used profit as a reason
for supporting diversity and “0” if he or she did not.*

* Some of the articles discussed demographics and diverse customer bases as
market-based rationales for diversity. However, these dimensions were also discussed in
the context of fairness-based rationales for diversity. Thus, we use profit as the clearest
exemplar of attention to rationality in market terms.

Finally, we coded two variables that measure different dimensions of
authors’ efforts to construct the novelty of the diversity model. Authors
seeking to establish the novelty of the diversity model might do so either
by neglecting civil rights entirely or by referring to civil rights negatively
or as outmoded. Thus, we include two measures of novelty: (1) a dummy
variable measuring whether authors explicitly mentioned civil rights, and
(2) a dummy variable measuring whether those authors who do mention
civil rights portray civil rights negatively or as outmoded. The second
variable was coded “1” if the author stated that the diversity paradigm
is better than civil rights or if the author evinced a negative or hostile
attitude toward civil rights, and, additionally, if the author did not state
that the two paradigms can coexist within an organization or are both
necessary.* No author portrayed civil rights as preferable to diversity.

* We also coded a more liberal construction of this variable in which we dropped the
constraint that to be coded “1,” the article must not state that the civil rights and
diversity paradigms could coexist. The results did not differ substantially.


TABLE 2
CONSTRUCTIONS OF DIVERSITY

                                               Articles Using
Construction                                   Construction (%)
Legal:
  Race or ethnicity .........................       85.0
  [label illegible; likely sex or gender] ...       77.7
  [label illegible; likely age] .............       40.2
  Disability ................................       28.7
  National origin ...........................       35.3
  Other legislation (e.g., religion) ........       24.5
  Gay/lesbian* ..............................       28.3
  Mean (SD) .................................        3.2 (1.9)
Nonlegal:
  Cultural ..................................       43.4
  [label illegible] .........................        9.8
  Level/rank within organization ............        9.8
  Training [rest of label illegible] ........       11.9
  Communication style .......................       10.9
  Problem-solving style .....................        8.0
  Prior work experience .....................       10.1
  Attitude, lifestyle, behavior† ............       22.7
  Diversity of employee benefits ............       12.9
  Diversity of customer/client base .........        8.0
  Diversity in services and products ........        1.0
  Diversity includes white men ..............       11.2
  Mean (SD) .................................        1.6 (1.6)

NOTE: N = 286.
* Some state and local level employment discrimination laws prohibit dis-
† Excludes lifestyle based on legally protected categories (e.g., religion and
sexual preference), which would be coded under the relevant legal construction
variable.
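
Because each composite variable is a count of yes/no constructions, its mean across articles should equal the sum of the per-construction shares. The short check below applies that identity to the table 2 percentages, read as 85.0, 77.7, and so on. It is offered only as a consistency check on the reported means, not as part of the authors' analysis, and the variable names are hypothetical.

```python
# Percentages of articles using each construction, as read from table 2.
legal_pct = [85.0, 77.7, 40.2, 28.7, 35.3, 24.5, 28.3]
nonlegal_pct = [43.4, 9.8, 9.8, 11.9, 10.9, 8.0, 10.1, 22.7, 12.9, 8.0, 1.0, 11.2]


def implied_mean(percentages):
    """Mean count per article implied by per-construction percentages."""
    return sum(percentages) / 100.0


legal_mean = implied_mean(legal_pct)        # about 3.197
nonlegal_mean = implied_mean(nonlegal_pct)  # about 1.597

print(round(legal_mean, 1))     # 3.2, matching the reported legal mean
print(round(nonlegal_mean, 1))  # 1.6, matching the reported nonlegal mean
```

Both implied means agree with the "Mean (SD)" rows of the table to one decimal place.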

METHOD OF ANALYSIS


We analyzed the data both qualitatively and quantitatively. Our quali- 
tative analyses were designed to locate and document themes regarding 
constructions of diversity. Quantitative analyses included descriptive sta- 
tistics and ordinary least squares (OLS) regression using Stata software. 
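
For readers unfamiliar with OLS, the sketch below shows the estimator on a made-up example. The authors used Stata; this pure-Python version solves the normal equations directly and is only an illustration. The predictor names mirror the measures described earlier, but every number is synthetic and the outcome is noiseless by construction, so the coefficients are recovered exactly rather than estimated from real data.

```python
def ols(rows, y):
    """OLS coefficients via the normal equations (X'X) b = X'y, intercept first."""
    X = [[1.0] + [float(v) for v in r] for r in rows]
    k = len(X[0])
    # Augmented matrix [X'X | X'y].
    A = [
        [sum(x[i] * x[j] for x in X) for j in range(k)]
        + [sum(x[i] * yi for x, yi in zip(X, y))]
        for i in range(k)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for c in range(k):
        p = max(range(c, k), key=lambda r: abs(A[r][c]))
        A[c], A[p] = A[p], A[c]
        pivot = A[c][c]
        A[c] = [v / pivot for v in A[c]]
        for r in range(k):
            if r != c:
                f = A[r][c]
                A[r] = [vr - f * vc for vr, vc in zip(A[r], A[c])]
    return [A[i][k] for i in range(k)]


# Synthetic illustration only: these values are invented, not the study's data.
time = [0, 6, 12, 18, 24, 30, 36, 42]   # months since inception
profit = [0, 1, 0, 1, 1, 0, 1, 0]       # rationality dummy
true_b = [0.5, 0.03, -0.0002, 0.4]      # intercept, time, time squared, profit
nonlegal = [
    true_b[0] + true_b[1] * t + true_b[2] * t * t + true_b[3] * p
    for t, p in zip(time, profit)
]

coef = ols([[t, t * t, p] for t, p in zip(time, profit)], nonlegal)
```

A positive coefficient on time combined with a negative coefficient on its square is the pattern one would expect if the outcome rises and then levels off.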


THE RISE AND PROGRESSION OF DIVERSITY RHETORIC 


In this section, we describe the rise and progression of diversity rhetoric, 
showing how it generally fits the pattern of other managerial rhetorics. 
Further, we compare the rise and progression of diversity rhetoric to 
managerial rhetoric about civil rights more generally in order to show 
that diversity rhetoric appears to have displaced a great deal of the civil 
rights rhetoric. 

The managerial models literature suggests that new managerial mod- 
els—and rhetorics—tend to arise as managerial fashion setters devise 
novel solutions to apparent market-related crises. As managers jump on 
the bandwagon, rhetorics may become institutionalized and quickly gain 
acceptance. But managerial rhetorics tend to be short-lived because the 
status of managers depends on the production of novel ideas. Thus, new 
rhetorics and models (deriving from new crises) tend to engender the 
decline of previous ones (Guillen 1994; Barley and Kunda 1992; Abra- 
hamson 1996, 1997; Abrahamson and Fairchild 1999). 

The rise and progression of diversity rhetoric is fairly consistent with 
this pattern (although we will argue that the crisis spawning attention to 
diversity comes more from the legal and demographic environments of 
organizations than from market-related sources). Figure 1 places diversity 
rhetoric in historical context, showing the rise and progression of diversity 
rhetoric relative to civil rights rhetoric. The white line represents diversity 
rhetoric; it includes all articles that use the keyword “diversity,” including 
those that also mention civil rights. The black line represents all pure 
civil rights rhetoric, that is, articles that address civil rights generally or 
address EEO, or AA, but do not address diversity. The dotted line is a 
subset of the black line: it includes articles on civil rights generally and 
on EEO, but excludes articles on AA and on diversity. Thus, the white 
line includes all cases in the diversity data set, whereas the black line 
includes all cases in the civil rights data set. 

Figure 1 shows that attention to diversity in the professional management
literature (white line) began in earnest in 1987, rose sharply through
1993, and then declined somewhat between 1993 and 1996.* Comparing
the diversity data set (white line) to the civil rights data set (black line),
figure 1 shows that diversity rhetoric rose during a period of relatively
low attention to civil rights in the professional management literature,
indicative of an opening for the evolution of a new managerial rhetoric.
By 1993, the number of articles discussing diversity exceeded the number
that discussed only civil rights. Figure 1 shows, then, that diversity
rhetoric is not just a “drop in the bucket” of all managerial rhetoric about
civil rights but rather was as prevalent at its peak as was civil rights
rhetoric at its peak.

FIG. 1. The frequency and diversity of civil rights rhetorics over time (solid black line
indicates civil rights-EEO/AA; broken line indicates civil rights-EEO only; white line
indicates diversity)

* The inception of diversity rhetoric is apparent in management textbooks as well as
in the professional personnel literature. We reviewed management textbooks back to
1976 and found no references to diversity prior to 1988; in fact the word “diversity”
cannot even be found in the indexes of these texts (e.g., Daft 1983; Dunham 1984;
Filley, House, and Kerr 1976; French 1987; Haimann and Hilgert 1987; Haimann,
Scott, and Conner 1985). In contrast, it is common in management texts from the
1990s to emphasize diversity and to offer advice for dealing with the “new workforce”
(e.g., Robbins 1993; Schermerhorn, Hunt, and Osborn 1997). Diversity generally
receives at least several pages in these later texts, titles of chapters dealing
with individual characteristics frequently include “diversity,” and entire chapters are
devoted to “managing diverse employees” (e.g., Daft 1995; Moorhead and Griffin 1998).

Further, comparing the civil rights line that includes both EEO and 
AA (black line) to the civil rights line that excludes AA (dotted line) reveals 
a sharp increase in articles discussing affirmative action during the late 
1970s and early 1980s, a time when civil rights enforcement was at its 
strongest and organizational attention to and support for civil rights law 
was relatively high (Edelman 1992). A second peak in civil rights rhetoric 
in the mid-1990s, which coincides with the peak in diversity rhetoric, may 
reflect a backlash against affirmative action and civil rights law generally 
(Stryker, Scarpellino, and Holtzman 1999). 

The rise of diversity rhetoric, then, appears to have come at a time of 
considerable legal ambiguity and controversy over affirmative action. The 
1980s saw a fair amount of opposition to affirmative action from the 
Reagan administration (Skrentny 1996) and from an increasingly conser- 
vative Supreme Court (Freeman 1990). While both political commentators 
(Bennett, Chavez, Cohen, and Decter 1988) and scholarly evaluations 
(Skrentny 1996) conclude that President Reagan did not accomplish the 
dismantling of affirmative action that he promised during his first can- 
didacy, the Reagan administration did instill a belief that it would curtail 
government involvement in affirmative action programs. Indeed, during 
Reagan’s second term, Executive Order 11246 (which mandates affir- 
mative action among government agencies and contractors) was almost 
replaced by language forbidding preferences. The budgets of the U.S. 
Equal Employment Opportunity Commission (EEOC) and Office of Fed- 
eral Contract Compliance Programs (OFCCP) were significantly reduced 
in the 1980s as well (DiPrete and Grusky 1990). Further, Reagan’s selec- 
tion of socially conservative judicial and nonjudicial appointees signaled 
a sharp ideological shift in federal support for affirmative action. The 
appointment of Justice Scalia to the Supreme Court and the elevation of 
Justice Rehnquist to Chief Justice created the potential for long-term 
change in legal policy, while the appointments of conservatives Clarence 
Thomas (to head the EEOC) and Bradford Reynolds (as Assistant At- 
torney General for Civil Rights) threatened short-term strategies and 
newly secured benefits. At the same time, court decisions such as the 1984 
Firefighters Local Union No. 1784 v. Stotts (467 U.S. 561), which held 
that layoffs according to seniority systems are valid even if they com- 
promise a consent decree reached to remedy discrimination, created con- 
siderable ambiguity for employers concerning what types of remedial 
practices were legal. 

Figure 1 illustrates the fact that managerial rhetoric about civil rights 
declined in the early 1980s and had reached new lows by 1987, concurrent 
with the new legal environment. Personnel managers and affirmative 
action compliance officers needed a new managerial model to replace the 
emphasis on civil rights law (and possibly to preserve their jobs, many 
of which had been organized around civil rights compliance). Against this 
environment, Workforce 2000: Work and Workers for the 21st Century 
(Johnston and Packer 1987) appeared in 1987 and predicted dramatic 
changes in the composition of the American workforce. This publication 
seems to have provided the perceived crisis that gave rise to new ideas 
about “managing diversity” and a rhetoric that supported and rationalized 
those ideas. Workforce 2000 suggested that the American workforce was 
in the process of undergoing a dramatic change in composition, becoming 
more diverse on attributes such as sex, race, ethnicity, and national origin. 
While its analysis of the workforce covered multiple topics, the most 
dramatic prediction of Workforce 2000, and the one most frequently cited 
in the professional personnel literature, was that by the year 2000, only 
15% of new entrants to the American workforce would be American-born 
white males, while the remaining 85% of first-time job seekers would be 
women, minorities, and foreign-born U.S. residents.“ Based on this dem- 
ographic shift, the report forecasted an impending economic crisis, which 
would result from the grave skill mismatch between jobs and people. 
There would be a tight labor pool for the highly educated, skilled, and 
talented, and a glut of marginally functional (illiterate) unskilled 
workers.” 

These claims appear to have been largely erroneous. The Bureau of 
Labor Statistics (BLS) reported that although growth rates for blacks, 
Hispanics, and Asians would be slightly higher than those for whites 
during the 1994-2005 period, whites would remain a majority of the 
civilian labor force (Fullerton 1997). The large discrepancy between pre- 
dictions by Workforce 2000 and the BLS can be explained by a simple 
error in the former: Johnston and Packer confuse “new entrants to the 
workforce” with “net additions to the workforce.” In “Myths about Di- 
versity,” Friedman and DiTomaso (1996) resolve the confusion: whereas 
new entrants mean people entering the labor force for the first time, net 
additions refers to the subset of new entrants who are not simply replacing 
someone else leaving the workforce. Friedman and DiTomaso showed 


“ The authors consistently acknowledge that their immigration projections are highly 
contingent upon unpredictable changes in the world and domestic political economies. 
4 Workforce 2000 also predicted that a changing age composition would produce a 
more rigid workforce that might be reluctant to relocate and would involve high health 
care costs. The rise in women’s labor force participation would be accompanied by 
costly social and business supports for working mothers and dual-earner families, such 
as day care, part-time work, and revised tax structures. And an increase in dual-earner 
families would augment the rigidity of the labor force by making relocation even more 
difficult. 
that although non-Hispanic white males would constitute about a third 
of new entrants to the workforce, they would constitute only about 13% 
of net additions to the workforce between 1994 and 2005. Net additions, 
however, are small relative to the number in the total labor force and thus 
would have a relatively small impact on the composition of the total labor 
force. Thus, the proportion of non-Hispanic white males is decreasing 
slightly because a larger portion of new entrants are minorities or women, 
but the workforce as a whole would not change dramatically.** Today, 
even the Hudson Institute has acknowledged that its predictions were 
incorrect. In Workforce 2020, published 10 years later (Judy and D’Amico 
1997), the authors in fact recant the earlier predictions and offer numbers 
similar to those of Friedman and DiTomaso, suggesting that the U.S. 
labor force continues to become more ethnically diverse albeit at a fairly 
slow pace. 
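
The distinction between new entrants and net additions can be made concrete with a small arithmetic sketch. The counts below are hypothetical, in arbitrary units, and were chosen only so that the resulting shares resemble the proportions reported above; they are not the BLS or Friedman and DiTomaso data, and the variable names are ours.

```python
# Hypothetical counts (arbitrary units), chosen only to mirror the
# proportions discussed in the text. They are NOT the published figures.
labor_force = {"white_males": 92, "others": 108}   # starting labor force
entrants = {"white_males": 10, "others": 20}       # first-time entrants
leavers = {"white_males": 8, "others": 7}          # people leaving the labor force


def group_share(counts, group="white_males"):
    """Return the share of `group` in a dict of counts."""
    return counts[group] / sum(counts.values())


# Net additions: entrants who are not simply replacing someone who left.
net_additions = {g: entrants[g] - leavers[g] for g in entrants}

# Labor force after one period of entry and exit.
new_labor_force = {g: labor_force[g] + net_additions[g] for g in labor_force}

entrant_share = group_share(entrants)        # share among new entrants
net_share = group_share(net_additions)       # share among net additions
share_before = group_share(labor_force)      # share of total, before
share_after = group_share(new_labor_force)   # share of total, after

print(f"new entrants:  {entrant_share:.1%}")
print(f"net additions: {net_share:.1%}")
print(f"labor force:   {share_before:.1%} -> {share_after:.1%}")
```

Under these assumed counts, a group can make up a third of new entrants and only a small fraction of net additions, while its share of the total labor force moves by only a couple of percentage points. Reading the net additions share as if it were the entrant share is what produces the more dramatic picture.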

The claims of Workforce 2000 also fly in the face of work by economists 
on the impact of civil rights law. Economists who study the impact of 
Title VII have found that the act had a small but statistically significant 
impact on the workforce share of black males and white women, especially 
in firms that are government contractors. But economists find that the 
impact of civil rights laws has neither been dramatic, nor has it produced 
a very diverse workforce (Leonard 1984, 1986; Heckman and Payner 1989; 
Blumrosen 1993). 

Despite its inaccuracy, Workforce 2000 appears to have been the catalyst 
that precipitated diversity rhetoric. As shown in figure 1, the surge in 
diversity rhetoric appeared shortly after the publication of Workforce 2000. 
Within a few years after the publication of Workforce 2000, diversity had 
become a buzzword in management circles. Numerous books and articles 
about management echoed and uncritically accepted the claims of Work- 
force 2000, many explicitly mentioning that source. These publications 
rehearsed and elaborated the nature of the threat, outlining an education 
gap between workers and organizational needs due to the influx of un- 
skilled immigrants, older workers, workers with disabilities, two-career 
and commuting couples, and others who would need various accommo- 
dations. The articles did not merely discuss diversity along ethnic and 


“Friedman and DiTomaso note that this is part of a very long term change in the 
percentage of white males, from 69% of the labor force in 1940 to 46% in 1994, to a 
projected 44% in the year 2005. They point out, further, that the trend toward a smaller 
proportion of white males has in fact been slowing in recent years. They suggest that 
the increase in gender diversity is essentially over and that future increases in racial 
diversity will come primarily from immigration. And although net additions to the 
workforce may be largely comprised of immigrants, especially Hispanics, the relatively 
small initial numbers of these employees means that the overall impact of this change 
will be small. 
socioeconomic lines, however. Rather, diversity proponents added a twist 
to these predictions that Workforce 2000 had not explicitly suggested: 
noting that women, minorities, and the foreign-born are culturally dif- 
ferent from the traditional workforce, proponents began to emphasize the 
need for a new managerial model that was far more cognizant of different 
belief systems, cognitive and interactive styles, and career goals. The 
diversity model embraces a style of management designed to be inclusive, 
to recognize multiple cultures, to encourage communication among em- 
ployees with different cultural backgrounds, and to take advantage of the 
creativity that flows from a diverse workforce. To establish the import of 
the proposed managerial model, the publications tie “managing diversity” 
not to law but rather to organizational survival.” 


It is now a fact that by the year 2000, the workforce will be dominated by 
groups, such as women and minorities, which traditionally have been dis- 
criminated against. .. . The bottom line lesson is straightforward: those 
organizations seen as hostile to (or perhaps merely not supportive of) dis- 
enfranchised groups simply will not be able to acquire the competent work 
force they need to do successful work in the business arena in the next 
century. (Jackson et al. 1993, p. 23) 


“Managing diversity” is fast becoming the corporate watchword of the dec- 
ade—not because corporations are becoming kinder and gentler toward 
culturally diverse groups but because they want to survive. (Henderson 
1994, p. 3) 


While society has not succeeded in producing a true melting pot, employers 
have no choice [but to do so]. It is not a question of political correctness; 
it is a matter of survival. Workforce 2000 is here. Simply put, there are not 
enough white males to fill all of the jobs available in the American economy. 
(Segal 1995, p. 31) 


Thus, Workforce 2000 appears to have spawned diversity rhetoric by 
constructing a threat regarding a major change in the demographics of 
the workforce. The professional management literature picked up that 
(largely erroneous) threat and made it into a full-blown crisis with rhetoric 
that justified and necessitated a major change in management style (cf. 
Abrahamson and Fairchild 1999). 


Y Diversity rhetoric is similar in management textbooks, as the following quote illus- 
trates: “Workforce diversity is increasing in both the United States and Canada, as it 
is in much of the rest of the world. For example, in the United States, figures indicate 
that between 1990 and 2005, about 50% of the new entrants to the labor force will 
be women and such racial and ethnic groups as African Americans, Latinos, and 
Asians. At the same time, those 55 and older are projected to make up nearly 15% of 
the labor force. All of this is in sharp contrast to the traditional, younger, mostly white 
American male, labor force” (Schermerhorn et al. 1997, p. 42). 
In sum, the events that surround the rise of diversity rhetoric are con- 
sistent with arguments in the managerial models literature, which em- 
phasize the role of crises in motivating new managerial rhetorics. The 
crisis motivating diversity rhetoric grows out of legal ambiguity, political 
controversy, and demographic threats that would aggravate both. Further, 
in this case, the crisis appears to be largely socially constructed by man- 
agement consultants who use erroneous statistics to portray the demo- 
graphic environment as far more threatening than it is (Edelman et al. 
1992; Edelman et al. 1999). 

Moreover, the general progression of diversity rhetoric follows the pat- 
tern suggested by the managerial models literature in that there is a grad- 
ual increase in attention to diversity (which at its peak exceeds attention 
to civil rights), followed by a decline as the diversity “fashion” declines.” 
Given that diversity rhetoric appears to have displaced quite a bit of 
managerial attention to civil rights during its duration, it becomes ex- 
tremely important to look at the meaning of “diversity” and its relation 
to civil rights. 


THE CONTENT OF DIVERSITY RHETORIC 


We present, in three sections, qualitative analyses and descriptive statistics 
to portray the content of articles on diversity in the professional man- 
agement literature. In the first section, we describe how managers con- 
struct the meaning of diversity, producing a “managerialization of law.” 
In the second and third sections, respectively, we describe how the lit- 
erature emphasizes the rationality and novelty of the diversity model. 


Diversity as Managerialization of Law 
We begin by elaborating an understanding of “diversity” derived from 
the statutory language of civil rights law as a kind of ideal-type. We then 
show how managerial rhetoric expands upon the ideal-type by framing 
diversity in ways that reflect managerial interests and ideals. 

The formal legal model.—Title VII of the 1964 Civil Rights Act protects 
employees from discrimination on the basis of race, color, sex, national 


€ Since our focus is on how diversity rhetoric constructs the meaning of law rather 
than on the reasons for the decline of the diversity rhetoric, an explanation for the 
decline of diversity rhetoric is beyond the scope of this paper. The managerial models 
literature, however, suggests several possible reasons for this decline: the crisis that 
gave rise to the rhetoric may have dissipated; the rhetoric may have come under attack 
for lack of internal consistency or empirical validity; or other rhetorical claims may 
have begun to compete (Abrahamson 1991, 1996, 1997). 
origin, and religion. Other civil rights statutes protect employees from 
discrimination on the basis of age, veteran status, pregnancy, and disability 
status, and a few jurisdictions offer protection against sexual orientation 
discrimination. The categories that Congress chose to protect in enacting 
civil rights legislation are generally based on an understanding of the 
historical discrimination and disenfranchisement experienced by these 
groups. There has of course been considerable debate among legislators, 
political activists, lawyers, and judges about the intended meaning of civil 
rights law and about the coverage of the categories of employees that law 
protects. Thus, it would be wrong to suggest that there is a unitary or 
coherent understanding of diversity in the legal realm. Nonetheless, it is 
possible to articulate a vision of diversity based on a literal reading of the 
statutory language of civil rights law. This “formal legal model,” in which 
diversity is understood as representation along the categories explicitly 
designated by statutes, is assumed by many legal actors and is well rep- 
resented in court cases addressing civil rights issues. The formal legal 
model is not a random list of attributes; rather, it embraces a moral ideal 
that groups of citizens who have been subject to past discrimination are 
now entitled to special protection against any further discrimination and 
to fair opportunity in employment. For now, we treat the formal legal 
model as an ideal-type against which to compare the managerial concep- 
tion of diversity. Later, we show that the legal field in fact shares some 
elements of the managerial understanding of diversity. Thus, although 
the formal legal model describes a central conception of diversity in the 
legal field, it is not the only conception. 

The managerial conception.—Our examination of the diversity rhetoric 
shows that managerial conceptions of diversity include, but expand sig- 
nificantly upon, the formal legal model. Virtually all the articles do men- 
tion at least one legally protected dimension along which diversity is 
important (usually race or sex). But in addition, the professional literature 
emphasizes the importance of diverse attitudes, work styles, communi- 
cation skills, and cultural competence. And some articles advocate di- 
versity on dimensions that seem even further from the formal legal model 
such as rank or function in the organization, educational background, 
and even characteristics that might seem trivial, like chattiness or thinking 
style. As the following examples illustrate, the managerial conception of 
diversity significantly expands upon the formal legal model and empha- 
sizes nonlegal dimensions. 


Although race and gender issues are given top priority in much current 
valuing diversity work, the concept of valuing diversity applies equally to 
issues of religious and regional differences, class, age, disability, veteran 
status, sexual orientation, and lifestyle. Many of the conflicts that multi- 
cultural trainers address also occur across educational lines (for example, 
employees with MBAs versus those without college degrees), function (cre- 
ative, marketing [types] versus engineers), or position (line versus staff). 

. Indeed, “culture” is a word that can be applied to any group. There 
certainly are regional, professional, class, and lifestyle cultures. (Copeland 
1988, p. 49) 


PG&E defines diversity as any difference in race, gender, age, language, 
physical characteristics, disability, sexual orientation, economic status, pa- 
rental status, education, geographic origin, profession, lifestyle, religion, or 
position within the hierarchy of the organization. (Johnson and O’Mara 
1992, pp. 51-52) 


At Hallmark, diversity includes, but is not limited to: ethnic origin, religion, 
gender, age, sexual orientation, disability, lifestyle, economic background, 
regional geography, employment status, and thinking style. (Flynn 1996, p. 
52) 


[At Westinghouse we] value diverse opinions, diverse skills, and diverse 
personalities all working in harmony in a team. Diverse people each bring 
their special skills and the ability to see a problem from a different angle, 
and that’s important. .. . We look for people from all different social and 
academic backgrounds and try to mix quiet with talky people, electrical 
engineers with software and quality-assurance engineers. (Altany 1992, pp. 
32-33) 


Table 2, which shows the various constructions of diversity that we 
found, helps to put these examples in perspective. Almost all of the articles 
(91%) explicitly mentioned at least one of the legally protected categories, 
in particular diversity on the basis of race or ethnicity (85%) or sex (78%). 
But the construction of diversity goes far beyond legally protected cate- 
gories. The most commonly mentioned nonlegal construction of diversity 
is “cultural diversity,” which was mentioned by nearly half (43%) of the 
articles. While this might seem like another word for national origin, 
ethnic, or even for racial diversity, a close reading of the articles suggests 
that cultural diversity is meant to include but to be far broader than the 
legal categories. Further, many of the constructions of diversity have noth- 
ing apparent to do with legally protected categories. Articles discuss, for 
example, diversity on the basis of geography (10%), level or function 
within the organization (10%), training or educational background (12%), 
communication style (11%), problem-solving abilities or styles (8%), ex- 
perience (10%), attitude (23%), employee benefits (13%), customers (8%), 
and services (1%). And, interestingly, over 10% of the articles offered a 
construction of diversity that explicitly mentioned inclusion of or attention 
to whites and males. 

Table 2 together with the qualitative examples show that the managerial 
construction of diversity goes well beyond the original legal categories to 
include nonlegal categories. The presence of nonlegal categories in con- 
structions of diversity suggests that managers have supplemented the 
purely legal categories protected by civil rights law with many new di- 
mensions of diversity. The nonlegal categories, moreover, reflect tradi- 
tional areas of managerial concerns. 

Thus, diversity rhetoric is not simply a grafting of a new name onto 
an old model, as Kelly and Dobbin (1998) suggest. Rather diversity rhet- 
oric represents a managerialization of law; that is, it supplants legal cat- 
egories of thinking about employee differences with a new schema that 
equates dimensions based on historical disenfranchisement with those 
based on managerial ideas about how to produce an effective and creative 
workforce. 


Rationality—Diversity as a Resource 

One of the most prominent themes in the diversity literature is the idea 
that diversity management (including the idea of “valuing diversity”) is 
not only necessary given a changing workforce, it is also a profitable 
resource for organizations. Diversity rhetoric asserts that different types 
of employees increase productivity and effectiveness because people with 
different characteristics have different attitudes, work styles, and cultural 
knowledge, which makes them assets to corporations in a changing world. 
Blacks, women, and older people differ from whites, males, and younger 
people because they have different cultural backgrounds, live in different 
geographical areas, walk in different circles, live different lifestyles, and 
have different worldviews. But people with different dress styles, tastes, 
lifestyles, and geographic backgrounds also bring variety in expertise and 
ideas. For that reason, these people can bring a variety of solutions to 
organizations that white males alone could not. The articles emphasize 
that innovative responses to challenges in the workplace flow largely from 
the associations that employees make between the problem at hand and 
the knowledge and experience they hold in their heads. Therefore, to 
obtain a variety of methods for attacking a problem, it is valuable to 
recruit participants with diverse backgrounds to contribute the compo- 
nents of a solution, to see the value in each type of employee, and to 
view their differences as a resource for the firm. 

Diversity rhetoric, then, encourages managers to view diversity as a 
resource that could aid in a firm’s movement into the “new economy”; 
that is, an economy based upon globalization and heightened competition. 
The firm well situated in this new economy is one that exemplifies cre- 
ativity, multicultural competence, and an internal flexibility that mandates 
nonhierarchical organization and personal empowerment. Diversity is 
constructed as integral to both individual and workgroup productivity 
and to matters of hiring and retention. The message is that firms with 
environments inhospitable to any particular group of employees in the 
impending tight labor market (for skilled workers) will lose on all counts. 
Discrimination and exclusion become problematic under this view, not 
because they are unjust or illegal, but because they inhibit the firm’s 
ability to profit in a global and more diverse world. Moreover, the focus 
on creativity and flexibility (as opposed to fairness and morality) appears 
to engender a managerial conceptualization of diversity that expands upon 
the formal legal model. 

The idea of diversity as a resource looks beyond the essential categories 
of race, gender, national origin and the like to the reasons for valuing 
those people. The following excerpts exemplify the idea of diversity as a 
resource, and in particular the link between diversity and profit.’ 


Consider the following strategic business advantages to managing diversity: 
a richer variety of ideas, greater creativity, a better and more flexible un- 
derstanding of diverse markets, and an improved ability to go global. 
(McNerney 1994, p. 22) 


Academic studies confirm that heterogeneous work groups create solutions 
to work and business problems that are more innovative and more effective 
than those developed by homogenous groups. (Jenner 1994, p. 11) 


In recent years, diversity as an issue has taken on a new face in business, 
one not involving morals and laws, but one with potential consequences 
for business’s bottom line. (Jackson et al. 1993, p. 23) 


Profit is in fact the most frequently cited reason offered by the articles 
in support of organizational diversity: nearly 50% of all articles refer to 
its value for organizational profit. In contrast, only 19% of the articles 
mention law as a reason for supporting diversity and only 30% mention 
fairness. The emphasis on profit in the diversity rhetoric, then, appears 
to be a means of rationalizing the need for management techniques that 
incorporate workforce diversity. 


1* Again, management textbooks make similar statements. “Valuing diversity is not 
only the right thing to do, it’s the right business thing to do. Companies that do a 
good job of valuing and effectively managing diversity in the 1990s will have a com- 
petitive advantage over the ones that don't. Paying attention to the diverse work force 
has become an economic imperative. There is no question that the work force is 
changing and U.S. organizations have to change to reflect the new work force com- 
position” (Daft 1995, p. 362). 
Novelty—Diversity as Different from Civil Rights 


Another prominent theme in the diversity literature is that diversity is 
novel. One of the ways in which diversity rhetoric establishes the novelty 
of diversity management is to distinguish it from civil rights, either by 
omitting any mention of civil rights or by explicitly differentiating the 
two. Slightly fewer than half the articles that we coded (48%) even men- 
tioned civil rights. Of those that did mention civil rights, most did so in 
a rather perfunctory manner. 

Further, many of those articles draw explicit distinctions between civil 
rights law and the notion of diversity, and quite a few articles portray 
law as external, imposed, and in general a negative force on the organ- 
ization while portraying diversity as good for employee morale, harmo- 
nious employment relations, and profit. Specifically, 40 articles (29% of 
those mentioning civil rights) made statements to the effect that civil rights 
was passé while diversity was newer and better suited to today’s society. 
Many of the articles were explicitly negative about civil rights. Only 14 
(10% of those mentioning civil rights) had anything positive to say, while 
34 (24% of those mentioning civil rights) made negative or hostile state- 
ments and another 18 (13% of those mentioning civil rights) contained 
ambiguous evaluative statements. When comparing civil rights to diver- 
sity, no articles said that civil rights was better than diversity, while 55 
articles (35% of those mentioning civil rights) made statements indicating 
that diversity was preferable to civil rights. Combining these dimensions, 
67 articles (49% of those mentioning civil rights) made negative judge- 
ments about civil rights (either by saying that the notion of civil rights 
is passé or that diversity is superior to civil rights).” 

One of the more common forms of comparison was to note the artificial 
(imposed) nature of legal mandates in comparison to the natural character 
of the new diversity. The language of artificiality associated with the law 
appears to derive from the familiar conservative critique that civil rights 
law involves quotas and lowered standards and thus produces economic 
inefficiencies. Consider the following examples: 


Compliance implies a mandate. And conformity is exactly the opposite of 
what diversity is all about. Instead, your objective should be a culture that 
breeds harmony and rapport, which in turn provide creativity, effective 
decision making and better teamwork. (Thomas 1994, p. 62) 


” Using a more conservative measure that does not count articles that, despite their 
negative judgements about civil rights, also say that civil rights and diversity can 
coexist, 43 articles (31% of those mentioning civil rights) are explicitly negative about 
civil rights and imply that the two cannot coexist. We use the more conservative 
measure in our multiple regression analyses. 
[Managing diversity is] not a problem to be solved, not the excesses of legal 
recourse, nor, by extension, the extremes of the thought police or politically 
correct enforcers. Using diversity as a process to be managed unleashes 
performance energy that was previously wasted in fighting discrimination. 
(Vroman 1994, p. 107) 


While the doors of opportunity were opened to many who were previously 
excluded, new hurdles were created by the unnatural focus on special target 
groups in organizations, the perception by white managers that standards 
were being lowered to accommodate minorities and women, and the per- 
ception that EEO and AA programs were artificial methods forced upon 
organizations and their managers to pay for the historical sins of U.S. society. 
(Torres and Bruxelles 1992, p. 31) 


Managing or valuing diversity differs from the conventional approach to 
equal opportunities in that it seeks to create a climate whereby those in- 
volved want to move beyond the achievement of [a] mere statistical goal. 
(Greenslade 1991, p. 28) 


These excerpts illustrate the role of diversity rhetoric in disassociating the 
new managerial model, managing diversity, from compliance with civil 
rights law.*”* Whereas legal compliance is portrayed as coerced, unnatural, 
and divisive, managing diversity is portrayed as voluntary, sensible, and 
good for rapport. 

Our qualitative analyses of diversity rhetoric, then, show that like other 
managerial rhetorics, diversity rhetoric emphasizes the rationality and 
novelty of a new model of management. But, perhaps more important, 
diversity rhetoric subtly alters formal legal ideas of diversity by advocating 
diversity on a variety of dimensions that go well beyond those specified 
by civil rights law. We now turn to an analysis of the factors that predict 
a managerialized conceptualization of diversity. 


FACTORS THAT PREDICT THE MANAGERIALIZATION OF LAW 


We argued earlier that the managerialization of law is an institutional 
process in which legal ideas become infused with managerial values. In 


7 Management textbooks also suggest that diversity is better than civil rights: “Some- 
times managing diversity is confused with affirmative action or other laws directed 
toward those with certain characteristics. Actually, the two concepts are quite different. 
Affirmative action emphasizes achieving equality of opportunity in the work setting 
through the changing of organizational demographics—gender, age, racial and ethnic 
mixes, and the like. It is legally driven. ... Managing diversity, in contrast, emphasizes 
appreciation of differences in creating a setting in which everyone feels valued and 
accepted. ... Managing diversity assumes that groups will retain their own charac- 
teristics and will shape the firm as well as be shaped by it, creating a common set of 
values” (Schermerhorn et al. 1997, pp. 42-43). 
the context of diversity rhetoric, the managerialization of law occurs when 
diversity is conceptualized not simply in terms of the representation of 
legally protected categories of employees, but more broadly in terms of 
difference itself. In this section, we model the managerialization of law 
in the diversity context by examining the effects of time, author’s pro- 
fession, the emphasis on rationality (the focus on profit as the motivation 
for diversity), and the emphasis on novelty (the lack of references to civil 
rights law or the negative characterization of civil rights law) on an ex- 
panded, nonlegal, construction of diversity. 

Table 3 shows five OLS regressions predicting the managerialization 
of law, as measured by the number of nonlegal constructions of diversity. 
We control for the number of legal constructions of diversity since some 
authors emphasize the construction of diversity more than others; thus, 
the dependent variable should be understood as nonlegal constructions 
of diversity relative to legal constructions of diversity. We also control for 
article length. 
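
As a reading aid, the baseline specification described above can be written as a single equation. The notation is an assumption made here for illustration and does not appear in table 3: y_i stands for the number of nonlegal constructions in article i, legal_i and length_i for the two controls, and x_i for whatever predictors a given model adds.

```latex
% Sketch only: symbol names are illustrative assumptions, not taken from table 3.
\[
  y_i = \beta_0 + \beta_1\,\mathrm{legal}_i + \beta_2\,\mathrm{length}_i
        + \mathbf{x}_i^{\top}\boldsymbol{\gamma} + \varepsilon_i
\]
```

Read this way, each coefficient in the vector of added predictors compares articles that have the same number of legal constructions and the same length, which is the sense in which the dependent variable is "relative to" legal constructions.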

Model 1 provides a test of hypothesis 1, which predicted an increase 
over time in the number of nonlegal constructions of diversity and then 
a leveling off. Three variables are relevant to hypothesis 1: time (the 
number of months since the inception of diversity), time squared, and 
date flag (the dummy variable indicating that an article was published 
during the first year after the inception of diversity). 

Model 1 shows that date flag, time, and time squared all have statis- 
tically significant effects on the number of nonlegal constructions of di- 
versity. These results suggest an initial surge of attention to the managerial 
construction of diversity, followed by a sustained institutionalization of 
that conception through the height of diversity rhetoric, and then a lev- 
eling off beginning in November 1993. This pattern is consistent with the 
prediction we derived from the law and organizations literature. However, 
because diversity rhetoric has not yet disappeared, we do not know 
whether the managerialization of law will remain at its current level or 
will eventually decline. Thus, although the data appear to support the 
law and organizations literature, we cannot definitively rule out the idea 
(consistent with the managerial models literature) that the manageriali- 
zation of law will decline. 
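
The functional form that model 1 tests (a first-year surge, a rise, and then a leveling off) can be illustrated with a small sketch. This is not the analysis reported in table 3: the data are invented and noise-free, the 120-month window and all coefficient values are assumptions, and the helper names `fit_ols` and `design_row` are hypothetical.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Work on an augmented copy so the inputs are not mutated.
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def fit_ols(rows, y):
    """Ordinary least squares via the normal equations (X'X) b = X'y."""
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(k)]
    return solve_linear_system(xtx, xty)


def design_row(month):
    """Intercept, time, time squared, and a first-year date flag."""
    return [1.0, float(month), float(month) ** 2, 1.0 if month <= 12 else 0.0]


# Hypothetical, noise-free illustration: these coefficients are invented.
TRUE_B = [2.0, 0.30, -0.002, 1.5]
months = list(range(1, 121))
rows = [design_row(t) for t in months]
y = [sum(b * x for b, x in zip(TRUE_B, row)) for row in rows]

b0, b_time, b_time_sq, b_flag = fit_ols(rows, y)
# Month at which the quadratic time trend stops rising.
turning_point = -b_time / (2.0 * b_time_sq)
print(round(turning_point, 1))
```

With these invented values the quadratic trend stops rising at month 75, that is, 0.30 / (2 × 0.002). A positive coefficient on time combined with a negative coefficient on time squared is what produces a rise followed by a leveling off, and the date flag captures the separate first-year surge.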

Model 2 adds the author variable and thus provides a test of hypothesis 
2, which predicted that managers and management consultants would be 
most likely to adopt the managerial conception of diversity. “Management 
consultant” is the omitted category. The results show that management 
academics, staff writers, and editors are significantly less likely than are 
management consultants to employ nonlegal conceptions of diversity. Executives
and managers do not differ significantly from management consultants.
Consistent with this finding, our qualitative reading of the literature
shows few differences between management consultants and 
managers. Thus, management consultants appear to be fueling the non- 
legal conception of diversity, to some extent in concert with managers 
within organizations. 
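
The role of the omitted category in model 2 can be made concrete with a short sketch of dummy coding. The function name `dummy_code` and the five example authors are hypothetical; only the choice of "management consultant" as the reference category comes from the text.

```python
def dummy_code(values, reference):
    """Return one 0/1 indicator column per category, omitting `reference`."""
    categories = sorted(set(values) - {reference})
    return {c: [1 if v == c else 0 for v in values] for c in categories}


# Hypothetical sample of author professions, one per article.
authors = [
    "management consultant",
    "management academic",
    "staff writer",
    "executive or manager",
    "management consultant",
]
columns = dummy_code(authors, reference="management consultant")
for name, column in columns.items():
    print(name, column)
```

Because the reference category has no column of its own, articles by management consultants are coded zero on every indicator. Each profession's coefficient is therefore a contrast with management consultants, which is why the results are stated as "less likely than" or "do not differ from" that group.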

In our discussion of the content of diversity rhetoric, we showed how 
authors emphasized the rationality of the diversity model by claiming that 
it had value for organizational profit. Hypothesis 3 predicted that authors’ 
emphases on rationality would, in turn, affect the managerialization of 
law. Model 3, then, adds the reasons that authors cite when supporting 
diversity to the basic model. The omitted category is specifying no reason 
for diversity. The profit variable is statistically significant, suggesting that 
where authors rhetorically construct diversity as good for profit, they are 
also more likely to adopt managerial (nonlegal) conceptions of diversity. 

Similarly, we showed that authors emphasize the novelty of the diversity 
model by explicitly distinguishing the ideas underlying the diversity model 
from those underlying civil rights law and by negatively characterizing 
civil rights law. Model 4 shows two measures of novelty (references to 
civil rights and negative judgements about civil rights) and thus provides 
a test of hypothesis 4, which predicts that portrayals of the novelty of 
diversity will contribute to the nonlegal constructions of diversity. Both 
measures have statistically significant coefficients, suggesting that where 
authors do not refer to civil rights and where authors have strongly neg- 
ative views of civil rights, they are more likely to adopt a managerial 
conception of diversity. As model 4 shows, this finding holds even where 
author’s profession is included in the model. 


2 Of course, regression cannot test causation explicitly, and to some extent, the cau- 
sation is probably bidirectional: managerialized constructions of diversity may give 
rise to new arguments about the rationality of diversity management. However, our 
qualitative data suggest that the primary direction of causation is from rhetorical efforts 
to construct the rationality of diversity management (including “valuing diversity”) to 
how managers construct the meaning of diversity. 

2 Appendix table B1 shows a year-by-year breakdown of the number of articles in the 
diversity data set that do and do not refer to civil rights and of the average percentage 
of the total constructions of diversity that are nonlegal. From 1987 to 1990, when there 
are very few articles on diversity overall, articles that refer to civil rights appear more 
likely than those that do not to use nonlegal constructions of diversity. During the 
early phase of diversity rhetoric, it makes sense that authors would seek explicitly to 
distinguish the two (emphasizing the novelty of the diversity model) and to embrace 
the new, nonlegal, construction of diversity. From 1991 to 1996, however, articles that 
do not mention civil rights at all are more likely to use nonlegal constructions of 
diversity (the greater number of articles in these years is consistent with the negative 
effect of referring to civil rights on nonlegal constructions of diversity shown in table 
3). Again this is not surprising; as diversity rhetoric becomes more institutionalized, 
authors construct the novelty of diversity by omitting references to civil rights
altogether, and this is associated with a greater likelihood of constructing diversity
nonlegally. Note that the table in app. B does not control for whether references to civil 
Model 5 includes all the variables. Because reasons for supporting 
diversity are related to an author’s stance on civil rights and to an author’s 
profession, the profit variable is no longer statistically significant in 
model 5. 

Summary —Taking our qualitative and quantitative analyses together, 
our findings are consistent with previous studies of managerial rhetorics 
in a number of ways. Diversity rhetoric is a form of managerial rhetoric, 
which appears to have arisen in response to the decline of political support 
for affirmative action and civil rights law generally, as reflected by the 
decline in managerial rhetoric on EEO and AA shown in figure 1 (cf. 
Stryker et al. 1999). Workforce 2000 apparently served as the catalyst for 
the inception of diversity rhetoric by suggesting that a dramatic change 
in the workforce was on the horizon. Diversity rhetoric enjoyed consid- 
erable renown during the early 1990s, but like other managerial fashions, 
appears to have entered a period of decline. 

Our study differs from previous work, however, by showing how man- 
agerial rhetorics about law have the potential to transform legal ideas. 
Diversity rhetoric, which tends to equate differences based on geography 
or taste in sports or dress style with differences based on race or sex, 
offers a conception of equal employment opportunity that is quite different 
from that embodied in the statutory language of Title VII or other civil 
rights laws. Diversity rhetoric replaced the legal vision of diversity, which 
is grounded in moral efforts to right historical wrongs, with a managerial 
vision of diversity, which is grounded in the notion that organizations 
must adapt to their environments in order to profit. More generally, our 
analyses of diversity rhetoric provide support for the argument that man- 
agerial rhetoric that is related to law may produce a managerialization 
of law, or the infusion of managerial values and understandings into legal 
ideas. 

The impact of managerial rhetorics on constructions of law and legal 
ideas, moreover, is probably part of a broader story about the blurring 
of organizational fields. To address that story, we add a twist to the relationship
between the formal legal model of diversity and the managerial 
construction of diversity. As we noted earlier, the legal ideal of diversity 
is itself not as simple as what might be found on the pages of a statute. 


rights are negative, a factor that is taken into account in the regression analyses shown 
in table 3. 

* As with the construction of rationality, the relationship between rhetorical construc- 
tions of the novelty of diversity management and how managers conceptualize diversity 
is likely to be, to some extent, bidirectional: managers’ conceptualization of diversity 
may encourage an emphasis on how managing diversity differs from compliance with 
law. Nonetheless, our qualitative results support the idea that efforts to construct the 
novelty of managerial models encourage a managerialized conception of diversity. 
While we used statutory language as an ideal type for the purpose of 
analyzing the managerialization of law, we now offer some theoretical 
ideas about the more complicated interplay between managerial and legal 
ideas. 


MANAGERIAL RHETORICS AND BLURRING OF FIELDS 


Managerial rhetorics do not evolve in a vacuum. And in particular, law- 
related managerial rhetorics do not evolve independently of legal envi- 
ronments. We refine our discussion by suggesting that diversity rhetoric 
had some roots both in judicial doctrine and in preexisting managerial 
discourse. We discuss these possible origins of managerial thinking about 
diversity and suggest that they may have implications for theories of 
organizational fields. 

The idea that a heterogeneous workforce can be a resource for organ- 
izations is in fact similar to a logic that has appeared in at least three 
legal arenas: cases dealing with juries, educational institutions, and broadcasting.
Moreover, in some of these areas, courts have implied that diversity
goes beyond categories (such as race, sex, age, and disability 
status) that are explicitly protected by law. In decisions concerning these 
social institutions, judges construe diversity as valuable for its democracy- 
enhancing, or difference-valuing, capacities; it brings together people with 
different attitudes and cultural competencies, thus creating better juries, 
better educational institutions, and better broadcasting. In the famed 
Bakke case, for example, Justice Powell explicitly distinguished diversity 
from legally protected categories, stating that race is only one of many 
factors that should be considered in pursuit of diversity in higher education
(438 U.S. 265 at 314 [1978]). In Metro Broadcasting, the Supreme 
Court held that programs increasing minority representation were substantially
related to the broader goal of “broadcast diversity” or diversity 
of viewpoints (110 S.Ct. 2997 at 3008 [1990]). And the Fifth Circuit Court 
of Appeals in Hopwood (78 F.3d 932 at 941-948 [1996]) clearly distinguished
racial diversity from a broader conception of diversity when it 
held that diversity was not a sufficiently compelling rationale to justify 
race-conscious admissions at the University of Texas Law School. In these 


See Ballard v United States, 329 U.S. 187 (1946) and Taylor v Louisiana, 419 U.S. 
522 (1975) on the fair cross-section requirement in jury selection; Sweatt v Painter, 
339 U.S. 629 at 634 (1950), University of California Regents v Bakke, 438 U.S. 265, 
311-320 (1978), and Texas v Hopwood, 78 F.3d 932 at 941-948 (1996) on diversity as 
a justification for affirmative action in university admissions; and Metro Broadcasting, 
Inc., v FCC (110 S.Ct. 2997 at 3008 [1990]) on diversity as a justification for affirmative 
action in the allocation of broadcasting licenses. 
and related decisions, diversity is associated with the ideals of justice, 
community, and democracy. Although this logic is not identical to the 
notion of diversity as valuable for profit and industrial harmony, it is a 
parallel logic, one that employs similar ways of thinking and values in a 
different context. 

The idea of diversity as a resource is not completely new in management 
either. The idea that teams or groups in organizations benefit from “di- 
verse” membership has been supported in the management literature for 
decades, although it was usually called “group member heterogeneity.” 
The logic behind this construct was that most group activities require a 
variety of abilities, skills, and information and therefore a heterogeneous 
group will be more effective (Shaw 1976). Early work examined hetero- 
geneity along a number of dimensions including personality, sex, race, 
and conceptual systems (e.g., Hoffman and Maier 1961; Schutz 1955). The 
effect of group composition on performance has continued to be an im- 
portant aspect of group problem solving (Aldag and Fuller 1993). Recent 
work has supported the notion that heterogeneity in groups is generally 
an advantage and that it may offer opportunities for competitive advan- 
tage and increased creativity (Milliken and Martins 1996; Watson, Kumar, 
and Michaelsen 1993) but suggests that the positive effects may only occur 
in some situations. Whether a heterogeneous group is better depends, for 
example, on factors such as time constraints, type of group, type of prob- 
lem, and stage of group development (Watson, Johnson, and Merritt 1998). 
The diversity as a resource idea follows from the older ideas about group 
heterogeneity. 

Further, one of the key ideas in diversity rhetoric is that employees will 
perform better if managers value their diverse backgrounds and styles 
(referred to throughout the management literature as “valuing diversity”). 
This idea has its roots in the human relations school, which originated 
in the 1920s (Bendix 1956; Barley and Kunda 1992). The notion that 
managers can enhance firms’ competitiveness through attentiveness to 
employees’ cultural needs is in fact a form of cultural logic that has 
resurfaced on numerous occasions in managerial rhetoric; Barley and 
Kunda suggest that the most recent wave of cultural logic (which includes 
“total quality management” and “world class engineering”) blossomed in 
the early 1980s. They also suggest that management academia has fed 
the cultural movement in management by engaging in studies of organ- 
izational culture (e.g., Deal and Kennedy 1982; Donnelly 1984; Schein 
1985; Van Maanen and Kunda 1989). Although we do not posit a strong 
link between the diversity rhetoric and other cultural rhetorics, the cultural
logic that was prevalent in management in the 1980s may have 
played a role in the development of diversity rhetoric. 

Consciously or not, fashion setters who promulgated diversity rhetoric 
were probably influenced, at least indirectly, by ideas about diversity that 
existed previously in both the legal and managerial arenas. Of course, 
few managers follow legal developments or read law cases closely enough 
to discover judicial rationales. Nonetheless, professional networks provide 
ways for flows of (filtered) information from legal fields to enter into 
managerial consciousness. Lawyers sometimes write manuals or articles 
that appear in the professional personnel literature and influence mana- 
gerial thinking. Lawyer-dominated organizations like the Bureau of Na- 
tional Affairs run workshops for managers on law-related issues. Some 
law firms offer websites with information on human resource practices. 
Employment lawyers participate in the same market of business ideas as 
do managerial consultants. When lawyers write for managerial audiences 
or try to sell their own ideas to employers, they are likely to frame both 
legal problems and their solutions in ways that make sense to managers. 
Ideas may flow from legal to managerial arenas in more diffuse ways as 
well. Since Bakke, for example, talk of diversity has become commonplace 
on university campuses. Virtually all managers have spent a good deal 
of time on college campuses while getting their bachelor’s and master of 
business administration degrees and have therefore had some exposure 
to the concept of diversity, even if they did not consider it seriously. 

That the roots of diversity rhetoric may be found in law or in older 
managerial ideas has implications for the way that organization theorists 
think about the “fields” in which organizations operate. Most organization 
theorists lump all aspects of organizations’ environments into organiza- 
tional fields: they include not only similar organizations, their suppliers, 
and their customers but also regulatory organizations that influence the 
field and professionals that operate within and around organizations 
(DiMaggio and Powell 1983). Although the construct of a singular or- 
ganizational field has had a very useful place in organization theory, we 
suggest that it may oversimplify the nature of organizational environ- 
ments. In particular, we suggest that the idea that organizations exist in 
a unitary field obscures areas of ambiguity and confusion that are critical 
sources of new managerial rhetorics—and eventually—of new institu- 
tional models. 

Thus, it may be useful to consider the multiple and overlapping fields 
in which organizations operate (Stryker 2000). We continue to envision 
organizational fields as the primary environments of organizations, con- 
sisting of the other organizations with which a focal organization regularly 
interacts or to which it compares itself as well as the flows of commu- 
nication, logic, and ideas that evolve within that field. But at least for 


* One example we viewed on April 8, 2000, was the website of Epstein, Becker, and 
Green (http://www.ebglaw.com). 
the purposes of understanding how managers shape organizational understandings
of law, we suggest that it is useful to distinguish at least two 
other fields that overlap substantially with organizational fields: legal 
fields and managerial fields. By legal fields, we mean the environments 
surrounding courts, including legislatures, administrative agencies, law- 
yers and litigants as well as legislation and legal logics regarding the form 
and content of justice that surround these actors (Edelman, in press). 
Managerial fields include business schools and their faculty, management 
consultants, and the management professions within organizations as well 
as the various logics about good management. Other somewhat more 
distant fields overlap with organizational fields as well. For example, 
educational fields, which would include all schools and universities, train 
and socialize most organizational actors and, largely through those actors, 
provide or refine much of the knowledge that is used in organizational 
fields. Or scientific-technical fields can compete with legal fields in influ- 
encing organizations (Stryker 1994, 2000). And even various activist ac- 
ademic fields, such as difference feminism or identity politics (e.g., Gilligan 
1982; Williams 1991), produce actors and ideas that influence organiza- 
tions. The list is probably endless and, of course, varies for different 
organizational fields. For now, we confine our discussion to organizational, 
legal, and managerial fields. 

Although there is significant overlap between organizational, legal, and 
managerial fields, each field in fact has its own institutional logic and 
hence its own institutionalized models (cf. Stryker 1994, 2000). The legal 
field, for example, is characterized principally by “liberal legal logic” or 
the ideal that laws are reasoned and apply impartially and universally to 
all actors and groups. But other logics also develop in legal fields, such 
as “alternative dispute resolution,” which advocates less formal and more 
participatory modes of settling disputes (Edelman, in press). The man- 
agement field is organized around the production and sale of new ideas 
about governance to employers. It is characterized by the logic of capi- 
talism so that managers can appeal to employers: the logics of control 
over employees, profit, and legitimacy play key roles. 

New ideas evolve in the interstitial spaces at the boundaries of these 
fields, where the different logics of each field meet, intersect, and become 
intertwined, encouraging cross-fertilization of rhetorics and ideas (Edel- 
man, in press; Stryker 2000). Interstitial spaces create the arena for the 
production of new ideas by providing a locale for the comparison, 
exchange, and influence of institutional logics. There are many of these 
interstitial spaces where fields overlap, but a few are probably of key 
importance. For example, regulation is a key point of overlap between 
legal and organizational fields: as regulators grapple with how to implement
the law in organizational settings and how to respond to organizational
lobbying and the presentation of organizational problems, organizational
practices are bound to influence regulators’ ideas about 
compliance. Since organizational practices are themselves largely the 
product of managerial ideas, regulation also provides a vehicle for im- 
porting logic from managerial fields. Similarly, managerial consultants are 
a key point of overlap between organizational and managerial fields. 
Organizations regularly seek out the advice of managerial consultants and 
internal managers regularly interact with managerial consultants through 
professional networks, thus providing for the intermingling of ideas from 
organizational and managerial fields. And since managerial consultants 
often read works written by lawyers, they also constitute a key point of 
overlap between legal and managerial fields. In some cases, parallel ideas 
may develop separately in two or more fields, influencing one another 
only indirectly through exposure in the interstices. In other cases, models 
institutionalized in one field may be transformed substantially as they are 
carried across field boundaries. 

Managerial rhetoric, understood as both a reflection and a producer of 
organizational culture, constitutes a key vehicle for the flow of norms, 
scripts, and models of management across field boundaries (Fuller, Ed- 
elman, and Matusik 2000). Managerial rhetorics about law import legal 
norms and legal scripts into managerial fields, instill them with managerial 
logic, and promulgate them throughout organizational fields. But because 
organizational fields overlap with legal fields, managerial rhetorics also 
influence ideas in legal fields (Edelman, in press; Edelman et al. 1999; cf. 
Stryker 2000). 

Diversity rhetoric illustrates the flow of ideas and logics across multiple 
fields. The idea of diversity as a resource had surfaced in subtle ways in 
both the legal and managerial fields prior to Workforce 2000. But Work- 
force 2000 appears to have served as a catalyst for the institutionalization 
of diversity rhetoric—and for the managerialization of law—by helping 
to create a dilemma that managerial consultants could play upon to pro- 
mote their “new” ideas. The idea of diversity as a resource justified and 
legitimated an expanded nonlegal conception of diversity that found sup- 
port in the logics of heterogeneous groups in managerial fields and in 
judicial ideas regarding the enhancement of democracy in legal fields. 
Given a legal environment in which the meaning of civil rights law was 
becoming increasingly contested, the managerial conception of diversity 
appealed to actors throughout managerial fields and quickly gained wide- 
spread acceptance. 
CONCLUSION 


Our analysis shows that managerial rhetoric may help transform law even 
when it explicitly seeks to disassociate managerial models from law. Di- 
versity rhetoric replaces the formal legal model (which privileges race, 
sex, religion, national origin, age, veteran status and disability status) with 
a managerial model that renders categories that are not legally relevant 
(such as geographic location, organizational rank, dress style, communi- 
cation style, and attitudes) equally important. The managerial conception 
of diversity de-emphasizes the law’s focus on discrimination, injustice, 
and historical disenfranchisement and supplants it with a conception of 
diversity grounded in organizational success. 

The managerialization of law has both costs and benefits for legal ideals. 
The costs come from the potential of the managerial vision to undermine 
law’s moral commitment to redressing historical wrongs. In elaborating 
the idea of diversity as a resource for organizational competitiveness, 
managerial rhetoric about diversity tends to deflect attention from the 
societal and historical practices that disenfranchised particular groups and 
instead emphasizes the value of recognizing all forms of difference. If the 
white farm boy from Idaho is considered as important to firm diversity 
as the black inner-city kid from Los Angeles on the basis of geographic 
diversity, then diversity can more easily be used to justify a workforce 
that is primarily white or male (but is diverse on other dimensions). This 
broad understanding of diversity is more consistent with older ideas about 
team heterogeneity and it is less threatening to managerial interests in 
competitiveness, but it tends to divest law of its moral component. Di- 
versity becomes conditional upon serving corporate interests rather than 
grounded in social justice. 

Nonetheless, the managerialized form of law may have a greater ca- 
pacity than the legal form to institutionalize legal values within organi- 
zations. Diversity rhetoric may represent a weakened ideal of civil rights, 
but it has the potential to have a broad impact on the daily lives of 
employees. Managers naturally resist law that seeks to constrain tradi- 
tional managerial prerogatives (Edelman 1992). And while civil rights law 
offers the ideal that lawsuits would eradicate discrimination, it in fact 
never provided a panacea for women, minorities, and other disenfran- 
chised groups. Barriers to mobilization (Bumiller 1987, 1988), cooling out 
by organizational complaint handlers (Edelman et al. 1993), and conser- 
vative interpretations of law (Schultz 1990; Crenshaw 1988; Freeman 
1990) all serve as important obstacles to the realization of civil rights. 
But precisely because diversity rhetoric constructs diversity as consonant 
with managerial interests and values, it may overcome much of the man- 
agerial resistance to nontraditional workers by transforming the notion 
of “difference” from one of legal imposition to one of business advantage. 
Thus, the managerialization of law can evoke a legalization of organi- 
zations: as legal ideas are recast in managerial terms, they may be weak- 
ened but they are nonetheless more easily incorporated into organizational 
routines. 

The managerialization of law is not limited to the diversity context, 
nor is it limited to managerial rhetoric. Several studies, although not 
originally labeled as such, reflect a managerialization of law in managerial 
practices or rhetoric. Discrimination complaint handlers’ recasting of dis- 
crimination complaints as interpersonal or managerial problems (Edelman 
et al. 1993), managers’ inflation of wrongful discharge claims (Edelman 
et al. 1992), and managers’ claims about the value of discrimination griev- 
ance procedures (Edelman et al. 1999) are all examples of the manager- 
ialization of law. There are no doubt other examples in the context of 
workers’ compensation law, the Family and Medical Leave Act, the Amer- 
icans with Disabilities Act, and other types of law in which managers 
play a major interpretive role. Private police forces that focus on 
activities that are managerially rather than legally threatening, and that structure
their enforcement efforts more to minimize disruption of the corporation
than to maximize public safety, are another example (Edelman and Suchman
1999). And Suchman’s work on the collaboration of managers and 
lawyers in Silicon Valley might exemplify a related form of the mana- 
gerialization of law (Suchman 1994; Suchman and Cahill 1996). 

This article illustrates the potential for fruitful linkages between the 
literature on managerial rhetorics and models and the literature on law 
and organizations. More study of managerial rhetorics about law and 
their potential to transform law would help to strengthen organizational 
theory in both areas. Clearly, future scholarship should address the re- 
lation between managerial rhetorics about law and organizational prac- 
tices. In the diversity context, for example, future research should address 
how managerial constructions of diversity relate to the operation of di- 
versity councils, diversity training, and diversity offices in organizations. 
We have suggested that, although diversity rhetoric may undermine the 
legal rights of women and minorities, it may also promote the institu- 
tionalization of legal ideas in organizations. Only empirical studies of 
organizational practices can reveal whether diversity is ultimately harmful 
or helpful to previously disenfranchised groups. And since there is likely 
to be substantial variation in the extent to which diversity programs 
embrace (or neglect) civil rights law, the sources of that variation ought 
to be studied. 

Our study also calls attention to the blurring of boundaries between 
legal, managerial, and organizational fields. The import of these overlap- 
ping fields is that the potential for organizations to influence the law that 
regulates them may be substantially greater than previously demonstrated 
by studies of regulatory capture or organizational compliance. But because 
ideas flow across field boundaries in multiple directions, law may also 
influence organizations as managers appropriate and transform legal prin- 
ciples, leading to the institutionalization of (metamorphosed) legal ideas 
in organizational fields. 
APPENDIX B 


TABLE B1 
PROPORTION OF CONSTRUCTIONS OF DIVERSITY THAT ARE NONLEGAL 
(ARTICLES IN THE DIVERSITY DATA SET) 

                                          ARTICLES 
                                 Diversity    Diversity with 
YEAR                             Only         Civil Rights 
1987: 
  N of cases ...................     0              1 
  Nonlegal constructions .......  n.a.             60 
1988: 
  N of cases ...................     1              4 
  Nonlegal constructions .......    36             49 
1989: 
  N of cases ...................     2              2 
  Nonlegal constructions .......     8             19 
1990: 
  N of cases ...................     2              6 
  Nonlegal constructions .......    17             19 
1991: 
  N of cases ...................     8             10 
  Nonlegal constructions .......    30             26 
1992: 
  N of cases ...................    13              9 
  Nonlegal constructions .......    40             31 
1993: 
  N of cases ...................    44             27 
  Nonlegal constructions .......    30             28 
1994: 
  N of cases ...................    34             20 
  Nonlegal constructions .......    41             42 
1995: 
  N of cases ...................    21             28 
  Nonlegal constructions .......    32             25 
1996: 
  N of cases ...................    18             25 
  Nonlegal constructions .......    31             16 
Total: 
  N of cases ...................   143            132 
Effectively Maintained Inequality: Education 
Transitions, Track Mobility, and Social 
Background Effects¹ 


Samuel R. Lucas 
University of California, Berkeley 


This article proposes a general explanation for social background- 
related inequality. Educational attainment research indicates that 
the later an education transition, the lower the social background 
effect. While some suggest life course changes in the parent-child 
relationship or between-family competition explain this pattern, oth- 
ers contend the result is a statistical artifact, and that the analytic 
strategy presupposes agents are irrationally myopic. This article ad- 
dresses these criticisms by framing educational transitions in terms 
of students’ movement through the stratified curriculum. Students 
select their stratum, one of which is dropping out. To make these 
choices, they consider their most recent salient performance. Using 
time-varying performance measures to predict students’ track place- 
ment/school continuation sustains the validity of the educational 
transitions approach and suggests substantively important social 
background effects even for nearly universal transitions. Results are 
consistent with the general perspective termed effectively main- 
tained inequality. 


Two distinct literatures have developed to examine students’ movement 
through secondary school. One strain of research uses ethnography and 
statistical analysis to focus on students’ placement in the stratified cur- 
riculum or, in other words, students’ track location. This literature treats 
track location both as a determinant of important factors such as achieve- 
ment (e.g., Gamoran and Mare 1989), college entry (e.g., Rosenbaum 1980), 
and political efficacy (e.g., Paulsen 1991), and as an outcome in its own 
right (e.g., Rehberg and Rosenthal 1978; Hallinan 1992). 


¹ I thank Adam Gamoran for comments on an earlier draft, and Aimée Dechter and 
three anonymous reviewers for additional helpful comments. All analyses were con- 
ducted at the Survey Research Center of the University of California-Berkeley using 
LIMDEP 7.0 running on a Sun Sparc (for all model estimation) and SPSSx version 
4.0 running on an IBM RS6000 (for all data manipulation). An earlier version of this 
article was presented at the American Sociological Association Annual Meeting in San 
Francisco, August 1998, under the title “Bringing the Tracks (All the Way) Back In: 

A second line of inquiry has viewed educational attainment as a process 
of completing a sequence of transitions. In this line of research, henceforth 
termed the educational transitions tradition, students in any given grade 
either continue on to the next grade or level of schooling or end their 
formal education. Thus, educational attainment is analyzed as the cu- 
mulation of a sequence of yes/no decisions (e.g., Mare 1980; Shavit and 
Blossfeld 1993). By treating educational attainment in this manner, an- 
alysts have sought to discern the transition points where social background 
effects are largest, and they have apparently shown a nearly universal 
pattern, whereby effects of social background appear lower for later ed- 
ucational transitions than for earlier ones. However, as analysts have 
attempted to explain this common pattern, the entire enterprise of edu- 
cational transitions analysis has come to be questioned; indeed, some 
researchers now call for the abandonment of educational transitions re- 
search (e.g., Cameron and Heckman 1998). 

This division between the tracking literature, on the one hand, and the 
educational transitions literature, on the other, is unfortunate for at least 
four reasons. First, tracking research provides important findings that can 
justify continuing to study educational attainment as a sequence of tran- 
sitions. Recent research suggests that educational transitions analyses are 
based on untenable behavioral assumptions, but findings from research 
on tracking imply at least some of those assumptions are actually 
appropriate. 

Second, a problem in the treatment of dropouts in the tracking literature 
is resolved by adopting an educational transitions perspective. The track- 
ing literature sees each grade as composed of a set of stratified positions, 
and students’ location in the stratification system in a given year has 
potentially important implications for social and psychological develop- 
ment. For this reason, tracking researchers are interested in the deter- 
minants of track location and track mobility. When studying these pro- 
cesses, track researchers have usually deleted dropouts from the analysis 
(e.g., Rosenbaum 1976; Hallinan 1996). When viewed through the lens of 
the education transitions tradition, however, deleting dropouts from the 
analysis is problematic. Instead, given the logic of educational transitions, 
research on track mobility can treat dropout as an important absorbing 
state or destination in the track structure. 

Third, an inconsistency in the educational transitions literature’s treat- 
ment of the stratified curriculum can be resolved by elaborating the des- 
tinations students may reach. The educational transitions literature fo- 
cuses on the accumulation of an additional year of formal education; 
educational attainment is of interest because of its well-documented im- 
plications for social, political, and economic behavior. While the educa- 
tional transitions literature has often considered students as existing in a 
stratified curriculum prior to making a transition, and thus has not ignored 
the insights of tracking research, it has not seen the destination as also 
composed of stratified curricular positions. Thus, the treatment of origins 
and destinations in the educational transitions literature is often not con- 
sistent (e.g., Lucas 1996). This inconsistency is consequential, because 
treating origins and destinations consistently will facilitate investigation 
of two important explanations for the commonly found pattern of waning 
social background effects across transitions: the life course perspective 
(LCP) and maximally maintained inequality (MMI). 

Fourth, when we consider educational transitions through a stratified 
curriculum, it becomes feasible to estimate the effects of social background 
both on making a transition and on reaching particular locations in the 
stratified curriculum. Obtaining both estimates will make it possible to 
investigate whether results support a comprehensive explanation for both 
school continuation and in-school processes of tracking. As school con- 
tinuation and track placement are two parts of the same pro- 
cess—accumulating resources to enter the world of adults—a compre- 
hensive explanation may be appropriate. 

These two literatures separately have provided important insights into 
the workings of schools and society. However, each literature has been 
pursued largely in isolation from the other. This article seeks to bring the 
two together. With respect to educational transitions, the article treats 
both in-school origins and in-school destinations as stratified, the better 
to estimate direct effects of social background and thus allow comparative 
assessment of LCP and MMI. With respect to track mobility, this article 
makes dropping out a part of the process of allocating students to positions 
in school, the better to estimate the effects of factors of interest. And with 
respect to their convergence, the article seeks to address rising doubt as 
to the value of educational transitions analyses and to offer a compre- 
hensive explanation of both facets of educational attainment, an expla- 
nation I term effectively maintained inequality. 

To pursue these aims, I first describe the logic and results of educational 
transitions research. I then relate two theories devised to explain the 
common pattern of results educational transitions analysts have found. I 
then shift attention to tracking research, detailing changes in tracking 
that encourage the convergence of these research traditions. This con- 
vergence facilitates development of a comprehensive explanation for social 
background effects on school continuation and track placement. I propose 
effectively maintained inequality as one such comprehensive explanation. 
After outlining this explanation, I return to educational transitions re- 
search, presenting several existing criticisms of educational transitions 
research and suggesting responses to them. Afterward, the particular 
methods used in this analysis are described, and the results and conclu- 
sions of the research are presented. Thus, to begin this effort, it is im- 
portant to relate the logic and results of educational transitions research. 


EDUCATIONAL TRANSITIONS RESEARCH: LOGIC AND RESULTS 


The educational transitions literature flows from the long-held interest in 
whether the effect of social background on educational attainment differs 
over time or across societies. One way to investigate educational attain- 
ment is to regress years of school completed on a set of explanatory 
variables. However, ordinary least squares (OLS) coefficients reflect not 
only the level of association but also the variance of educational attain- 
ment. Because the variance of educational attainment has changed over 
time given the expansion of educational systems, comparisons of OLS 
coefficients across different cohorts or countries will not reveal whether 
the structural parameters governing the process of educational attainment 
differ. 
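
The dependence on variance is easiest to see in the single-predictor case, where the OLS slope is the correlation rescaled by a ratio of standard deviations. The symbols b, r_xy, s_x, and s_y and the numbers in the comment are illustrative assumptions, not estimates from any study cited here.

```latex
\[
  b = r_{xy}\,\frac{s_y}{s_x}
\]
% Worked example, holding r_{xy} = 0.4 and s_x = 1 fixed:
%   s_y = 3  gives  b = 0.4 \times 3 = 1.2
%   s_y = 2  gives  b = 0.4 \times 2 = 0.8
```

Two cohorts with the same level of association can therefore show different OLS coefficients solely because the spread of educational attainment differs between them.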

To obtain estimates that can be compared across cohorts, Mare (1980), 
following on the work of Fienberg and Mason (1978), proposed that an- 
alysts treat education as a series of transitions or school continuation 
decisions. Mare reasoned that total years of school completed is the result 
of a series of decisions to stop or continue schooling. If each year students 
have the option of continuing or ending their formal education, each 
decision to stop or continue is a binary variable, scored one for students 
who continue on and zero for students who elect to stop. Figure 1 details 
the process of educational attainment as seen through the lens of the 
educational transitions tradition. 


Fig. 1.—Perspective of traditional education transitions analyses 

Using Mare’s approach, analysts have investigated whether logit co- 
efficients for social background change across transitions, and within a 
given transition whether the coefficients change across cohorts, in at least 
18 countries (e.g., Garnier and Raffalovich 1984; Kerckhoff and Trott 
1993; Treiman and Yamaguchi 1993; Shavit 1993; Szelényi and Aschaf- 
fenburg 1993; Raftery and Hout 1993). In nearly every cohort in country 
after country, researchers have found logit coefficients to decline across 
transitions. 
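
The binary coding behind this approach can be made concrete with a small sketch. Assuming, for illustration only, that each student's record is reduced to total years of school completed, the hypothetical helper `expand_transitions` below produces one row per transition the student actually faced, scored one for continuing and zero for stopping. The function name and the cap `n_transitions` are assumptions of this sketch, not part of the analyses cited above.

```python
from typing import List, Tuple


def expand_transitions(years_completed: int, n_transitions: int) -> List[Tuple[int, int]]:
    """Expand one student's attainment into (transition, continued) rows.

    Transition t is the decision to complete year t, faced only by
    students who completed year t - 1. Hypothetical helper for illustration.
    """
    if years_completed < 0 or n_transitions < 1:
        raise ValueError("years_completed must be >= 0 and n_transitions >= 1")
    rows = []
    for t in range(1, n_transitions + 1):
        # Scored one if the student completed year t, zero if the student stopped.
        continued = 1 if years_completed >= t else 0
        rows.append((t, continued))
        if continued == 0:
            break  # once a student stops, later transitions are never faced
    return rows


if __name__ == "__main__":
    # A student who completed two of four possible years.
    print(expand_transitions(2, 4))
```

Fitting a separate logit to the rows for each transition would then yield the transition-specific coefficients whose decline across transitions is at issue.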


EXISTING EXPLANATIONS OF THE WANING COEFFICIENTS 
PATTERN 


The vast majority of educational transitions analyses have found logit 
coefficients for social background to decline across transitions (e.g., Mare 
1980; Shavit and Blossfeld 1993). Two major explanations of this finding 
have been constructed.² 


Life Course Changes 


Müller and Karle (1993) speculate that a life course perspective (LCP) 
could explain the waning coefficients pattern. They suggest that changes 
in the relationship between children and their parents may be at the root 
of the pattern. Allegedly, parental characteristics decline in value because 
students, who are older at each transition, are also less dependent eco- 
nomically and socially on their parents with each transition. If students 
are less dependent on their parents for later transitions, then social back- 
ground should be less important for determining who receives additional 
schooling. 


Maximally Maintained Inequality 


Raftery and Hout offer a second explanation (Raftery and Hout 1993; 
Hout, Raftery, and Bell 1993). These authors argue that the pattern of 
background effects across cohorts can be explained by a theory of max- 
imally maintained inequality (MMI), which has four tenets. 

First, ceteris paribus, expansion of secondary and higher education will 
reflect increased demand generated by two typically glacial forces: (1) 
population increase and (2) social origins upgrading. Second, if enrollment 
rises faster than demand (where demand is the amount of schooling ex- 
pected on the basis of population level and social class background com- 
position), then lower-class persons obtain more schooling. Even so, the 
social class effect remains the same. Third, if completion of a given level 
of education becomes universal for upper-class persons, then the effect of 
social background on that transition declines over time, but only if ed- 
ucational expansion cannot be maintained otherwise (hence the name of 
the theory). And fourth, falling effects of social background can reverse 
to become rising effects of social background. If public support for ed- 
ucation is reduced, social class effects will increase (Hout et al. 1993). 


² Mare (1980) originally proposed that selective attrition might explain the waning 
coefficients pattern, but recent research has led analysts to doubt this explanation is 
sufficient (e.g., Mare 1993; Shavit and Blossfeld 1993; Lucas 1996). 

MMI was devised to explain cross-cohort variation in social background 
effects. Yet, MMI is potentially relevant to understanding the pattern of 
background effects across transitions. Most important in this context is 
the fourth precept of MMI, which states that when public support for a 
particular level of education changes, the impact of social background on 
completion of that transition will also change. Notably, if support for 
making a particular transition declines, then social background will be- 
come more important for making that transition. Given sufficient change 
in the support for a particular transition, MMI suggests that social back- 
ground may actually become more important for later transitions than it 
is for earlier ones. This is the key difference between MMI and LCP; 
LCP emphasizes that as children age they become more and more in- 
dependent of parents, whereas MMI implies that adolescents’ independ- 
ence itself depends on the sociopolitical context and the resulting social 
support for particular levels of education. 

Educational transitions research has furthered our understanding of 
educational stratification. However, research on tracking has also been 
concerned with stratification issues and thus is potentially relevant for 
educational transitions research. Therefore, consideration of tracking re- 
search is in order. 


TRACKING RESEARCH AND IMPLICATIONS FOR EDUCATIONAL 
TRANSITIONS 


MMI and LCP were devised to explain the results of educational tran- 
sitions analyses. Educational transitions research highlights the cumula- 
tive nature of the process of educational attainment. Although researchers 
have at times referenced qualitative differences in schooling to make sense 
of educational transitions results, the qualitative differences these inter- 
pretations highlight are collinear with the level of schooling. This is the 
result of the strategy of educational transitions analyses, which, as figure 
1 suggests, suppresses within-level differences in schooling.³ Thus, in ed- 
ucational transitions research, it is impossible to assess the role of social 
background on qualitative dimensions of schooling that are not collinear 
with the level of schooling. 

This is a limitation of educational transitions analysis, because quali- 
tative differences in schooling are an important feature of educational 
systems in their own right, and many of those qualitative differences are 
not collinear with the level of study. One such qualitative feature is cur- 
ricular tracks. Educational systems are often composed of explicit or im- 
plicit curricular strata or tracks. Such tracks are not collinear with the 
level of schooling. The strategy of educational transitions analysis, there- 
fore, may systematically miss or greatly de-emphasize processes of in-school 
stratification. If qualitative differences in students’ experience are an im- 
portant pathway through which social background affects educational 
transitions, de-emphasizing processes of in-school stratification may lead 
one to mischaracterize the pattern of social background effects on edu- 
cational attainment. 


Change and Stability in Tracking in the United States 


In order for the treatment of tracking to matter for educational transitions 
research, two conditions must be met. First, track location must be an 
important part of the process of educational transitions, and second, social 
background must be a determinant of track location. 

With respect to the first condition, it is important to distinguish more 
recent cohorts from earlier ones. At the inception of the educational tran- 
sition tradition, it may have been sufficient to ignore track location in 
analyzing the cumulation of years of schooling for American cohorts. 
However, considering the way tracking has come to work in the United 
States raises questions as to whether suppressing the complexities of track- 
ing is still warranted. These questions arise because the in-school strat- 
ification system—the track system—is more complex than formerly be- 
lieved. Indeed, as I have noted elsewhere, the simple view of how schools 
work implicit in many theories of the school-economy relation is based 
on an outdated understanding of how schools track students in the United 
States (Lucas 1999). The difference between the old understanding and 
the new reality is best reflected in research on track mobility. 


³ Buchmann and Charles's (1993) analysis of educational transitions in Switzerland is 
an exception. Notably, they included qualitative dimensions in their analysis and failed 
to replicate the waning logit coefficients pattern. Yet, the sample sizes are small (ap- 
proximately 500 for each sex/cohort studied, and each sex/cohort combination is an- 
alyzed separately), the response rate of the study is low (45%), and the data were 
collected long after the modal person would complete high school, allowing dropouts 
to possibly obtain enough later schooling to endanger accurate estimation of the effect 
of parental status characteristics. For these reasons, the implications of the results 
remain unclear. 

Historically tracking researchers found low levels of track mobility and 
a greater incidence of downward mobility than upward mobility. This 
finding was explained by a theory of tournament track mobility. Tour- 
nament track mobility (Rosenbaum 1976) explained students’ extremely 
low mobility chances by highlighting the ideational and institutional sup- 
ports for tracking. Ideationally, mobility might call into question the le- 
gitimacy of earlier placements and thus was subtly discouraged. Insti- 
tutionally, school personnel appeared to hold overwhelming power to 
assign students to tracks, and evidence suggests they wielded this power 
on a regular basis (e.g., Cicourel and Kitsuse 1963). In such a world, it 
would be sufficient to treat track location as an independent variable in 
educational transitions analyses and ignore the track destinations of stu- 
dents, because mobility was so rare that one could regard students in 
disparate tracks as following unalterably separate paths. 

Recent analyses, however, have been unable to replicate earlier findings 
concerning track mobility. Indeed, more recent research finds a high in- 
cidence of mobility, and none find upward mobility to be rare (e.g., Wilson 
and Rossman 1993; Hallinan 1996; Lucas 1999; Lucas and Good 2001). 
As analysts have looked closer, they have come to explain this new set 
of findings by pointing to an unremarked revolution in how schools track 
students (Lucas 1999). Evidence suggests that schools no longer formally 
assign students to overarching tracks that determine their course-taking 
(e.g., Carey, Farris, and Carpenter 1994; Hayes 1990). Now tracking is 
activated in many separate, yearly, subject-specific decisions rather than 
in one global assignment to a track at one pivotal point in a career (e.g., 
Moore and Davenport 1988). 

The above suggests that tracking has become a possibly important part 
of the process of educational transitions. Students are not allocated to one 
track that governs their schooling for multiple years. Instead, students 
encounter many separate decision points that implicitly allocate them to 
different curricula. In order for changes in tracking to matter for edu- 
cational transitions research, however, social background must matter for 
students’ track placement. Are these decisions made under conditions that 
might allow social background to matter for placement? Evidence sug- 
gests that students make enrollment decisions with comparatively less 
counselor input than previously, as counselors have retreated from the 
proactive role they formerly played (e.g., Rosenbaum, Miller, and Krei 
1996). The retreat of the counselors ushers in an environment of ostensible 
student choice; in this environment, researchers continue to find social 
background to matter in track placement (e.g., Garet and DeLany 1988; 
Hallinan 1992). 

Sociologists have identified two ways that social background appears 
to matter. Middle-class parents appear to be proactive both as individuals 
and as a class, maintaining tracking in general and securing for their 
children the best positions within the track structure of the school (e.g., 
Useem 1992; Wells and Serna 1996). Class conflict around schools’ de- 
cisions to track or not track is well-documented, as socioeconomically 
advantaged parents often work to maintain tracking in the schools (e.g., 
Wells and Oakes 1996; Wells and Serna 1996). 

Social background also affects individual placements. Socioeconomi- 
cally advantaged parents can secure advantaged places for their child, 
not only because they may use a wide array of resources in a given 
instance, but perhaps more important, they have personal experiences 
that make it more likely they will be able to recognize the pivotal “given 
instance” to which they may want to bring those resources to bear. They 
know which decision points involve high stakes and which can be ignored 
safely. Having been to college themselves, they can help their children 
navigate the high school curricular structure in ways that make subse- 
quent college entry a real possibility. 

Socioeconomically disadvantaged parents may certainly cheer their 
children on in their efforts to reach college. But socioeconomically ad- 
vantaged parents may not only cheer, but also coach, their children in 
their efforts to reach college. The difference between cheering and coach- 
ing is information as to what matters for college. Given the retreat of the 
former school gatekeepers, as evidenced by existing research, and the 
episodic nature of track decisions, students whose parents lack such 
knowledge are on their own. This environment, with many more decision 
points, unclear signs for the uninitiated, and continued effects of social 
background, has two important implications for how we may study ed- 
ucational attainment. 


Implications of Tracking Research for Educational Transitions 
Research 


The first implication of tracking research is that we need to change our 
focus from the yes/no decision of educational transitions research to de- 
cisions about a more varied set of options. Students make decisions yearly 
as to which structural path they will follow—drop out; stay in school in 
college preparatory courses in academic subjects such as math, English, 
foreign language, science, and social studies; stay in school in noncollege 
preparatory courses in academic subjects; or stay in school but avoid 
academic subjects. Thus, this new state of affairs means both that the 
vision of educational transitions evident in figure 1 is no longer sufficient 
and that educational attainment is produced through a sequence of tran- 
sitions. This suggests it is potentially useful for the tracking and educa- 
tional transitions literatures to converge. When these literatures converge, 
the vision of the process of educational transitions is as illustrated in figure 
2. In figure 2, dropping out becomes one of a small subset of potential 
locations to which students may move, and students who decide to con- 
tinue also decide within which curriculum they will continue. Seen in this 
way, the distinction between the educational transitions literature, on the 
one hand, which treats school as a sequence of transitions, and the track 
mobility literature, on the other hand, which sees students as moving 
through a stratified curriculum, dissolves. 


Fig. 2.—Education transitions with stratified destinations 

Second, the unremarked revolution heightened the potential importance 
of student performance for students’ placement in the stratified curric- 
ulum. The consistent finding from the recent research is that there is far 
more mobility in the recent period than previously observed. Students 
routinely obtain updated information as to their performance in school 
in the form of grades; these grades are obtained in particular curricular 
positions. Students and others can use this updated information—this 
time-varying covariate—to aid in placements for the following academic 
year. This is possible because placements for the following year are not 
completely determined by prior placement, as seemed to be the case prior 
to the change in school practice. Thus, time-varying covariates are a 
routinized and potentially consequential feature of the educational career. 
Analysts, therefore, may use such information to aid their study of the 
determinants of educational transitions. 
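
One way to picture the records this implies is sketched below. The four destinations are the yearly options described earlier; the names `Destination`, `StudentYear`, and `build_transition_rows`, and the use of a single grade average as the time-varying covariate, are assumptions made for illustration and do not describe the data set analyzed in this article.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List, Optional, Tuple


class Destination(Enum):
    # Numeric codes are labels only; no ordering is assumed in this sketch.
    DROP_OUT = 0
    NO_ACADEMIC = 1
    NONCOLLEGE_PREP = 2
    COLLEGE_PREP = 3


@dataclass
class StudentYear:
    grade_level: int
    position: Destination
    grades: Optional[float]  # performance earned in that position; None after dropping out


Row = Tuple[int, Destination, Optional[float], Destination]


def build_transition_rows(career: List[StudentYear]) -> List[Row]:
    """Pair each year's origin and grades with the following year's destination."""
    rows: List[Row] = []
    for origin, dest in zip(career, career[1:]):
        if origin.position is Destination.DROP_OUT:
            break  # dropping out is treated as an absorbing destination
        rows.append((origin.grade_level, origin.position, origin.grades, dest.position))
    return rows


if __name__ == "__main__":
    career = [
        StudentYear(10, Destination.COLLEGE_PREP, 2.1),
        StudentYear(11, Destination.NONCOLLEGE_PREP, 2.8),
        StudentYear(12, Destination.DROP_OUT, None),
    ]
    for row in build_transition_rows(career):
        print(row)
```

Each row carries the most recent performance in the origin position, so the covariate varies across transitions for the same student.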
The joint study of educational transitions and track mobility raises the 
possibility of providing a comprehensive explanation of the role of social 
background in educational attainment. I propose that a theory of effec- 
tively maintained inequality (EMI) may illuminate both processes of 
school continuation and track mobility. 

Effectively maintained inequality posits that socioeconomically advan- 
taged actors secure for themselves and their children some degree of ad- 
vantage wherever advantages are commonly possible. On the one hand, 
if quantitative differences are common, the socioeconomically advantaged 
will obtain quantitative advantage; on the other hand, if qualitative dif- 
ferences are common the socioeconomically advantaged will obtain qual- 
itative advantage. 


EMI 


For educational attainment, the above claims could be true in one of at 
least two ways. It may be that as long as a particular level of schooling 
is not universal (e.g., high school completion throughout the first half of 
the 20th century in the United States), the socioeconomically advantaged 
use their advantages to secure that level of schooling. Once that level of 
schooling becomes nearly universal, however, the socioeconomically ad- 
vantaged seek out whatever qualitative differences there are at that level 
and use their advantages to secure quantitatively similar but qualitatively 
better education. 

The above suggests that the focus of activity may change over time as 
qualitative differences supplant quantitative differences in importance. 
Alternatively, it is possible that even when quantitative differences are 
common, qualitative differences are also important; if so, I posit that the 
socioeconomically advantaged will use their socioeconomic advantages to 
secure both quantitatively and qualitatively better outcomes. 

Unfortunately, it is not possible to adjudicate between these two pos- 
sibilities in the current analysis, because we lack detailed data on the 
track placements of earlier cohorts. However, it is possible to determine 
whether in one cohort studied we find consequential effects of social back- 
ground on qualitative placements, even where quantitative differences 
are virtually nil. Finding such a pattern of effects will provide evidence 
consistent with effectively maintained inequality; the central implication 
of EMI for educational attainment is that for nearly universal levels of 
schooling, background will affect differences in kind. 
EMI, MMI, and LCP 


EMI and LCP are not mutually exclusive, but they do highlight different 
processes. LCP emphasizes the relationship between children and their 
parents, and how that relationship changes over time as children mature. 
Owing to changes in the parent-child relationship, parental status char- 
acteristics are expected to wane in importance over time. EMI is agnostic 
with respect to whether this causal story explains the waning logit co- 
efficient pattern. The pace at which persons’ own status characteristics 
replace their parents’ status characteristics is both an empirical question 
whose answer may depend on the outcome under study and an issue in 
need of theoretical reflection beyond the scope of this analysis. However, 
EMI implies that for as long as parents’ status characteristics are students’ 
status characteristics, the children of the socioeconomically advantaged 
will be allocated to quantitatively and qualitatively advantaged positions 
on that basis. Further, EMI implies that once parents’ status character- 
istics are truly replaced by children’s own status characteristics, the adult 
children’s own status characteristics will serve the same purpose parents’ 
status characteristics served in the past. 

EMI and MMI are somewhat more divergent. Both highlight class 
competition between families, but MMI suggests competition will be nil 
for any level of education that is universal. In contrast, EMI implies that 
for levels of education that are universal, competition will occur around 
the type of education attained. Thus, for some levels, MMI implies the 
maximum amount of background-related inequality is virtually zero, 
whereas EMI implies that for those very same levels inequality will not 
only be nonzero but also nontrivial, that is, the background-related ine- 
quality will be consequential. However, MMI and EMI do not disagree 
about every aspect of schooling, for both perspectives would predict social 
background effects to be nontrivial at levels of education that are not 
universal. 


EVALUATING EDUCATIONAL TRANSITIONS RESEARCH: 
CRITIQUES AND RESPONSE 


MMI and LCP were crafted to make sense of the waning coefficients 
pattern educational transitions researchers have found. I have suggested 
that bringing tracking and educational transitions research together is 
appropriate, given changes in tracking, and useful, given the possibility 
of developing a comprehensive explanation of school continuation and 
track mobility. 

However, several analysts have suggested that the waning coefficients 
pattern of educational transitions research may be a statistical artifact. 
The arguments concerning this issue have raised doubt as to whether 
researchers should continue to study educational attainment in this man- 
ner. Thus, in order to continue to conduct educational transitions research, 
even educational transitions research informed by research on tracking, 
a response to the critiques is required. 


Improper Functional Form and Untenable Behavioral Assumptions 


Cameron and Heckman’s (1998) criticisms of educational transitions re- 
search culminate in their declaring that the near universal finding of 
declining logit coefficients should be ignored. They argue that the finding 
is an artifact of ad hoc decisions regarding the functional form of the 
model. They reach this conclusion by contending that the functional form 
for the model makes it possible to obtain estimates, but absent time- 
varying covariates, it is impossible to assess whether effects of social 
background vary across transitions. They show that the logistic regression 
model with transition-specific coefficients can be identified when the val- 
ues of at least one covariate vary across transitions, but they maintain 
that time-varying covariates are not usually available. 

The lack of time-varying covariates is important because if covariates 
do not vary across transitions, then one must assume a distribution for 
the error term in order to identify the model with transition-specific pa- 
rameters. Further, if there are no time-varying covariates, the only factor 
inducing differences on the dependent variable across transitions is the 
error term. Cameron and Heckman argue that under these conditions the 
sequential decision-making process underlying the educational transitions 
logit model implies students assume that their current school experience 
perfectly predicts their future school experience, and thus students act 
only on contemporaneous conditions. Cameron and Heckman regard this 
as an untenable assumption of student myopia.* In response, they offer 
what they regard as a rational choice perspective in which persons know 
their endowments, know the costs of investments (including foregone earn- 
ings), and know the payoffs that attend certain levels of education. Using 


* Cameron and Heckman argue that under the sequential model with time-invariant 
covariates, the error term ($e$) is the only person-specific factor varying over transitions. 
The person-specific, across-transition distribution of $e$ is a martingale; the most 
important implication of this is that $E(e_t) = e_{t-1}$, i.e., if $e$ is a martingale, then persons 
use their most immediate experience to decide on their next move, ignoring experiences 
earlier than $t - 1$. Because the sequential decision-making model with time-invariant 
regressors implies that persons make decisions sequentially, and the only factor 
inducing variation is a martingale in this situation, Cameron and Heckman contend that 
the model implies myopic agents. 

this information, persons obtain just enough schooling to maximize net 
benefits.’ 

Cameron and Heckman propose a statistical model that reflects their 
preferred behavioral assumptions, an ordered discrete choice model with 
multiplicative heteroskedasticity. The dependent variable for their anal- 
ysis appears to be years of school completed, but the discrete choice model 
relaxes the assumption of equal intervals between years. Were one to 
assume that the error was normally distributed and homoskedastic, their 
model would become the ordered probit model, with years of schooling 
as the dependent variable. Thus, essentially Cameron and Heckman are 
calling for a return to years of school completed as the focus of analysis 
and suggesting that a different functional form will resolve the problems 
that attend using OLS regression. They allow for heteroskedastic errors 
to allow for differences they do not observe; they assert that a major 
portion of the unobserved heterogeneity reflects differences in ability. 
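
The structure of such a model can be illustrated with a minimal sketch. It assumes normally distributed errors, illustrative cutpoints, and a scale term of the form exp(z'g), a common way to write multiplicative heteroskedasticity that is adopted here for illustration rather than taken from Cameron and Heckman. The function name, index value, and thresholds are all hypothetical.

```python
import math

def normal_cdf(z):
    """Standard normal CDF computed from the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def ordered_choice_probs(index, cutpoints, scale=1.0):
    """Probabilities of each ordered schooling category for one person.

    index     : linear predictor x'b (hypothetical value)
    cutpoints : increasing thresholds separating adjacent categories;
                they need not be equally spaced
    scale     : error standard deviation; scale=1 gives the ordered probit,
                scale=exp(z'g) is the assumed heteroskedastic variant
    """
    cdf = [normal_cdf((c - index) / scale) for c in cutpoints]
    bounds = [0.0] + cdf + [1.0]
    return [bounds[k + 1] - bounds[k] for k in range(len(bounds) - 1)]

cutpoints = [-1.0, 0.0, 1.5]  # illustrative thresholds, not estimates
homoskedastic = ordered_choice_probs(0.4, cutpoints, scale=1.0)
heteroskedastic = ordered_choice_probs(0.4, cutpoints, scale=math.exp(0.5))
```

With the larger error scale, more probability falls in the lowest and highest categories even though the index and cutpoints are unchanged, which is how the heteroskedastic term absorbs differences the covariates do not capture.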

On the basis of estimates derived from their preferred model, Cameron 
and Heckman argue against expanding financial support for college. They 
contend that such support entices students from the bottom of the dis- 
tribution of their unobserved variable to continue schooling. Given their 
assertion that the unobserved variable is ability, they suggest that though 
those induced to continue schooling with increases in financial aid are 
more productive than they would otherwise be, the productivity gains to 
their continued education fall short of the productivity gains typically 
observed for college entrants. Thus, they contend, financial aid for college 
should not be expanded. 

In sum, Cameron and Heckman reject the educational transitions lit- 
erature. They essentially argue that without time-varying covariates one 
cannot identify the education transitions model with transition-specific 
parameters independent of the assumption for the distribution of the error 
term. But solving the identification dilemma by assuming a distribution 
for the error term leaves the error term to induce all over-time variation 
in the dependent variable. This implies that as soon as the error term 
pushes a person below the school continuation threshold, the person falls 


‘I have attempted to use the term Cameron and Heckman use—rational 
choice—without joining the debate about the meaning of the term. Suffice it to say, 
their reasons for assuming that students use a particular kind of information in a 
particular way are clear, but I see no analytic justification for labeling use of that 
information in that particular way as rational. The assumption is what it is, regardless 
of whether one judges behavior that matches the assumption as rational or not. And, 
as an unsubstantiated assumption, it is as subject to Manski’s (1993) cogent criticism 
of social scientific presumptions about how students form expectations, as are any of 
the analyses Cameron and Heckman critique. Thus, in this article, I will attempt to 
defend the behavioral assumption that underlies the model used but will offer no 
judgment as to the rationality of the behavior. 

out of school, even if a later error term would have been such that they 
would have accrued substantial gains upon continuing. Cameron and 
Heckman argue that this is a restatement of myopia, and thus the be- 
havioral theory behind the sequential decision-making model requires 
myopic agents. Thus, they reject the sequential decision-making model 
and the educational transitions literature. (See appendix B for a more 
technical explication of their position as well as my response.) 

However, although Cameron and Heckman argue that the sequential 
decision-making model creates the assumption of myopia, in actuality, the 
sequential decision-making model in concert with the lack of time-varying 
covariates creates the assumption of myopia; the presence of both features 
is necessary in order to force the error term to take the decisive role in 
creating over-time variation in a student’s decision of whether to stop or 
continue schooling. Cameron and Heckman decide to retain the idea that 
observed characteristics are stable over time and change the functional 
form of the model to jettison the implication of myopia. But their response 
seems arbitrary in the face of research on tracking showing that students’ 
structural locations change, implying that their performance—a key piece 
of information—can also in principle change over time. 

Cameron and Heckman call for researchers to use statistical models 
that reflect defensible behavioral models and processes. A response con- 
sistent with their counsel would be to retain the sequential decision-mak- 
ing behavioral model and add theoretically defensible time-varying cov- 
ariates to the statistical model. That is the approach taken here, because 
evidence shows that the process of educational attainment in the United 
States occurs in a stratified curriculum that requires sequential decision 
making. Once time-varying covariates are allowed, the educational tran- 
sitions model with transition-specific parameters is identified without re- 
course to an assumption for the distribution of the errors. Once it is 
identified, person-specific variance in the dependent variable over time 
is not driven only by the error term. Once the error term does not drive 
the variance in Y, the claim of myopia is not sustained. 
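
A minimal sketch of a sequential transitions model with one time-invariant and one time-varying covariate may help fix ideas. The coefficients, covariate values, and function names are hypothetical and chosen only for illustration; they are not estimates from any data.

```python
import math

def logistic(v):
    return 1.0 / (1.0 + math.exp(-v))

def attainment_distribution(background, performance, coefs):
    """Return P(stopping after passing exactly k transitions), k = 0..K.

    background  : time-invariant covariate (same value at every transition)
    performance : time-varying covariate, one value per transition
    coefs       : one (intercept, b_background, b_performance) tuple per
                  transition, so effects may differ across transitions
    """
    reach = 1.0  # probability of being at risk of the current transition
    dist = []
    for (a, b_bg, b_perf), perf in zip(coefs, performance):
        p_continue = logistic(a + b_bg * background + b_perf * perf)
        dist.append(reach * (1.0 - p_continue))  # stops at this transition
        reach *= p_continue                      # continues to the next one
    dist.append(reach)                           # passes every transition
    return dist

# Hypothetical values for three transitions.
coefs = [(2.0, 0.8, 0.5), (0.5, 0.6, 0.9), (-0.5, 0.4, 1.2)]
performance = [0.0, 0.3, -0.2]
dist = attainment_distribution(1.0, performance, coefs)
```

Because performance changes from one transition to the next, a person's continuation probability can rise or fall over time for observed reasons, rather than through the error term alone.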

The observations above are sufficient to support the work that follows. 
However, several other important observations concerning the existing 
critiques of the educational transitions approach need to be made for at 
least two reasons. First, these observations will suggest further dangers 
analysts must avoid should they seek to conduct educational transitions 
research specifically. Second, some of the critiques actually violate gen- 
erally good research practice and thus may have created much confusion. 


Myopia 


Despite the discomfort of some with the prospect of myopic decision 
making, all students pursue options as the sequence of opportunities un- 
folds. All students select from among immediately visible options whose 
long-term implications they cannot accurately estimate and perhaps would 
not know how to begin to evaluate (Manski 1993). The extent to which 
decisions are myopic is likely dependent on the resources a student has 
to gather pertinent information. All students have records of their per- 
formance in courses by which to judge the appropriateness of continued 
study at different levels. However, socioeconomically advantaged students 
have college-experienced parents, and students in college track courses 
have teachers pointing them to college with time-tested strategies of suc- 
cess. Other students lack these resources. Access to knowledgeable parents 
and teachers makes savvy and strategic forward-looking behavior more 
likely, while lack of access to these resources diminishes its likelihood. 

Thus tracking research, and a line of reasoning consistent with it, sug- 
gests that students—even socioeconomically advantaged students—make 
decisions both sequentially, as Mare contends, and potentially myopically, 
as Cameron and Heckman deny (Powell, Farrar, and Cohen 1985). How- 
ever, myopia is likely to be a differentially allocated feature of students’ 
educational decision making. If one considers the process of schooling, 
one may adopt modeling strategies that make it unnecessary to assume 
myopia in the modeling process. However, myopia may be present even 
if it is not assumed; indeed, known and commonly measured social factors 
such as parents’ education and students’ track location are associated 
with the extent to which students are exposed to persons who can dispel 
the myopic elements of their decision making. Thus the social background 
coefficients from a sequential decision model may reflect in part the el- 
ement of myopia. Similarly, the social background coefficients from an 
ordered probit model of years of school completed with heteroskedastic 
errors may reflect in part the element of myopia. 


Appropriate Parameters to Estimate 

One criticism of educational transitions analysis is that the waning logit 
coefficients pattern is the result of a particular parameterization of the 
relationship between social background and educational attainment. The 
case for the logit coefficient as an appropriate index of the relationship 
between social background and attainment is based on the claim that 
logit coefficients are not contaminated in the same way as OLS coefficients 
are. However, De Graaf and Ganzeboom (1993) note that although OLS 
coefficients are an amalgam of different processes, including socioeco- 
nomic upgrading of parents’ characteristics and expansions in educational 
systems, they remain descriptively accurate for the entire cohort, whereas 
logit coefficients for later transitions do not. Mare (1993) agrees but main- 
tains that logit coefficients are more structural than OLS coefficients, 
because logit coefficients reflect the way that educational attainment ac- 
tually occurs, namely, through an unfolding sequence of decisions. 

Clearly, different summaries of nonlinear relationships can lead to dif- 
ferent substantive conclusions (e.g., Long 1997). However, as Long sug- 
gests, the fundamental focus of many analyses is the probability of a 
discrete outcome. Advances in computing have facilitated determining 
how changes in independent variables are associated with changes in the 
probabilities of an outcome. As probabilities are the fundamental focus 
of many analyses, analysts may want to make probabilities the funda- 
mental focus of presentation. Indeed, in order to assess EMI, it will be 
necessary to focus on how probabilities differ for students with different 
social background characteristics, rather than on coefficients from the 
model. 
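
The point that coefficients and probabilities can tell different stories can be seen in a small worked example. The intercepts and the coefficient below are hypothetical; the sketch only shows that one and the same logit coefficient corresponds to very different probability gaps depending on the baseline.

```python
import math

def logistic(v):
    return 1.0 / (1.0 + math.exp(-v))

def probability_gap(intercept, coef, low=0.0, high=1.0):
    """Difference in predicted probability between two covariate values."""
    return logistic(intercept + coef * high) - logistic(intercept + coef * low)

coef = 1.0                               # identical logit coefficient in both cases
gap_mid = probability_gap(0.0, coef)     # baseline probability of 0.50
gap_high = probability_gap(3.0, coef)    # baseline probability of about 0.95
```

Here the gap is roughly 0.23 at the middling baseline and roughly 0.03 at the high baseline, although the coefficient is 1.0 in both cases.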


Appropriate Dependent Variables to Study 


When Cameron and Heckman resurrect years of schooling as the depen- 
dent variable of interest for investigation of educational attainment, they 
alter the functional form of the model to avoid estimating OLS coefficients; 
they also inadvertently suppress potentially important qualitative differ- 
ences in schooling even more than education transitions analyses do. Al- 
though they allow the scaling distances between years of schooling to 
differ, this forces all qualitative differences to be translated into the un- 
idimensional scale of years of school completed. This translation may 
obscure more than it reveals, because persons who obtain the same num- 
ber of years of schooling, but do so in different types of programs, will 
be assigned the same value on the dependent variable. This may lead 
some persons to be misclassified, in that a proper translation of their 
educational attainment into years of schooling would put them higher or 
lower on the scale than their nominal years in school actually does. Be- 
cause of these difficulties, even with the altered functional form, Cameron 
and Heckman lose the ability to investigate whether social background 
matters, not only for how much education a person receives, but also 
what kind of education a person receives. As the research discussed above 
suggests, however, this is an important question that is directly related 
to the concerns, if not the common analysis, of educational transitions 
researchers. 

The existence of the stratified curriculum certainly suggests that a return 
to years of school completed may mask important pathways through 
which social background has its effects. Yet, adding qualitative dimen- 
sions to educational transitions analyses may complicate cross-national 
comparison. A central motivation for the development of the educational 
transitions tradition was the desire to conduct cross-national comparative 
analysis, and thus this limitation is an important one. Many qualitative 
characteristics may be country-specific, and even for those that are com- 
mon across nations (e.g., the existence of stratified curricula), the actual 
categories and the meaning of the categories may differ markedly from 
country to country. Balancing the desire to make cross-national compar- 
isons, on the one hand, with the aim of accurately capturing important 
country-specific features, on the other, is of course a challenge. Most likely, 
approaches that de-emphasize qualitative dimensions can aid in estimat- 
ing the total effect of social background on the amount of educational 
attainment; these analyses may be most directly comparable across 
nations. This seems to be the strategy followed to great effect by Shavit, 
Blossfeld, and colleagues. However, to discern the mechanisms producing 
the pattern of social background effects, it is likely we will need to attend 
to the nuances of particular educational systems.* 


Unobserved Heterogeneity 


An additional criticism of educational transitions research contends that 
unobserved heterogeneity may make the waning coefficients pattern an 
incorrect representation of students’ experience (e.g., Mare 1993). In some 
analyses, omitted variables must be correlated with independent variables 
included in the model in order to bias results, but in other analyses, 
including education transitions research, biases arise even if the omitted 
variable is not correlated with the included independent variables (Vaupel 
and Yashin 1985). 
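
A small numerical sketch illustrates the selection mechanism. It assumes a binary background variable x, an unobserved factor u that takes two values with equal probability and is independent of x at the outset, and a true effect of x that is identical at every transition. All values are hypothetical, and the size and direction of the distortion depend on them.

```python
import math

def logistic(v):
    return 1.0 / (1.0 + math.exp(-v))

def logit(p):
    return math.log(p / (1.0 - p))

B = 1.0                  # true effect of x at every transition (assumed)
U_VALUES = (-2.0, 2.0)   # unobserved factor, independent of x at the start

def observed_effects(n_transitions=2):
    """Log-odds gap by x at each transition when u is omitted."""
    # weights[x][u] = mass of the x group with value u still at risk
    weights = {x: {u: 0.5 for u in U_VALUES} for x in (0, 1)}
    effects = []
    for _ in range(n_transitions):
        pass_rate = {}
        for x in (0, 1):
            total = sum(weights[x].values())
            passed = {u: w * logistic(B * x + u) for u, w in weights[x].items()}
            pass_rate[x] = sum(passed.values()) / total
            weights[x] = passed  # only those who pass face the next transition
        effects.append(logit(pass_rate[1]) - logit(pass_rate[0]))
    return effects

effects = observed_effects()
```

Under these assumed values the observed gap is smaller at the second transition than at the first, and both fall short of the true effect, even though the true effect is constant and u was uncorrelated with x in the full cohort. Survivors with low x are more heavily selected on u, which induces an association among those still at risk.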

Analysts have long been concerned with unobserved heterogeneity. A 
big improvement is to use data that would allow one to directly measure 
the factors that are typically unobserved, if measurement is possible. Using 
good data is a helpful response because unobserved heterogeneity is an 
omitted variable problem. However, any finite set of regressors leaves a 
vast untapped reservoir of additional variables with the potential to create 
bias by their omission. Hence, although this strategy may be useful, it 
cannot dismiss the possibility that findings are driven by unobserved 
heterogeneity. 

Thus, even when good data are available, some unobserved hetero- 
geneity may remain, in which case one may adopt one of the many prom- 


‘I am thankful to an anonymous reviewer for drawing my attention to these 
implications. 

ising statistical tools analysts have devised to account for unobserved 
heterogeneity when indicators for the potentially biasing variables are 
lacking. Such methods include the use of sibling models (e.g., Mare 1993; 
Mare and Chang 1998), parametric mixture distributions (e.g., Manton, 
Stallard, and Vaupel 1981), nonparametric mixture distributions (e.g., 
Heckman and Singer 1982), and a general nonparametric latent variable 
approach (Vermunt 1997). Each of these approaches is promising, but 
each has limitations as well. Limitations for the different approaches 
include variously the possibility that one may overcorrect for unobserved 
heterogeneity and thus produce results biased in the opposite direction, 
the necessity of assuming that unobserved heterogeneity is independent 
of included covariates, and sensitivity to the functional form assumed for 
the time-dependence or mixture distribution (e.g., Griliches 1977, 1979; 
Yamaguchi 1991). 

However, taken together, these methods have an important use, for 
with these methods, it would be possible and advisable to calculate bounds 
on the parameters of interest, as Manski (1995) proposes. The calculation 
of bounds and detailed investigation of the sensitivity of results to different 
assumptions about unobserved heterogeneity is an effort worth conduct- 
ing, but the purpose of this article is different. Here I seek to complete 
an essential precursor to such a project, for before assessing the sensitivity 
of results to a variety of different assumptions concerning unobserved 
heterogeneity, it is important to determine whether the framework under 
investigation has promise sufficient to justify deeper investigation. 

The strategy I adopt in this article, therefore, is to include a lengthy, 
theoretically defensible set of covariates (e.g., Gamoran and Mare 1989). 
As I later describe, I will attempt to use a healthy set of time-varying 
and time-invariant covariates, given the possibilities and limitations of 
the data, the data collection strategy, and sociological theory. I admit this 
approach cannot remove the possibility of unobserved heterogeneity, but 
because it is grounded in the current state of sociological knowledge, as 
we learn more over time, the specification can be adjusted explicitly. 
Further, should the findings prove sociologically interesting, the baseline 
established by this investigation can and should be probed with detailed 
analysis, using a variety of specifications for unobserved heterogeneity, to 
determine the sensitivity of results to different assumptions. 

Still, it should be noted that statistical developments may soon ease the 
treatment of unobserved heterogeneity. As these advances cumulate, it is 
extremely important that analysts refrain from making inferences beyond 
what such techniques allow. It is important to remember that the unob- 
served heterogeneity claim is an arguably admirable admission of igno- 
rance. It means we do not know what induces the observed variance. 
Despite this obvious point, analysts often strain to attribute some positive 
content to the unobserved component. A case in point is the Cameron 
and Heckman (1998) analysis. The analysis asserts that unobserved (by 
the analyst) heterogeneity is largely ability, and policy prescriptions are 
then based on this assertion. By definition, however, unobserved heter- 
ogeneity could be any of a number of factors, and the relative weight of 
those factors cannot be established so long as they remain unobserved. 
Under these conditions, analysts should refrain from making policy pre- 
scriptions that depend on identifying the content behind unobserved 
heterogeneity. 


Data Selection and Handling 


Although the discussion of unobserved heterogeneity touched on the se- 
lection of data, additional issues make data selection an important concern 
for educational transitions research. The elegance and, consequently, 
apparent simplicity of educational transitions analysis belie the high data 
requirements for effective research. Both Cameron and Heckman (1998), 
implicitly, and Mare (1993), explicitly, call for analysts to attend closely 
to the data requirements of such modeling, which depend in part on the 
theories one seeks to assess. If the data requirements for a theory are not 
met, the theory cannot be evaluated. For example, in order to assess the 
key claim of LCP, one must use data that do not allow persons decades 
to complete their schooling. Otherwise, the LCP explanation is true but 
trivial; educational and other life pursuits of older persons are virtually 
always made in the context of weakened power of parental status char- 
acteristics relative to the power of those characteristics when the child 
was not yet an adult. But the life-course explanation of Müller and Karle 
is not trivial; it has sociological punch owing to its potential to explain 
why disadvantaged youngsters continue their schooling in such numbers 
that the logit coefficients for social background decline across transitions. 
In order to assess this claim, analysts either need to restrict their analysis 
to relatively young potential students, or use information on the timing 
of educational transitions in their analysis. Unfortunately, many educa- 
tional transitions analyses have used samples of the general adult pop- 
ulation that lack information on the timing of educational transitions. 
Thus, they do not meet the data requirements for assessing LCP and can 
provide no evidence with respect to the validity of the LCP claim. 

Again, Cameron and Heckman illustrate how problems ensue when 
data are not matched to the research question. Cameron and Heckman 
conclude that widening financial aid availability is unwise because it 
induces low-ability students to continue schooling. Setting aside the earlier 
observation that assertions concerning the content of unobserved heter- 
ogeneity are by definition problematic, Cameron and Heckman cannot 
speak to policy generally because of several other errors in data analysis. 

First, they restrict their analysis to white males, thus preventing their 
analysis from reflecting parameters of the process generally considered. 
This should reduce one’s faith in the general applicability of the policy 
prescription they offer, although on the basis of their analysis, one might 
infer that one should restrict the financial aid available to white males. 

However, Cameron and Heckman also delete all cases missing on any 
variable, a tactic that necessitated removal of approximately one-third of 
the white males. This approach to missing data substantially lowers the 
likelihood that their analysis can generalize even to the population of 
white males.’ 
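
How much uncertainty the deletion of one-third of cases can introduce is easy to illustrate with a worst-case bound on a simple proportion. The sketch assumes nothing about why cases are missing; the observed rate of 0.6 is hypothetical, and the function name is introduced here for illustration only.

```python
def worst_case_bounds(observed_rate, share_missing):
    """Bounds on a population proportion when some cases are deleted.

    observed_rate : proportion with the outcome among retained cases
    share_missing : share of the sample deleted for missing data
    """
    observed_share = 1.0 - share_missing
    lower = observed_rate * observed_share  # every deleted case lacks the outcome
    upper = lower + share_missing           # every deleted case has the outcome
    return lower, upper

# Hypothetical observed rate; one-third of cases deleted.
lower, upper = worst_case_bounds(0.6, 1.0 / 3.0)
```

Without assumptions about the deleted cases, the population proportion is known only to lie in an interval whose width equals the share deleted, here one-third.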

An additional data problem is that the bulk of their analysis is based 
on Occupational Changes in a Generation (OCG) data. Thus, Cameron 
and Heckman study educational transitions for persons who may have 
completed their education at any time within a window spanning several 
decades. Given that OCG respondents’ reports may summarize educa- 
tional experiences that occurred decades after leaving their parents’ house- 
hold, the Cameron and Heckman analysis is insufficiently precise to ad- 
dress the policy issue of the role of financial liquidity in constraining 
college entrance. That is not to say that the finding would not be replicated 
with better data, but it is to say that the Cameron and Heckman analysis 
fails to address the question.? 

The Cameron and Heckman analysis is unable to address their research 
question; this is unfortunate because they raise an intriguing possibility. 
In general, then, analysts need to attend to the theoretical and substantive 
claims they seek to address and ensure that the methods and data are 
sufficient to address them. 


' For analogous analyses led astray in part by problematic treatment of missing data, 
see Herrnstein and Murray 1994; for a correction, see Fischer et al. 1996, and for a 
discussion of strategies for handling missing data and the implications of failures to 
effectively treat missing data, see Little and Rubin 1987. 

* However, if we ignore both the deletion of nonwhites and women and the mistreat- 
ment of missing data, the data may be appropriate to assess their explanation. This 
is so because it is likely that, all else equal, a person who has several decades of 
adulthood in which to make further investments in education will make those decisions 
based on more reflection and knowledge than will a youth who has little experience 
with the world of work. A 45-year-old person considering their own educational in- 
vestments will probably more closely approximate the nonmyopic agent than will the 
17-year-old student of the American high school. The implication of this observation, 
however, is that the Cameron and Heckman analysis cannot comparatively assess their 
theory and others, because they inadvertently used data that are biased in favor of their 
explanation. 


Evaluating Educational Transitions Research: Brief Summary of Key 
Responses 


The foregoing discussion has offered several observations on the conduct 
of education transitions research. Analysts have raised many questions 
about education transitions analyses, but three are most prominent: (1) 
how can one account for unobserved heterogeneity, (2) how can one iden- 
tify transition-specific effects without making an ad hoc assumption, and 
(3) how can one select appropriate parameters to index social relations? 

Unobserved heterogeneity is an inescapable possibility in every analysis. 
In the education transitions approach, it is certainly more difficult to 
protect against because an omitted variable may bias results even when 
that variable is not associated with included independent variables. In 
this situation, there are four possible responses. One response is to stop 
conducting education transitions research. However, our knowledge of 
tracking suggests that sequential decision making produces educational 
attainment. Were we to halt education transitions research, our statistical 
model, rather than our knowledge of social processes, would be driving 
our investigatory strategy. Our knowledge of social processes suggests that 
the educational transitions framework, or something very much like it, 
is needed, and thus efforts to address the unobserved heterogeneity prob- 
lem seem worthwhile. 

In that vein, a second response is to attempt to account for unobserved 
heterogeneity using one of the methods developed for missing data anal- 
ysis. A third approach is to attempt to account for unobserved hetero- 
geneity by including a lengthy, theoretically defensible set of covariates. 
Neither approach offers certainty that one has successfully removed unob- 
served heterogeneity, for both may miss important heterogeneity. 

A fourth approach is to use a variety of methods, either separately or 
together, to establish bounds on parameters and to investigate the sen- 
sitivity of findings to assumptions concerning unobserved heterogeneity. 
In the present case, drawing on existing theories and within the limits of 
the available data, I use the third approach. Should the findings prove 
of sociological interest, detailed investigation of unobserved heterogeneity 
using a variety of specifications would be in order. 

To avoid ad hoc identifying assumptions, findings from tracking re- 
search should be considered. These findings not only affirm that students 
follow a sequential decision-making process as they navigate schools, but 
also reveal a higher incidence of track mobility than previously existed. 
This suggests the potential importance of the time-varying covariate of 
performance. Because time-varying covariates identify the education tran- 
sitions model with transition-specific parameters, one need only invoke 
the substantively defensible assumption that grades matter, and use mea- 
sures consistent with this assumption, to identify the model. 

Different summaries of nonlinear relations may lead to different con- 
clusions. Given this observation, one response is to highlight the funda- 
mental unit of measure underlying the analysis, which for education tran- 
sitions analysis is the probability of obtaining particular categorical 
outcomes. 

These three questions have been raised, and responses to them have 
been offered. Although the first and third questions and responses are 
important, the foundational basis of the analysis concerns the second, the 
fusing of educational transitions and tracking research. It is that fusion 
that highlights the time-varying covariate of performance, facilitates the 
comparative assessment of existing explanations of school continuation 
(see below), and allows investigation of a comprehensive explanation for 
both school continuation and track location. The following pages describe 
the methods used to address the questions this fusion brings to the fore, 
relate the results of the analysis, and close with concluding remarks. 


ANALYSIS PLAN 


The prime aim of the analysis is to assess effectively maintained inequality. 
Before doing so, however, it is possible and useful to evaluate LCP and 
MMI comparatively, for several reasons. First, MMI and LCP are existing 
explanations for this subject area. Thus, I would be remiss to ignore them 
at this juncture. Second, MMI and LCP have not been studied in an 
analysis that fuses tracking and educational transitions. Yet, as I will note 
below, it is just this kind of analysis that is needed to comparatively assess 
MMI and LCP. Third, the same model that produces the information 
needed to evaluate EMI also produces the information needed to com- 
paratively assess MMI and LCP. Thus, it will not be necessary to estimate 
multiple models to consider the different explanations. Yet, because as- 
sessing the different explanations entails perusing different types of pa- 
rameters (from the same model), it will be most effective to proceed by 
first comparing MMI and LCP and then turning to consideration of EMI. 
For these reasons, therefore, I will spend some time investigating MMI 
and LCP, even as the bulk of the analysis is focused on considering the 
evidence for and against effectively maintained inequality. 


Conditions for Comparing MMI and LCP 


LCP and MMI do not have mutually exclusive implications for the total 
effect of social background under all circumstances. However, they do 
imply potentially different patterns of direct effects under certain con- 
ditions. LCP implies that the direct effect of social background declines 
across transitions, for it is the direct effect that captures parents’ contem- 
poraneous role in students’ successful completion of a particular transi- 
tion. Whatever that role, LCP implies it is lower for later transitions than 
for earlier ones. In contrast, MMI implies that direct effects will be re- 
sponsive to shocks to the system, rising or falling depending upon the 
changing structure of social support for different stages of education. If 
so, it is possible for direct effects on later transitions to exceed the direct 
effects for earlier ones, if the shock to the relevant transition(s) is strong 
enough. 

Still, under general conditions, it is not necessarily possible to differ- 
entiate between LCP and MMI explanations. However, Reagan-era policy 
change increased the financial costs of attending college for early 1980s 
cohorts (e.g., Evangelauf 1987). Of course, the Reagan-era policy change 
may not have been sufficient to raise the effect of social background on 
college entry above the effect of background on high school graduation. 
But many elements made up the Reagan-era reforms. The 1981-82 ac- 
ademic year marked the start of a period of sharply rising tuitions. In 
addition, that year saw the introduction of decreases in the availability 
of grants, increases in the interest rate for Guaranteed Student Loans 
(GSLs; from 7% to 9%), and the inception of a 10% surcharge on GSLs 
(points). The magnitude of these changes certainly suggests that the era 
provides a chance to discern a greater effect of social background on 
college entry than on earlier transitions. Thus, although the analysis can- 
not provide the proverbial “crucial experiment” that will adjudicate for- 
ever between MMI and LCP, the conditions of the Reagan era do seem 
to provide an opportunity to assess them comparatively. 

However, in order to estimate the direct effect of social background, it 
is important to account for the role of social background in allocating 
students to different locations in the stratified curriculum in earlier years. 
Otherwise, indirect effects of social background that work through prior 
placement will be inadvertently included in estimates of the direct effect 
of social background. In order to compare LCP and MMI, therefore, it 
is necessary to study school continuation and students’ track mobility 
simultaneously, for doing so facilitates proper estimation of the direct 
effects of social background. 


The Sample 


The investigation uses the 1980 sophomores from High School and Be- 
yond (HS&B). Analyzing these data is particularly advantageous for pur- 
suing our concerns, for these data include the variables necessary to es- 
timate direct effects of social background and, owing to the period of data 
collection, these data allow one to consider the transition to college for a 
cohort that encountered both higher tuitions and a public policy envi- 
ronment that had just become decidedly less generous with financial aid 
(e.g., Evangelauf 1987). Note that the legislative changes mentioned above 
were all passed after January 1981 and took effect for the 1981-82 aca- 
demic year. Thus, one advantage of using this sample is that not only did 
support decline, but also the decline was decidedly abrupt. Moreover, by 
using only the 1980, 1982, and 1986 waves, this analysis is constructed 
so that it is plausible to regard social background effects as indexing the 
relevance of parental characteristics for youths’ educational attainment. 
Further, these data include information on the curricular tracks students 
followed, as well as the grades they obtained. Taken together, these fea- 
tures are essential for comparing LCP and MMI. 

HS&B used a complex sample design. Thus, standard errors are ad- 
justed using a design effect of 2.4, which reflects a slight upward ad- 
justment from the design effect of 2.19 estimated by Sebring et al. (1987). 
Moreover, weights were constructed to account for nonresponse; I use 
weights in the analysis. 
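
As a rough illustration of the adjustment just described, the sketch below inflates a naive standard error by the square root of the design effect. It assumes the design effect is a ratio of variances, so that standard errors scale by its square root; the function name and the example standard error are hypothetical and do not come from the HS&B documentation.

```python
import math

DESIGN_EFFECT = 2.4  # value used in the text; Sebring et al. (1987) estimated 2.19


def adjust_standard_error(naive_se: float, design_effect: float = DESIGN_EFFECT) -> float:
    """Inflate a simple-random-sample standard error for a complex design.

    Assumption: the design effect is a variance ratio, so the standard
    error is multiplied by its square root.
    """
    if design_effect <= 0:
        raise ValueError("design effect must be positive")
    return naive_se * math.sqrt(design_effect)


# Hypothetical naive standard error, for illustration only
adjusted = adjust_standard_error(0.010)
print(f"adjusted SE: {adjusted:.4f}")
```

Under this assumption, rounding the design effect up from 2.19 to 2.4 makes every significance test slightly more conservative.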


Methods 


Appendix A describes the measurement of social background, measured 
achievement, and performance, and table 1 provides descriptive statistics 
for these variables. It is necessary to use at least one time-varying covariate 
in this analysis. The measures of performance—grades—are time-varying 
covariates obtained from students’ transcripts. HS&B also contains sev- 
eral other time-varying measures, including standardized measures of 
achievement in the form of tenth and twelfth grade tests, measures of 
student behavior such as cutting class, and measures of student experience 
with the disciplinary authorities of the school such as suspension. Each 
of these indicators, however, has some problems. First, students did not 
have access to their HS&B test scores, and thus neither their scores nor 
changes in the scores could have informed their decision making. More- 
over, some argue that achievement tests actually measure stable ability. 
To give this position the benefit of the doubt, I use only the tenth grade 
test scores and thus treat standardized measured achievement as constant 
over time. 

The indicators of class cutting, suspensions, and other involvement with 
disciplinary authorities are only partially time varying, because they were 
recorded during the two survey years (grades 10 and 12) but not in grade 
11. Thus, to include these variables as time-varying covariates is to specify 
different lag structures for different academic years, which might differ- 
entially bias coefficients for the different transitions. Further, analysts 
have yet to settle whether cutting class, involvement with disciplinary 
school authorities, and other correlates of delinquency are causes or con- 
sequences of track placement (e.g., Wiatrowski et al. 1982; Berends 1994). 
Indeterminacy about causal direction problematizes the inclusion of these 
variables in this analysis. 


TABLE 1 

VARIABLE AND VALUE                  Grade 11    Grade 12     College 

Father’s education: 
  < High school graduate                .218        .207        .202 
  High school graduate                  .289        .291        .291 
  Some college                          .204        .209        .212 
  College graduate or more              .203        .209        .214 
  Missing                               .087        .084        .081 
Mother’s education: 
  < High school graduate                .182        .171        .162 
  High school graduate                  .409        .414        .419 
  Some college                          .220        .223        .226 
  College graduate or more              .135        .139        .141 
  Missing                               .055        .054        .052 
Father’s occupation: 
  Mean                                38.968      39.317      39.518 
  SD                                  17.703      17.764      17.807 
  Proportion missing                    .084        .083        .079 
Family earnings: 
  Mean                            25,697.376  25,957.542  26,157.868 
  SD                              21,262.278  21,655.499  21,961.580 
  Proportion missing                    .081        .061        .047 
Farm background: 
  Yes                                   .046        .047        .047 
  Missing                               .084        .083        .079 
Siblings: 
  Mean                                 2.889       2.852       2.822 
  SD                                   1.631       1.610       1.596 
  Proportion missing                    .127        .131        .135 
“Broken” family: 
  Yes                                   .385        .372        .360 
  Missing                               .132        .099        .074 
[label illegible]                       .485        .483        .482 
Black                                   .097        .095        .095 
Latino/a                                .117        .114        .112 
Math 1: 
  Mean                                15.105      15.267      15.406 
  SD                                   5.269       5.286       5.292 
  Proportion missing                    .162        .148        .140 
Math 2: 
  Mean                                 4.519       4.560       4.597 
  SD                                   1.901       1.925       1.937 
  Proportion missing                    .166        .153        .144 
Science: 
  Mean                                11.335      11.415      11.486 
  SD                                   3.289       3.297       3.294 
  Proportion missing                    .167        .153        .145 
Writing: 
  Mean                                10.819      10.910      10.998 
  SD                                   3.465       3.460       3.437 
  Proportion missing                    .175        .161        .153 
Civics: 
  Mean                                 6.031       6.066       6.099 
  SD                                   1.833       1.843       1.846 
  Proportion missing                    .181        .167        .158 
Reading: 
  Mean                                 9.457       9.538       9.619 
  SD                                   3.543       1.925       3.581 
  Proportion missing                    .160        .146        .139 
Vocabulary: 
  Mean                                11.333      11.420      11.509 
  SD                                   3.888       3.896       3.899 
  Proportion missing                    .161        .147        .139 
No course last year                     .145        .310        .470 
College prep last year                  .529        .385        .270 
Last math grade: 
  Mean                                 2.110       2.142       2.209 
  SD                                   1.139       1.030       1.112 
  Proportion missing                    .007        .006        .005 
Incomplete previous time                .001        .003        .003 
Audited course previous time            .001        .000        .001 
Sample size                           10,292       9,709       9,268 

Note: Proportions of mutually exclusive categories (e.g., father’s education) may not sum to 1 due 
to rounding. 

Students’ destinations are the dependent variable for the analysis. Be- 
cause changes have made tracking a subject-specific phenomenon, it is 
necessary to select a subject for study. The ideal subject to study would 
be central to the curriculum, on the one hand, but would allow students 
some freedom to be allocated to any of the logically possible categories 
(including not taking a course in the subject), on the other. Mathematics 
and English are the two central subjects in American high schools. How- 
ever, in comparison to English, math allows more freedom of placement, 
because it is not required every year in most schools. For the HS&B 
cohort, the modal requirement for English was four years, while the modal 
requirement for math was two. Thus, mathematics is the subject of study 
in this analysis. For this reason also, I use mathematics grades, because 
these grades are probably most salient for math course-taking. As test 
scores in other subjects are controlled, ability in other domains is allowed 
to influence mathematics course-taking. 

I have analyzed students’ movement from grade 10 to grade 11, from 
grade 11 to grade 12, and from grade 12 to college. In moving to grade 
11 and grade 12, students have the possibility of moving to any one of 
four destinations. In each transition, a student may drop out (coded 0), 
in which case the student is not considered for later transitions (which is 
the common treatment of dropouts in educational transitions analyses).* 
Or they may remain in school and take a college preparatory mathematics 
course (coded 3), remain in school and take a noncollege preparatory math 
course (coded 2), or remain in school and take no math course (coded 1). 
Thus, the destination variable is an ordered categorical variable with four 
categories—drop out, no math, noncollege math, or college preparatory 
math. 

The ordered probit model: 

\[
\begin{aligned}
P(y_i = 0) &= \Phi(\mu_1 - \beta'X), \\
P(y_i = 1) &= \Phi(\mu_2 - \beta'X) - \Phi(\mu_1 - \beta'X), \\
P(y_i = 2) &= \Phi(\mu_3 - \beta'X) - \Phi(\mu_2 - \beta'X), \\
P(y_i = 3) &= 1 - \Phi(\mu_3 - \beta'X),
\end{aligned}
\]

where Φ signifies the normal cumulative distribution function (CDF), μ’s 
represent thresholds, X represents a matrix of explanatory variables, and 
β represents a vector of estimated parameters linking variables to the 
outcome, is appropriate for analyzing just such an ordered categorical 
variable. Essentially, the model posits that an interval-level latent variable 
with a standard normal distribution underlies the observed categorical 
variable. The thresholds carve the latent variable into the appropriate 
number of categories. 
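
To make the role of the thresholds concrete, the following sketch computes the four category probabilities implied by an ordered probit for a single linear predictor. It is an illustration under stated assumptions rather than the estimation code used in the analysis: the threshold values and the linear predictor are hypothetical, and the function name is my own.

```python
from statistics import NormalDist

_PHI = NormalDist().cdf  # standard normal CDF


def ordered_probit_probs(xb: float, thresholds: list[float]) -> list[float]:
    """Return P(y = 0), ..., P(y = K) for linear predictor xb and K ascending thresholds."""
    if sorted(thresholds) != list(thresholds):
        raise ValueError("thresholds must be in ascending order")
    cum = [_PHI(mu - xb) for mu in thresholds]  # P(y <= k) for k = 0, ..., K - 1
    probs = [cum[0]]
    probs += [cum[k] - cum[k - 1] for k in range(1, len(cum))]
    probs.append(1.0 - cum[-1])
    return probs


# Hypothetical thresholds and linear predictor, for illustration only
labels = ["drop out", "no math", "noncollege math", "college prep math"]
for label, p in zip(labels, ordered_probit_probs(xb=0.3, thresholds=[-2.0, -0.4, 0.5])):
    print(f"{label:>18}: {p:.3f}")
```

Because the categories partition the latent scale, the four probabilities sum to one for any value of the linear predictor.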

The high school destinations have four categories, but the last desti- 
nation, college entry, has only two categories—either the student did not 
enter college (coded “0”) or the student did enter college (coded “1”). In 


* This is not the same as deleting dropouts from the overall analysis. Former tenth 


proach provides estimates of the correlates of dropping out. Were dropouts deleted 
from the analysis completely this would not be possible. 
reality, colleges are stratified. Yet there is no consensus as to how one 
should measure the stratification of collegiate institutions. Thus, college 
entry is coded as a dichotomous variable. 

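The ordered probit model in this analysis is estimated as 

\[
\begin{aligned}
P(y_{it} = 0) &= \Phi(\mu_1 - \beta_t'X_{it}), \\
P(y_{it} = 1) &= \Phi(\mu_2 - \beta_t'X_{it}) - \Phi(\mu_1 - \beta_t'X_{it}), \\
P(y_{it} = 2) &= \Phi(\mu_3 - \beta_t'X_{it}) - \Phi(\mu_2 - \beta_t'X_{it}), \\
P(y_{it} = 3) &= 1 - \Phi(\mu_3 - \beta_t'X_{it}),
\end{aligned}
\]

for i individuals across t transitions. Note that this means that the thresh- 
olds for all transitions are the same, but the effects of characteristics as 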
well as some of the characteristics themselves are allowed to vary over 
time. Note that parameters for each outcome, including college entry, are 
estimated simultaneously, even though college entry has but two possi- 
bilities. This is addressed, however, by treating the college entry outcome 
as right-censored. College entry is censored because I know whether or 
not a student entered college, but if a student did enter college, I do not 
know where their institution falls in the distribution of collegiate insti- 
tutions. Thus, all that can be done for college entry is to distinguish 
between those who enter college and those who do not. 
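
One way to read the censoring described above is that an entrant contributes the probability of lying anywhere above a cut point on the latent scale, rather than the probability of a specific category. The sketch below expresses that reading as a log-likelihood contribution. Which threshold serves as the cut point is not stated here, so `cut` is left as an argument; the function name and example values are hypothetical.

```python
import math
from statistics import NormalDist

_PHI = NormalDist().cdf  # standard normal CDF


def college_entry_loglik(entered: bool, xb: float, cut: float) -> float:
    """Log-likelihood contribution for a dichotomous, right-censored outcome.

    Assumption: entrants are known only to lie above `cut` on the latent
    scale, so they contribute the whole upper tail.
    """
    p_below = _PHI(cut - xb)
    return math.log(1.0 - p_below) if entered else math.log(p_below)


# Hypothetical linear predictor and cut point, for illustration only
xb, cut = 0.3, 0.5
ll_in = college_entry_loglik(True, xb, cut)
ll_out = college_entry_loglik(False, xb, cut)
print(f"entered: {ll_in:.3f}, did not enter: {ll_out:.3f}")
```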

Also note that each year students who continue in school reach a des- 
tination in the stratified curriculum. This destination is a particular track 
location and serves as the track origin for the students’ next transition. 
Thus, for each year, I constructed two dichotomous variables to capture 
the effects of track location on students’ likelihood of making the next 
transition—one variable scored 1 for students who did not take math and 
zero for others, and one variable scored 1 for students who enrolled in 
college preparatory math and zero for others. Thus, the omitted category 
is students who took noncollege prep math (because dropouts are not at 
risk of entering a grade above the grade they never entered). Note that 
this implies two types of time-varying covariates are included: (1) students’ 
performance and (2) students’ track location the previous year. 
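
The track-origin coding just described can be sketched as follows. The destination codes (1 = no math, 2 = noncollege prep math, 3 = college prep math) follow the coding given earlier; the function and key names are hypothetical labels chosen for the illustration.

```python
def track_origin_dummies(prior_destination: int) -> dict[str, int]:
    """Code last year's destination as two dummies.

    Noncollege prep math (code 2) is the omitted category, so it maps to
    zero on both indicators.
    """
    if prior_destination not in (1, 2, 3):
        # Dropouts (code 0) are not at risk of a later transition.
        raise ValueError("prior destination must be 1, 2, or 3")
    return {
        "no_course_last_year": int(prior_destination == 1),
        "college_prep_last_year": int(prior_destination == 3),
    }


for code in (1, 2, 3):
    print(code, track_origin_dummies(code))
```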


Parameters 


Above, I described the debate concerning which parameters one should 
use to assess the relation between social background and school contin- 
uation. Because the fundamental dependent variable in the analysis is 
students’ probability of reaching particular destinations in the stratified 
educational system, I focus on how the predicted probability is associated 
with differences in social background. 
Indeed, predicted probabilities, not regression-type coefficients, are re- 
quired in order to assess EMI, because effectively maintained inequality 
contends that the advantages that accrue to the socioeconomically well- 
placed are sufficient to secure for them desired outcomes. The dependent 
variable of this analysis is composed of a small set of discrete ordered 
locations; a student experiences a particular outcome if the student exceeds 
the threshold dividing that outcome from the one just below it. Given 
this image, social background becomes effective if it can move people 
over thresholds. 

Regression-type coefficients by themselves cannot reveal whether social 
background moves people over thresholds. Thus, a good question to ask 
is, What kind of evidence would allow us to assess whether empirical 
findings are consistent with a process of effectively maintained inequality? 
One kind of evidence would be whether the effect of social background 
is such that we would predict a different outcome for a person at the low 
end of the social background continuum than one at the high end. A 
common decision rule for predicting discrete outcomes is to predict the 
outcome that has the largest predicted probability. Thus, in order to eval- 
uate whether the effect of social background works to effectively maintain 
inequality, we need to see whether its effect on the predicted probability is 
such that our best prediction will change simply on the basis of differences 
in social background. Surely, it would be possible to find some values of 
covariates for which this is true, but some protection from a search for 
such values is to use theoretically interesting values and investigate 
whether the prediction would change for such focal students. Thus, pre- 
dicted probabilities are the focus of the analysis. 

However, because MMI and LCP were devised with respect to logit 
coefficients (which differ from probit coefficients by a fairly constant mul- 
tiple), it will be useful to briefly consider the findings one would reach 
on the basis of ordered probit coefficients. By comparing the results ob- 
tained by the focus on coefficients to those obtained by considering dif- 
ferences in probabilities, it will be possible to determine whether the 
preference for one or the other explanation is consistent given two different 
ways of indexing effects of social background on school continuation. 
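
The claim that logit and probit coefficients differ by a fairly constant multiple can be checked numerically. The sketch below rescales the logistic CDF by 1.702, a conventional approximation constant that is not taken from this article, and reports the largest gap between the rescaled logistic curve and the normal CDF over a grid of values.

```python
import math
from statistics import NormalDist

_PHI = NormalDist().cdf  # standard normal CDF
SCALE = 1.702  # conventional approximation constant; an assumption, not from the article


def logistic_cdf(z: float) -> float:
    """Standard logistic CDF."""
    return 1.0 / (1.0 + math.exp(-z))


# Compare the two curves on a grid from -5 to 5 in steps of 0.01
grid = [i / 100 for i in range(-500, 501)]
max_gap = max(abs(logistic_cdf(SCALE * x) - _PHI(x)) for x in grid)
print(f"largest gap between the two curves: {max_gap:.4f}")
```

Because the two curves stay this close, conclusions about the relative size of coefficients across transitions should rarely depend on the choice between the two link functions.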


RESULTS 
MMI, LCP, and the Ordered Probit Coefficients 


Table 2 contains the ordered probit (OP) coefficients for a model predicting 
students’ educational transitions. The coefficient for broken family rises 
in absolute value then falls, while the coefficient for farm background 
rises in absolute value across the three transitions. Further, the coefficient 
TABLE 2 
ORDERED PROBIT COEFFICIENTS AND t-VALUES FOR MODELS PREDICTING SCHOOL 
CONTINUATION AND TRACK LOCATION 


Father’s occupation 


Family earnings in $10,000s 


Grade 11 


.004 
(8.624) 

.021 
(6.016) 
—.015 


(-1.571) 
011 
(2.727) 
008 
(1.415) 
.011 
(2.825) 
002 
(645) 
—.065 
(—3.427) 


Grade 12 College 
424 —3 042 
(8.256) (—32 942) 
059 .160 
(5.581) (10.278) 
037 164 
(3.717) (9 938) 
-002 .003 
(3.516) (2.812) 
.005 .043 
(1.762) (8 104) 
—.070 .076 
(—1.566) (1.019) 
—.015 —.041 
(-3.196) (-4.814) 
—.149 —.099 
(-7.395) (3.677) 
131 055 
(7.514) (1.810) 
402 290 
(9.391) (4.350) 
141 ` 

(4.659) (1 318) 
013 .026 
(4.159) (5 937) 
025 .022 
(3.602) (2.024) 
— 002 .003 
(377) (434) 
.006 017 
(1.477) (2.507) 
014 056 
(2.140) (6.127) 
O15 012 

(3 539) (1.855) 
—.002 -031 
(— 456) (5.162) 
—.232 —.115 
(—10.620) (—3.390) 
604 343 
(30.716) (15.626) 
185 .138 
(25.047) (10.061) 
1.738 -000 
(— 017) (000) 
—.680 ALL 




for father’s occupation declines from grade 11 to grade 12 and rises there- 
after, but the coefficient on college entry does not exceed the coefficient 
for grade 11. These complexities occur, however, amid a more general 
story. OP coefficients for parents’ education, family earnings, and siblings 
are higher for college entry than for the other transitions. Indeed, OP 
coefficients for these four indicators of social background decline between 
grade 11 and grade 12, and rise for college. These last four indicators of 
social background suggest that an equalization is reversed; such a reversal 
provides more support for MMI than for LCP. 


MMI, LCP, and School Continuation Probabilities 


To broaden the assessment of the effects of social background on students’ 
school continuation, I graph the predicted probabilities of school contin- 
uation by earnings holding all remaining interval-level factors except pre- 
vious mathematics grade constant at their means; I hold previous math- 
ematics grade constant at a B to signify a capable but not outstanding 
performance. Ordinal- and nominal-level indicators are held constant such 
that the graph represents white females from intact nonfarm families who 
were taking noncollege prep mathematics the previous year. For such 
middling white female students of middling family backgrounds, the ef- 
fects of earnings are nearly linear throughout so that one may compare 
the effects of earnings on different outcomes by comparing the difference 
between the predicted probability for students from extremely poor fam- 
ilies and the predicted probability for students from wealthy families (see 
fig. 3). Owing to the near certain probability of school continuation for 
such eleventh and twelfth graders, the effects of earnings appear trivial, 
approximately one percentage point difference for eleventh grade, and 
smaller for twelfth grade. 
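
The reason an effect can look trivial when continuation is nearly certain is the shape of the normal CDF: the same shift on the latent scale moves the probability very little near 1 and a good deal near 0.5. The sketch below illustrates this with a hypothetical latent shift and hypothetical baseline probabilities; none of the numbers are estimates from the model.

```python
from statistics import NormalDist

_STD = NormalDist()  # standard normal distribution


def probability_gain(baseline_p: float, latent_shift: float) -> float:
    """Change in probability when the latent index rises by latent_shift,
    starting from the point at which the probability equals baseline_p."""
    return _STD.cdf(_STD.inv_cdf(baseline_p) + latent_shift) - baseline_p


shift = 0.25  # hypothetical effect on the latent (probit) scale
for baseline in (0.98, 0.50):
    print(f"baseline {baseline:.2f}: gain {probability_gain(baseline, shift):.3f}")
```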

However, the difference between poverty and wealth is associated with 
a difference of approximately 20 percentage points in the likelihood of 
entering college, certainly a substantively nontrivial difference. This find- 
ing suggests that the effects of earnings are higher for college entry than 
for high school continuation, which is the same inference we drew above 
on the basis of the OP coefficients. And, as above, the evidence favors 
MMI over LCP. 

Table 3 presents differences between the predicted probability of reach- 
ing a particular outcome for the most disadvantaged and most advantaged 
students for each of five social background variables, holding all other 
variables constant as above. Because the underlying dependent variable 
in this analysis is the probability of reaching a particular track destination 
in a given year, the numbers that appear in the “diff” columns, which 
reflect the difference between the probability for the most advantaged 
FIG. 3. Selected destinations and earnings (horizontal axis: family earnings) 


and most disadvantaged students, can reveal the effects of social back- 
ground for particular parts of the educational transitions process. When 
one peruses the table, a common pattern is evident—the effect of social 
background is far larger for college entry than for high school 
continuation. 

For example, the difference in the probability of college entry for those 
with fathers who did not complete high school and those with fathers 
who completed college or more is 0.185 points. In contrast, the same social 
locations differ by only 0.013 points in the probability of continuing in 
school to grade 11 or grade 12. 

Thus, when we consider the pattern of effects of social background on 
school continuation probabilities, we find that social background matters 
more for college entry than for high school completion. This finding is 
consistent with MMI. 

One caveat, however, is that I have not determined whether the pattern 
of social background effects on the probabilities of college entry was 
different for older cohorts, and thus cannot say whether Reagan-era policy 
change lies behind my results. But it is apparent that the pattern of effects 
on school continuation probabilities is more consistent with MMI than 
with LCP, because LCP implies that social background effects on college 
entry will be lower than social background effects on high school grad- 
uation. I do not find this to be true when I consider probabilities of school 
TABLE 3 
PREDICTED PROBABILITIES FOR SELECTED VARIABLES FOR DISADVANTAGED (LOW) AND ADVANTAGED (HIGH) STUDENTS 

[Table body largely illegible in this copy. Legible column headings: Low, High, Diff, repeated for each 
outcome. Legible outcome headings: Grade 11, College Entry. Legible entries under Grade 11, High: 
.981, .986, .985, .987, .985.] 

Note: [...] families, a mother/father who completed college or more education, a father at the 
highest value in the SEI scale, or no siblings. 

continuation, which mirrors the conclusion reached after perusal of the 
OP coefficients. Thus, the preference for MMI over LCP does not depend 
on whether OP coefficients or probabilities are considered. 


Effectively Maintained Inequality 


The key question to ask to assess effectively maintained inequality is 
whether one’s prediction of a student’s destination will change on the 
basis of social background. Consider figure 3 again, which illustrates how 
the probability of college entry differs as family income differs for mid- 
dling female students who are in the noncollege track, from an intact 
nonfarm family, and with mean values on other social background factors. 
The results in figure 3 show that the difference in the predicted probability 
of college entry is enough to move us from predicting that our focal student 
would not enter college to predicting that the student would enter college, 
because the predicted probability of college entry for the poor student is 
below 50%, while the analogous estimate for the wealthy student is above 
50%. 

Of course, estimating the probability of a given outcome by itself will 
not help in other cases, a fact well-illustrated by the bottom of figure 3. 
Figure 3 also shows predicted probabilities for college track location, 
conditional on prior track location and all other variables in the model, 
for grades 11 and 12. The difference between poverty and wealth is as- 
sociated with a difference of nine points in the probability of enrolling in 
college prep math in eleventh grade. This effect is less than half as large 
as the effect of earnings on college entry but substantially larger than the 
effect of earnings on school continuation. Similarly, results in table 3 show 
that the effects of social background on college prep entry approach and 
may sometimes even surpass those for college entry (e.g., the effect of 
broken family status on college prep entry exceeds the effect of broken 
family status on entering college itself). These results allow us to assess 
whether social background effects are higher for college prep assignment 
or college entry, but they do not provide the kind of evidence needed to 
evaluate effectively maintained inequality, primarily because in order to 
assess EMI, one must calculate probabilities for all discrete outcomes. 
When there are more than two discrete possibilities, one must determine 
which outcome has the largest predicted probability. Although none of 
these probabilities may exceed 50%, the largest still identifies our best 
categorical guess. 

Table 4 lists estimated probabilities of experiencing each of the four 
outcomes in grade 11 for our focal students when they have extreme values 
TABLE 4 
PREDICTED PROBABILITIES FOR ELEVENTH GRADE DESTINATIONS FOR DISADVANTAGED 
AND ADVANTAGED STUDENTS 

                          Dropout   No Course   Non-College Prep   College Prep 

Family earnings: 
  Disadvantagedᵃ             .022        .343               .336           .299 
  Advantaged                 .012        .264               .334           .391 
Father’s education: 
  Disadvantagedᵃ             .027        .371               .333           .269 
  Advantaged                 .014        .283               .336           .368 
Mother’s education: 
  Disadvantagedᵃ             .024        .355               .335           .286 
  Advantaged                 .015        .295               .337           .353 
Father’s occupation: 
  Disadvantagedᵃ             .023        .353               .335           .288 
  Advantaged                 .013        .279               .336           .372 
Broken family: 
  Disadvantaged              .026        .367               .333           .274 
  Advantaged                 .019        .328               .337           .315 
Number of siblings: 
  Disadvantagedᵃ             .025        .365               .334           .276 
  Advantaged                 .015        .294               .337           .353 
Socioeconomically: 
  Disadvantagedᵇ             .064        .496               .287           .153 
  Advantaged                 .012        .272               .335           .381 

ᵃ Disadvantaged students are females having, variously, nonintact families, a mother/father who did 
not graduate from high school, a father at the lowest value in the SEI scale, or six or more siblings. 
In contrast, advantaged students (high) are females having, variously, intact families, a mother/father 
who completed college or more education, a father at the highest value in the SEI scale, or no siblings. 

ᵇ Disadvantaged students are females from families with both parents and six children, a mother and 
father who did not graduate from high school, a father whose occupation is approximately that of a 
forging machine operator, and whose family income is approximately $14,936 (in 1980 dollars). Advan- 
taged students are females from intact families with two children, parents who have some college, a 
father whose occupation is approximately that of a general public administrator, and whose family income 
is approximately $36,267 (in 1980 dollars). 


on each separate indicator of socioeconomic background taken in turn.¹⁰ 
Note that this means that a prediction of college prep track entry is also 
a prediction of upward mobility, while a prediction of noncollege prep 
course-taking is a prediction of stability. If one would predict the student 
does not take a course in the subject, one predicts downward track 
mobility. 

In every case, altering the students’ socioeconomic status would change 
the prediction for their destination. In most cases, the prediction change 
is so large that social background appears to send students in opposite 


¹⁰ When comparisons concern parents’ education, family background, earnings, or num- 
ber of siblings, they refer to persons from intact nonfarm families. 
directions. For example, if a student’s father did not complete high school, 
then we would predict that the student would not take math in grade 11 
and thus would be downwardly mobile. In contrast, if a student’s father 
completed college, then we would predict that the student would take 
college prep math in grade 11, and thus would be upwardly mobile. This 
“diverging trajectories” pattern holds for all of the social background 
factors considered and is extreme for five of the six indicators. 
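
The decision rule behind these statements, predicting the destination with the largest predicted probability, can be sketched as follows. The probabilities are the values reported in table 4 for the six separate indicators, transcribed with decimal points restored; the variable and function names are my own.

```python
DESTINATIONS = ("dropout", "no course", "noncollege prep", "college prep")

# Predicted probabilities from table 4: (disadvantaged, advantaged)
TABLE_4 = {
    "family earnings": ((.022, .343, .336, .299), (.012, .264, .334, .391)),
    "father's education": ((.027, .371, .333, .269), (.014, .283, .336, .368)),
    "mother's education": ((.024, .355, .335, .286), (.015, .295, .337, .353)),
    "father's occupation": ((.023, .353, .335, .288), (.013, .279, .336, .372)),
    "broken family": ((.026, .367, .333, .274), (.019, .328, .337, .315)),
    "number of siblings": ((.025, .365, .334, .276), (.015, .294, .337, .353)),
}


def modal_destination(probs) -> str:
    """Return the destination with the largest predicted probability."""
    return DESTINATIONS[max(range(len(probs)), key=probs.__getitem__)]


for indicator, (low, high) in TABLE_4.items():
    print(f"{indicator:>20}: {modal_destination(low)} -> {modal_destination(high)}")
```

None of the winning probabilities needs to exceed 50% for the rule to apply; it requires only that one category be larger than the other three.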

Note, however, that the predictions above assume that social back- 
ground factors are statistically independent of each other. The last two 
lines of table 4 relax this assumption, presenting predicted probabilities 
for focal students from intact families who are one standard deviation 
below (and above) the mean in father’s education, mother’s education, 
father’s occupation, and number of siblings, and one-half standard de- 
viation below (and above) mean family income.¹¹ This comparison, which 
reflects some degree of association between these indicators of social back- 
ground, shows that we would predict the disadvantaged student would 
take no course. Indeed, the predicted probability for the “no course” cat- 
egory for the disadvantaged student approaches 50%. Yet, our prediction 
for the socioeconomically advantaged student is again very different; we 
would expect this student to rise from noncollege prep math to enter the 
college preparatory track. 

The stark contrast between socioeconomically disadvantaged and ad- 
vantaged students is consistent with effectively maintained inequality and 
is a far cry from the suggestion that social background effects decline to 
zero when a level of education becomes universal. Instead, it suggests 
that when a level of education is universal, social background may matter 
for qualitative dimensions of education that are relevant at that level of 
education and that have implications for students’ chances of making 
later transitions. 


CONCLUDING REMARKS 


When students move from grade to grade, they also move from one strat- 
ified curriculum to another. The factors that determine whether they con- 
tinue schooling also determine where in the stratified curriculum that 
schooling will occur. Students’ location in the stratified curriculum has 


¹¹ Given the values on the variables, we can flesh out the illustration. The disadvantaged 
student is a female from an intact family of six children, whose parents dropped out 
of high school, whose father’s occupation is a forging machine operator, with a family 
income of $14,936 (in 1980 dollars). The advantaged student has one sibling, parents 
who have some college, a father who is a general public administrator, and whose 
family income is $36,267 (in 1980 dollars). 
implications for their likelihood of making additional transitions, and thus
their location in the stratified curriculum is an integral part of the process 
of educational attainment. The observations above serve to connect two 
distinct literatures—research on educational transitions and analyses of 
track mobility. Their convergence addresses issues raised in a provocative 
critique of the educational transitions tradition by supporting the se- 
quential decision-making model, highlighting the theoretical importance 
of the time-varying covariates of track location and performance, and 
invalidating the suggestion that the sequential model of educational at- 
tainment requires a behavioral assumption of myopia. Thus, the foregoing 
analysis has investigated students’ movements through the stratified cur- 
riculum. Although I present and discuss ordered probit coefficients to 
connect the findings to previous work, the argument that findings may 
be sensitive to the functional form of the model led me to assess existing 
theories using the effects of social background on the probabilities of 
school continuation as well. Finally, I evaluated whether the role of social 
background on the probability of reaching particular destinations was 
consistent with effectively maintained inequality. Because the analysis 
produced evidence consistent with this alternate explanation, it appears 
that the convergence of the tracking and education transitions treatments 
is not only statistically useful but also sociologically intriguing. 

When we consider the life course explanation and the theory of max- 
imally maintained inequality, we find that the pattern of background 
effects is most consistent with the maximally maintained inequality ex- 
planation. Key social background factors rise in importance as students 
attempt to enter college in comparison to their effects on previous tran- 
sitions, just as MMI should predict for Reagan-era cohorts. Although few 
would deny that as children mature their dependence declines, and thus 
the life course perspective remains useful, the evidence suggests that de- 
clining dependence on parents does not explain the pattern of social back- 
ground effects on educational transitions. 

However, owing to the simultaneous treatment of school continuation 
and track placement, I found important social background effects
operating even before the college entry transition, and thus the findings
contradict a key implicit tenet of MMI. While MMI suggests that
background-related inequality will go to zero when a level of education is nearly
universal, EMI states that at that very level social background will al- 
locate students to different types of education that have different impli- 
cations for educational attainment. The evidence presented here is most 
consistent with the EMI claim. 

The idea of effectively maintained inequality is not only that advan- 
taged social background is associated with increased chances of better 
placements, but also that the increases are consequential for theoretically 
focal students. Given that school strata are discrete locations, effectively
maintained inequality suggests that social background works efficaciously 
if we would alter our prediction of students’ destinations simply on the 
basis of differences in social background. In other words, if social back- 
ground can move an otherwise “average” student over a threshold, then 
social background effectively maintains inequality. When we considered 
the predictions we would make on the basis of the model results, we found 
that social background advantages consistently serve to “move” children 
(or our predictions for them) from disadvantageous discrete locations to 
advantageous ones. Thus, even though the increment for social back- 
ground effects may be small, we observed it to be effective. 
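
The threshold logic described above can be sketched numerically. The following is a minimal illustration, not the model estimated in this article: it assumes an ordered probit with three track categories, treats the most probable category as the prediction, and uses hypothetical cutpoints and latent index values chosen only to show how a modest shift in the index can move the predicted destination.

```python
import math


def normal_cdf(x: float) -> float:
    """Standard normal CDF computed from the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def category_probs(index: float, cutpoints: list[float]) -> list[float]:
    """Ordered-probit probability of each category given a latent index."""
    cdf = [normal_cdf(c - index) for c in cutpoints]
    bounds = [0.0] + cdf + [1.0]
    return [bounds[j + 1] - bounds[j] for j in range(len(bounds) - 1)]


def predicted_category(index: float, cutpoints: list[float], labels: list[str]) -> str:
    """Return the label of the most probable category (assumed prediction rule)."""
    probs = category_probs(index, cutpoints)
    return labels[probs.index(max(probs))]


# Hypothetical labels, cutpoints, and index values (not estimates from the article).
LABELS = ["no course", "noncollege prep", "college prep"]
CUTPOINTS = [-0.5, 0.5]

disadvantaged = predicted_category(-0.6, CUTPOINTS, LABELS)
average = predicted_category(0.0, CUTPOINTS, LABELS)
advantaged = predicted_category(0.6, CUTPOINTS, LABELS)
```

Under these assumed values the three students, who differ only in the latent index, receive three different predicted destinations, which is the sense in which a small increment can be "effective."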

These findings have important implications for how to understand the 
provision of educational opportunity and the process of educational at- 
tainment. The findings lend credence to the postulation that even though 
high school completion is nearly universal, high school remains an im- 
portant site of competition in which social background matters. Maximally 
maintained inequality implied that when a level of education is universal, 
socioeconomically related contestation (i.e., class conflict) would center on 
higher levels of education. Other research contradicts this vision, pointing 
to intense class conflict around the maintenance of tracking (e.g., Wells 
and Serna 1996). The findings here might help us understand this class 
conflict. 

We find that in high school social background appears to matter for 
the kind of education received rather than for high school completion. 
Clearly, however, social background cannot allocate persons to qualita- 
tively better or worse positions if all positions are equal. Thus, class 
conflict may occur at the high school level as different actors attempt not 
only to obtain advantageous positions for their children, but also to secure 
or dismantle the stratified curriculum. 

Indeed, contrary to the MMI implication, universality of access may 
be largely irrelevant to the intensity of class conflict, as the focus of conflict 
may simply change once access is universal. Further, the results of this 
analysis, which show that social background continues to matter even in 
the presence of universal access, give reason to speculate that the simple
extension of universal access to institutions is unlikely to undo the effective 
power of social resources indexed by common indicators of social
background, at least in the United States. This speculation, if true, would
greatly complicate efforts to ameliorate the inheritance of social disad- 
vantage, although it of course has no implications either way for the 
desirability of such efforts. 

One may speculate further as to whether the idea of effectively main- 
tained inequality has applicability to other educational situations or to 
even broader arenas. It is easy to suggest at least one other place one 
might search for the operation of effectively maintained inequality. As
one example, consider that the proportion of a cohort that enters preschool 
is increasing. It would be illuminating to ascertain whether the effect of 
social background on whether the child enters preschool declines as the 
proportion of the cohort entering preschool increases, while the effect of 
social background on the features of the child’s preschool was maintained, 
increased, or became consequential for otherwise average children. If so, 
this would be consistent with effectively maintained inequality. 

That said, there is ample additional research to do even on students’ 
high school experiences and educational attainment. Will we find similar 
patterns if additional subjects are studied? Would similar patterns be
observed for additional cohorts? Would other qualitative dimensions of
education in the United States, and studies of other nations’ systems of
education, reveal similar patterns of effects? Would models with more 
regressors, or different treatments for unmeasured heterogeneity, reveal 
similar patterns of effects? Thus, although the research presented here 
provides support for effectively maintained inequality, more work is 
needed to assess the robustness and generalizability of this claim even in 
the realm of social stratification and education. 

Even as we acknowledge the importance of the questions above and 
more, we can also imagine the mechanisms behind effectively maintained 
inequality working in many different arenas, areas as important as health 
care (e.g., the role of social background shifting from allocating scarce 
organs to persons to allocating leading surgeons to perform the surgery), 
as well as the relatively trivial (e.g., the role of social background shifting 
from allocating personal transportation to allocating the type of vehicle 
for personal transportation). However, at this juncture, it should suffice 
to say that when we investigate both school continuation and track mo- 
bility, we find consequential effects of social background in each year 
studied. This suggests that the effects of social background occur in at 
least two ways: (1) they determine who completes a level of education if 
completion of that level is not nearly universal, and (2) they determine 
the kind of education persons will receive within levels of education that 
are nearly universal. Either way, social background advantages seem to 
work to effectively and continuously secure for the children of advantage 
advantaged locations of their own. 


APPENDIX A 


Independent Variables 
All variables are recoded to the midpoint for missing cases, and a control 
for missing on each category of variable is used (owing to high collinearity
on the missing indicators within each category).
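
The missing-data treatment described above might look something like the sketch below. It rests on two assumptions that the text does not spell out: that "midpoint" means the midpoint of a variable's observed range, and that the single control per category of variables flags a case when any variable in that category is missing. The variable names and values are hypothetical.

```python
def impute_midpoint(values):
    """Replace missing entries (None) with the midpoint of the observed range."""
    observed = [v for v in values if v is not None]
    midpoint = (min(observed) + max(observed)) / 2.0
    return [midpoint if v is None else float(v) for v in values]


def category_missing_flag(*columns):
    """Return 1 for a case if any variable in the category is missing, else 0."""
    return [int(any(v is None for v in row)) for row in zip(*columns)]


# Hypothetical social background variables for four students.
siblings = [0, 3, None, 6]
father_sei = [20.0, None, 45.0, 70.0]

# Build the flag from the raw columns before imputing.
background_missing = category_missing_flag(siblings, father_sei)
siblings_filled = impute_midpoint(siblings)
father_sei_filled = impute_midpoint(father_sei)
```

Using one flag per category rather than one per variable is what keeps highly collinear missing indicators out of the model.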


Social Background 


Father’s occupation.—Students’ responses to a 17-category question
were recoded to the 1980 SEI score of the mean of the illustrative
occupations in the questionnaire using Stevens and Cho's (1985) updated
occupational scores for the total labor force. Students’ 1980 responses
were taken unless the responses were missing, in which case, the 1982 
response was taken. Homemakers and military were coded as missing 
given that there is no SEI code for those pursuits. 

Mother’s education and father’s education.—Students’ responses to a
10-category question were recoded into the following ordered levels: 1 = 
less than high school graduate; 2 = high school graduate; 3 = some 
college only; and 4 = college degree or more. 

Farm background.—Mother or father was a farmer (1) or not (0). 

Male.—Students’ self-report of sex was used. 

Family income.—Students were asked twice in the base year and twice
in the follow-up to report family income. Base-year data are used unless 
missing; follow-up data are used if the base year is missing. Responses 
are coded to the midpoint of categories; the unbounded upper category 
is coded using the Pareto transformation for an unbounded category. 
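
As a rough illustration of this coding, the sketch below assigns bounded categories their midpoints and estimates the open-ended top category from a Pareto curve. The text does not say which Pareto estimator was used; the version here (shape parameter estimated from the top two categories, top-category mean equal to the lower bound times v/(v - 1)) is one common choice and is an assumption, as are the category bounds and counts.

```python
import math


def pareto_top_mean(lower_prev, lower_top, n_prev, n_top):
    """Estimate the mean of an open-ended top category from a Pareto curve.

    lower_prev, lower_top: lower bounds of the next-to-top and top categories.
    n_prev, n_top: number of cases in those two categories.
    """
    shape = (math.log(n_prev + n_top) - math.log(n_top)) / (
        math.log(lower_top) - math.log(lower_prev)
    )
    if shape <= 1.0:
        raise ValueError("Pareto shape must exceed 1 for the mean to be finite")
    return lower_top * shape / (shape - 1.0)


def code_income(bounds, top_value):
    """Return the midpoint of a bounded category, or top_value if it is open-ended."""
    lower, upper = bounds
    return top_value if upper is None else (lower + upper) / 2.0


# Hypothetical categories and counts, for illustration only.
top_value = pareto_top_mean(lower_prev=10000, lower_top=20000, n_prev=300, n_top=100)
bounded_code = code_income((0, 7000), top_value)
open_code = code_income((20000, None), top_value)
```

The guard on the shape parameter matters because a Pareto distribution with shape at or below 1 has no finite mean, in which case another rule for the top category would be needed.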

Number of siblings.—Students were asked to report on the number of
siblings in 1980; as herein used, the codes are 0 = none, 1 = 1 sibling, 
and so on, up to 6 = 6 or more siblings. 

“Broken” family.—Students’ report of whether in 1980 or 1982 the 
student lived with both mother and father (0) or not (1). 


Race/Ethnicity 


The omitted category for two dichotomous race/ethnicity indicators is non- 
black, non-Latino/a (approximately 95% of the omitted category are 
white). Black indicates students’ self-report of black or not. Latino/a
indicates students’ self-report of Latin ancestry.


Measured Achievement 


Tenth grade tests in vocabulary (range 0-21), reading (0-19), math 1 
(0-28), math 2 (0-10), science (0-20), writing (0-17), and civics (0-10) 
capture measured achievement. 


Academic Performance 


Last performance grade.—Grade in last math class taken, ranges from 
0.0 (F) to 4.3 (A+). 

Audit.—Student audited last math course taken.

Incomplete.—Student given an incomplete in the last math course
taken.


APPENDIX B 


Each year students encounter an opportunity to either continue their 
schooling or elect to stop. Thus, educational attainment is the cumulation 
of a sequence of decisions. For an individual student, then:

$$Y_i = \sum_{t=1}^{s} y_{it},$$

where i refers to the individual student, t refers to transitions, and s refers
to the sum of the transitions. Educational transitions analyses do not
model $Y_i$ but instead typically model several $y_{it}$'s using logistic regression
techniques. The full set of transitions is implied below:


$$\mathrm{Prob}(y_{it} = 1 \mid y_{i,t-1} = 1) = \frac{e^{\sum_k \beta_{kt} X_{ik}}}{1 + e^{\sum_k \beta_{kt} X_{ik}}} + \epsilon.$$
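
To make the sequential structure concrete, the sketch below computes conditional transition probabilities with the logistic function and chains them into the probability of reaching each level and the expected number of completed transitions. It is a simplified illustration that ignores the error term, and the coefficient values are hypothetical rather than estimates from this analysis.

```python
import math


def logistic(x: float) -> float:
    """Logistic function mapping a linear predictor to a probability."""
    return 1.0 / (1.0 + math.exp(-x))


def conditional_probs(x, betas):
    """P(y_t = 1 | y_{t-1} = 1) for each transition; betas[t] holds that transition's coefficients."""
    return [logistic(sum(b * xi for b, xi in zip(beta_t, x))) for beta_t in betas]


def cumulative_probs(cond):
    """Probability of completing transitions 1..t, as a running product of conditionals."""
    out, running = [], 1.0
    for p in cond:
        running *= p
        out.append(running)
    return out


def expected_attainment(cond):
    """Expected number of completed transitions: the sum of the cumulative probabilities."""
    return sum(cumulative_probs(cond))


# Hypothetical student: an intercept term of 1.0 and one background variable equal to 0.0.
x = [1.0, 0.0]
betas = [[0.0, 1.0], [0.0, 1.0]]  # hypothetical coefficients for two transitions
cond = conditional_probs(x, betas)
reach = cumulative_probs(cond)
expected = expected_attainment(cond)
```

Because each transition is conditional on the previous one, attainment depends on the whole chain of probabilities, which is why differences in coefficients across transitions are the quantity of interest.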


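The key portion of the Cameron and Heckman critique can be demonstrated
as follows; consider the logistic regression equations below:

$$\log_e\left(\frac{p_{i1}}{1 - p_{i1}}\right) = \sum_{k=1}^{K} \beta_{k1} X_{ik} + \epsilon_{i1}, \qquad \epsilon_{i1} \sim L(0, \pi^2/3), \tag{B1}$$

$$\log_e\left(\frac{p_{i2}}{1 - p_{i2}}\right) = \sum_{k=1}^{K} \beta_{k2} X_{ik} + \epsilon_{i2}, \qquad \epsilon_{i2} \sim L(0, \pi^2/3), \tag{B2}$$

$$\log_e\left(\frac{p_{i3}}{1 - p_{i3}}\right) = \sum_{k=1}^{K} \beta_{k3} X_{ik} + \epsilon_{i3}, \qquad \epsilon_{i3} \sim L(0, \pi^2/3), \tag{B3}$$

or, in general:

$$\log_e\left(\frac{p_{it}}{1 - p_{it}}\right) = \sum_{k=1}^{K} \beta_{kt} X_{ik} + \epsilon_{it}, \qquad \epsilon_{it} \sim L(0, \pi^2/3),$$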


where $p_{it}$ refers to the probability of person i making transition t, $\beta_{kt}$ is
the association between variable $X_k$ and the probability of making
transition t, and $\epsilon_{it}$ is a person-specific, transition-specific error term.

Consider equations B1 through B3 from the point of view of a single 
student; suppress until further notice the assumption for the distribution 
of the error term. Assume, for illustration purposes, that the student com- 
pletes the first two transitions but not the third. Thus, for this student, 
there is clearly variation in the dependent variable.12

The question Cameron and Heckman ask is what, according to
equations B1-B3, induces the variation? The $\beta$'s and $\epsilon$'s are unknowns
estimated from the data, and for any given student, the value of $X_k$ does not
vary across transitions. In this situation, an infinite number of estimates
for the $\beta$'s satisfy the constraints imposed by the data. Therefore, the
equations are not identified.

To obtain identification, researchers assume a distribution for the error 
term. This identifies the logit model. However, recall the single student. 
The assumption for the error term essentially implies a value for the error 
term for this student for each transition. This error term can, in principle, 
vary across transitions for the individual student. What this implies, however,


12 Of course, the probability of making transitions varies for any student across
transitions, because the (un)conditional population proportion making transitions may vary,
and the best estimate of a student’s probability of completing a transition is the
(un)conditional population proportion making the transition. However, this illustration
may be facilitated, and certainly is not hindered, by assuming variation in the student's
decisions across transitions.

is that the only right-hand side element that varies independently
across equations is the error term. Thus, when the $X_k$ do not vary for a
person across transitions, the assumption for $\epsilon$ identifies all estimated
differences between the $\beta$'s. Cameron and Heckman correctly note that
changing the assumption for the error term may change the findings.

This problem has an obvious solution: introduce a time-varying cov- 
ariate. The foregoing analysis followed just this strategy. Requirements 
that the time-varying covariate must meet are not stringent. Consider 
equations B4—B6: 


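$$\log_e\left(\frac{p_{i1}}{1 - p_{i1}}\right) = \sum_{k=1}^{K} \beta_{k1} X_{ik} + \gamma_1 Z_{i1} + \epsilon_{i1}, \qquad \epsilon_{i1} \sim L(0, \pi^2/3), \tag{B4}$$

$$\log_e\left(\frac{p_{i2}}{1 - p_{i2}}\right) = \sum_{k=1}^{K} \beta_{k2} X_{ik} + \gamma_2 Z_{i2} + \epsilon_{i2}, \qquad \epsilon_{i2} \sim L(0, \pi^2/3), \tag{B5}$$

$$\log_e\left(\frac{p_{i3}}{1 - p_{i3}}\right) = \sum_{k=1}^{K} \beta_{k3} X_{ik} + \gamma_3 Z_{i3} + \epsilon_{i3}, \qquad \epsilon_{i3} \sim L(0, \pi^2/3), \tag{B6}$$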

where $Z_{i1}$, $Z_{i2}$, and $Z_{i3}$ are time-specific realizations of Z. To simplify the
presentation, assume they are centered on their means. Because the Z
variables vary across transitions, two factors potentially create variation
in a student's decisions across transitions: Z and $\epsilon$. Now assume the
following relationships:


$$Z_{i2} = \lambda_1 Z_{i1} + \delta_{i1},$$


$$Z_{i3} = \lambda_2 Z_{i2} + \delta_{i2},$$


where $\delta_{i1}$ and $\delta_{i2}$ are errors from regression equations with the standard
assumptions of regression. $\delta_{i1}$ and $\delta_{i2}$ imply that the regression equation
does not perfectly predict the dependent variable. The relationships above
transform equations B4-B6 as follows:











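$$\log_e\left(\frac{p_{i1}}{1 - p_{i1}}\right) = \sum_{k=1}^{K} \beta_{k1} X_{ik} + \gamma_1 Z_{i1} + \epsilon_{i1}, \qquad \epsilon_{i1} \sim L(0, \pi^2/3),$$

$$\log_e\left(\frac{p_{i2}}{1 - p_{i2}}\right) = \sum_{k=1}^{K} \beta_{k2} X_{ik} + \gamma_2 (\lambda_1 Z_{i1} + \delta_{i1}) + \epsilon_{i2}, \qquad \epsilon_{i2} \sim L(0, \pi^2/3),$$

$$\log_e\left(\frac{p_{i3}}{1 - p_{i3}}\right) = \sum_{k=1}^{K} \beta_{k3} X_{ik} + \gamma_3 (\lambda_1 \lambda_2 Z_{i1} + \delta_{i2} + \lambda_2 \delta_{i1}) + \epsilon_{i3}, \qquad \epsilon_{i3} \sim L(0, \pi^2/3).$$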


Rearranging terms produces the following three equations: 
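
$$\log_e\left(\frac{p_{i1}}{1 - p_{i1}}\right) = \sum_{k=1}^{K} \beta_{k1} X_{ik} + \gamma_1 Z_{i1} + \epsilon_{i1}, \qquad \epsilon_{i1} \sim L(0, \pi^2/3), \tag{B7}$$

$$\log_e\left(\frac{p_{i2}}{1 - p_{i2}}\right) = \sum_{k=1}^{K} \beta_{k2} X_{ik} + \gamma_2 \lambda_1 Z_{i1} + \gamma_2 \delta_{i1} + \epsilon_{i2}, \qquad \epsilon_{i2} \sim L(0, \pi^2/3), \tag{B8}$$

$$\log_e\left(\frac{p_{i3}}{1 - p_{i3}}\right) = \sum_{k=1}^{K} \beta_{k3} X_{ik} + \gamma_3 \lambda_1 \lambda_2 Z_{i1} + \gamma_3 (\delta_{i2} + \lambda_2 \delta_{i1}) + \epsilon_{i3}, \qquad \epsilon_{i3} \sim L(0, \pi^2/3). \tag{B9}$$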


Note that for identifying the $\beta$'s in the sequence of logistic regression
equations (equations B7-B9), what we require is a variable that varies
across transitions independently. Even if the realizations of the
time-varying covariate are related over time, the $\delta$'s (the portion of $Z_t$ that is
independent of $Z_{t-u}$, where u is a positive integer) satisfy that
requirement. This implies that as long as one can find a single time-varying
covariate whose cross-time correlation is not perfect, one can identify
differences between the $\beta$'s by including that variable as a time-varying
covariate.
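
The requirement can be illustrated with a small simulation. The sketch below generates a covariate at two time points that are strongly but not perfectly correlated, regresses the later realization on the earlier one, and reports the share of the later realization's variance left in the residual. The sample size, the coefficient of 0.8, and the error scale are hypothetical choices made only for illustration.

```python
import math
import random


def split_covariate(z_prev, z_next):
    """Regress z_next on z_prev (OLS with intercept).

    Returns the slope, the correlation, and the share of z_next's variance
    that is independent of z_prev (the residual share).
    """
    n = len(z_prev)
    mean_prev = sum(z_prev) / n
    mean_next = sum(z_next) / n
    sxx = sum((a - mean_prev) ** 2 for a in z_prev)
    syy = sum((b - mean_next) ** 2 for b in z_next)
    sxy = sum((a - mean_prev) * (b - mean_next) for a, b in zip(z_prev, z_next))
    slope = sxy / sxx
    corr = sxy / math.sqrt(sxx * syy)
    residuals = [(b - mean_next) - slope * (a - mean_prev) for a, b in zip(z_prev, z_next)]
    independent_share = sum(d * d for d in residuals) / syy
    return slope, corr, independent_share


rng = random.Random(0)  # fixed seed so the illustration is reproducible
z1 = [rng.gauss(0.0, 1.0) for _ in range(2000)]
z2 = [0.8 * a + rng.gauss(0.0, 0.6) for a in z1]  # hypothetical lambda of 0.8

slope, corr, independent_share = split_covariate(z1, z2)
```

The independent share equals one minus the squared correlation, so it is positive whenever the cross-time correlation is less than perfect; that leftover variation is what the argument above relies on.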

Nothing is changed by assuming that some of the variables in X also
determine $Z_t$, for unless the X variables in concert with the lagged values
of Z perfectly determine $Z_t$, there is still some independent variation in
$Z_t$. It is that independent variation, not to say measurement error in
Z, that provides the leverage needed to identify the differences in the
$\beta$'s across transitions.

Once the differences in the $\beta$'s are identified in this manner, Cameron
and Heckman’s claim that the education transitions model assumes my- 
opia no longer holds. This claim depends upon the error term being the 
only right-hand side determinant that varies independently across tran- 
sitions. However, because time-varying covariates also vary at least some- 
what independently across transitions, introducing time-varying covar- 
iates makes the claim about myopia inaccurate. 

The nonidentification of the differences in the $\beta$'s, and the description
of why this is so, is the major contribution of the Cameron and Heckman 
article. However, introducing time-varying covariates solves that problem. 

In the foregoing analysis, two types of time-varying variables were used: 
(1) measures of student performance and (2) previous track location. Ev- 
idence certainly supports the contention that track location is not perfectly 
correlated over time (e.g., Lucas 1999). For theoretical reasons, the foregoing
analysis highlighted the role of tracking, but there are surely
other time-varying covariates that can serve to identify the standard
logistic regression education transitions model. This still leaves the issue of
unobserved heterogeneity, about which many analysts have written. An- 
alysts may continue to work on this problem, because the identification 
issues Cameron and Heckman raise can be resolved by making theoret- 
ically defensible assumptions concerning time-varying covariates. Hence, 
given appropriate data, it is still too early to reject the education tran- 
sitions logistic regression approach. 


Gender and the Career Choice Process: The
Role of Biased Self-Assessments' 


Shelley J. Correll 
University of Wisconsin, Madison 


This article develops a supply-side mechanism about how cultural 
beliefs about gender differentially influence the early career-relevant 
decisions of men and women. Cultural beliefs about gender are 
argued to bias individuals’ perceptions of their competence at var- 
ious career-relevant tasks, controlling for actual ability. To the extent 
that individuals then act on gender-differentiated perceptions when 
making career decisions, cultural beliefs about gender channel men 
and women in substantially different career directions. The hy- 
potheses are evaluated by considering how gendered beliefs about 
mathematics impact individuals’ assessments of their own mathe- 
matical competence, which, in turn, leads to gender differences in 
decisions to persist on a path toward a career in science, math, or 
engineering. 


Women and men hold different kinds of jobs, as abundant evidence shows 
(for reviews, see Reskin 1993; Jacobs 1995a; Jacobsen 1994). While ex- 
planations of the persistence of sex segregation in paid work remain in- 
complete, the consequences for gender inequality are clear. The differential 
occupational distribution of men and women explains the majority of the 
gender gap in wages (Peterson and Morgan 1995; Treiman and Hartman 
1981). Most attempts by sociologists to explain the persistence of sex 
segregation in the labor force document the importance of demand-side 
processes, such as statistical discrimination, internal labor markets, and 
the gendering of job queues (for reviews of this research, see Reskin and 
Roos 1990; England 1992). Far less attention has been given to
supply-side processes by which males and females differentially move into various
activities associated with different kinds of work (Peterson and Morgan 
1995). However, supply-side processes are important because the supply 
networks from which employers recruit are already segregated by gender 
(Granovetter and Tilly 1988). Further, sex segregation often emerges early 
in the path toward many careers. For example, Jacobs (1995b) finds that
one-third of all women would have to change college majors to be dis- 
tributed in the same manner as their male counterparts. Since males and 
females appear to be voluntarily making career-relevant decisions that 
will carry them, on average, in substantially different occupational di- 
rections, it is important to examine these early stages in the supply-side 
process and ask why men and women make the choices they do. 

In this article, I develop and test a simple supply-side mechanism to 
illustrate how cultural conceptions of gender serve to constrain the early 
career-relevant choices of men and women. I argue that widely shared 
cultural beliefs about gender and task competence bias actors’ perceptions 
of their competence at various skills. Focusing on perceptions of com- 
petence is crucial for understanding modern stratification systems since 
the presumption of competence legitimates inequality in achievement- 
oriented societies such as the United States. When competence at a certain 
skill is thought to be necessary for a particular career, then gender dif- 
ferences in the perceptions of task competence, over and above actual 
ability, foster gender differences in commitment to paths leading to that 
career. 

As a specific location of this process, I examine how gender differences 
in the perception of mathematical competence influence high school and 
college students’ educational decisions that lead to careers in engineering, 
math, and the physical sciences. As these professions have been especially 
impervious to the entrance of women (Hanson 1996), they provide a con- 
venient window from which to examine the process by which cultural 
beliefs about gender differentially influence early career decisions of men 
and women. Further, since the “quantitative professions” are among the 
more rewarding financially (Frehill 1997; Babco 1988; Dossey et al. 1988), 
gender differences in the movement into them have consequences for the
continued gender gap in wages. Certainly, discrimination and other struc- 
tural constraints continue to limit the occupational opportunities available 
to women. However, a fuller understanding of the persistence of sex seg- 
regation in the labor force can be gleaned by also examining the seemingly 
voluntary processes by which men and women make career-relevant 
choices. 


GENDER AND CAREER CHOICE PROCESSES


The career choice process occurs throughout the life cycle as individuals 
make a series of decisions that have occupational consequences. Sociol- 
ogists who examine the processes by which individuals choose careers 
have focused primarily on later stages when individuals actually choose 
to enter jobs rather than on the decisions to move into activities at earlier 
stages on the paths leading to specific careers. However, as noted above, 
gender differences in the selection of activities that constrain occupational 
choices often occur earlier in the life cycle. This is especially evident in 
the case of professions like engineering, where a college degree in the field 
is necessary to pursue a career. Due to the sequence of required classes, 
the decision to pursue a degree in engineering or the physical sciences 
must usually be made during the first or second year of college (Seymour 
and Hewitt 1997). Further, those who fail to take advanced-level math 
classes in high school are highly unlikely to select college majors in science, 
math, or engineering (McIlwee and Robinson 1992). Since gender
differences in the selection of activities relevant to careers in these fields emerge
as early as high school, it is important to examine decisions made at this 
stage in the life cycle.’ 


Gender and the Path to Math 


The ratio of females to males declines as young people move further down 
the path toward the quantitative professions (McIlwee and Robinson
1992; Catsambis 1994). By high school, males are more likely than females
to be enrolled in advanced-level math and science elective classes (AAUW 
1992; National Science Board 1993; National Science Foundation 1994). 
Of the bachelor’s degrees earned in 1990, 31.2% of physical science degrees 
and 13.8% of engineering degrees were awarded to women (Jacobs 1995b).
In the United States workforce in 1993, only 8% of all engineers and 9% 
of all physicists were female (National Science Foundation 1996). Thus, 
in contrast with the vast movement of women into other professions, such 
as law and medicine, engineering and the physical sciences remain ex- 
tremely male dominated. 

In considering the process by which males and females differentially 
move into activities relevant to careers in engineering and the physical 
sciences, it is important to establish what is not causing this gender
difference. We need to keep in mind that, unlike other systems of difference


1 Gender differences in career aspirations may, of course, emerge even earlier in the 
life cycle. However, gender differences in occupationally relevant behavior, such as 
course enrollment decisions, are minor prior to high school when students begin to 
choose elective courses.
such as race and class, males and females grow up primarily in
mixed-sex families and attend similar kinds of high schools. Since most young
people attend coeducational high schools and high schools tend to have 
very balanced sex ratios, gender differences in career choice are not pri- 
marily due to differences in the type of high school attended by males 
and females. Further, gender differences in the entry into the quantitative 
professions are not due to differences in family structure or socioeconomic 
status since males and females are distributed roughly equally across these 
groups. Finally, a gender difference in the choice of a quantitative college 
major is not the result of a higher rate of transition from high school to 
college by males, since females are slightly more likely than males to attend 
college (National Center for Education Statistics 1998). In sum, compared 
to differences between students of different ethnic groups or social classes, 
there is considerable similarity in the structural location and resources 
available to male and female youth. What is puzzling is that a gender 
gap emerges early in the path toward careers in the quantitative profes- 
sions in spite of this structural similarity.’ 

While many and varied explanations have been offered for the contin- 
ued dearth of women in engineering and the physical sciences, most ex- 
planations implicate a linkage to mathematics (for a thorough review, see 
Oakes 1990). Mathematics has been described as the “critical filter” on 
the path to careers in math, science, and engineering (Sells 1973; Dossey 
et al. 1988). But, how does this filter serve to remove women dispropor- 
tionately from the path to the quantitative professions? 


Gender and Mathematical Aptitude 


One explanation for the shortage of women in the quantitative professions 
is that males have a biological aptitude for math that females lack (Peng 
and Jaffe 1979; Rudisill and Morrison 1989; Benbow and Stanley 1980, 
1983; Kolata 1980). However, cross-national studies have found wide
variation in both the direction and magnitude of mathematical gender differences,


> Given the vast differences in social class, type of school attended, and other factors
between members of different ethnic groups that have existed historically and continue
to the present time, the argument I make here has less to say about the reproduction
of inequality by race or ethnicity. This is not to suggest that cultural beliefs about race
do not exist or do not influence career decisions, but rather that the differences in
resources available to members of different ethnic or racial groups probably overwhelm
the impact of cultural beliefs about race in reproducing inequality. Further, whereas
males and females mostly grow up together in the same families, members of different
race or ethnic groups are much more likely to grow up in different families. Families,
for racial or ethnic minorities, can provide insulation or refuge from the dominant
white culture. For example, some have suggested that African-American families teach
their children ways of resisting hegemonic beliefs (see, e.g., Portes and Wilson 1976).

casting serious doubt on biological superiority theories (Baker
and Jones 1993; Finn 1980; Harnisch 1984). Further, a meta-analysis of 
over 100 studies demonstrates that gender differences in mathematical 
performances are small, have declined over time, and vary in direction 
depending on the mathematical domain (e.g., computation, understanding 
of mathematical concepts, etc.; Hyde, Fennema, and Lamon 1990). Since 
analyses of gender differences in math aptitude are often conducted using 
large national surveys or populations of college freshmen, even differences 
that are statistically significant are often very small in magnitude (see, 
e.g., Hyde et al. 1990).* Thus, gender differences in actual mathematical 
competence do not seem to be responsible for the large differences in the 
numbers of men and women choosing to enter fields requiring some level 
of mathematical competence. Instead, I argue that cultural beliefs about 
gender and mathematics differentially influence the movement of males 
and females along educational and career paths leading to careers in 
science, math, and engineering. In the next section, I draw upon current
understandings of gender as a multilevel system to develop this argument. 


CULTURAL BELIEFS AND BIASED SELF-ASSESSMENTS


Gender and Cultural Beliefs


Sociologists have increasingly realized that gender is a multilevel system 
that consists not only of roles and identities at the individual level, but 
also includes ways of behaving in relation to one another at the inter- 
actional level, and cultural beliefs and distributions of resources at the 
macrolevel (Ridgeway 1997; Ferree, Lorber, and Hess 1999; Risman 1998). 
The multilevel nature of this system allows processes that contribute to 
the reproduction of gender inequality at the macro, micro, and interac- 
tional levels to occur simultaneously. In this way, the gender system is 
overdetermined and represents a powerfully conservative system. While 
I focus primarily on the role of macrolevel cultural beliefs in perpetuating 
gender differences in the early career-relevant decisions young people 
make, processes at the interactional and individual levels undoubtedly 
also contribute to the outcomes described. 

Cultural beliefs about gender (hereafter called “gender beliefs”) are the 


* Using data from the current study, males have significantly higher math test scores 
than females, although the differences are small in magnitude, less than 0.1 of a standard
deviation. Males have math scores of 51.7 versus 51.3 for females for the sample used
in models 1-3 (see table 1, models 1, 2, and 3). Females, by contrast, have significantly
higher math grades. Further, these small gender differences do not account for the 
large gender differences in decisions relevant to careers in science, math, and engi- 
neering, as will be shown in models 4 and 5, presented later in the article. 
component of gender stereotypes that contain specific expectations for
competence. It is this component, with its specific expectations of com- 
petence, that presents special problems for gender equality (Ridgeway and 
Correll 2000). Gender beliefs are also cultural schemas for interpreting or 
making sense of the social world. As such, they represent what we think 
“most people” believe or accept as true about the categories of “men” and 
“women.” In North America, at least, men are widely thought to be more 
competent than women, except when performing “feminine” tasks (Con- 
way, Pizzamiglio, and Mount 1996; Wagner and Berger 1997; Williams 
and Best 1990). As we will see below, substantial evidence indicates that 
mathematical tasks are often stereotyped as “masculine” tasks. Even in- 
dividuals who do not personally believe that men are more competent 
than women are likely aware that this belief exists in the culture and 
expect that others will treat them according to it. This expectation has 
been shown to modify behavior and bias judgments, as will be described 
below (Foschi 1996; Steele 1997). I next review the literature that estab- 
lishes the nature of the gender beliefs associated with mathematics. I then 
describe how these gender beliefs bias judgments of mathematical com- 
petency and, consequently, influence career-relevant choices. 


Gender Beliefs about Mathematics 


Many studies have shown that students view math as masculine and 
perceive mathematics to be a male domain (Meece et al. 1982; Fennema 
and Sherman 1977, 1978; Hyde et al. 1990; Armstrong 1981; Whyte 1986). 
Likewise, most students believe math and science to be more useful and 
important for boys and better understood by them (Eccles et al. 1984). A 
recent ethnographic study of over 300 male and female students who were 
enrolled in an engineering or science major or had switched out of one 
paints a detailed picture of the gendered culture of math and science 
(Seymour and Hewitt 1997). Many of the women in this study said they 
had difficulty “giving themselves permission” to major in science, math, 
and engineering, even though they could not explain precisely what had 
discouraged them (p. 241). They described a dampening effect of a cultural 
message that suggests that women either could not or should not do math 
and science. 

Collectively, the studies cited above demonstrate that widely shared 
cultural beliefs do include claims that males are more competent than 
females at mathematics. While empirical support for actual gender dif- 
ferences in mathematical competence is weak (Baker and Jones 1993; 
Finn 1980; Harnisch 1984; Hyde et al. 1990), the belief of male mathe- 
matical superiority itself is widely dispersed in American culture. Expo- 
sure to news reports that claim that males have greater natural mathe-
matical ability has been found to increase mothers’ stereotypic perceptions
of their daughters’ mathematical abilities (Jacobs and Eccles 1985). Re-
search also suggests that parents convey different expectations of math- 
ematical success to their male and female children (Frome and Eccles 
1998). Likewise, male and female teachers at all grade levels routinely 
have lower expectations in math for females than for males (AAUW 1992; 
National Science Foundation 1994). Thus, individuals are exposed to gen- 
der beliefs associated with mathematics from various sources (teachers, 
parents, counselors, published results of standardized test scores by gen- 
der), and likely become aware that “most people” believe that males, as 
a group, are better at math. 

Some individuals probably also come to personally believe that males 
are better at math, although girls have been shown to be less likely than 
boys to hold stereotypic views about mathematics (Hyde et al. 1990). If 
an individual girl believes that boys are better at math, she might view 
mathematical competence as inconsistent with a female gender identity, 
doubt her mathematical ability, and decrease her interest in careers re- 
quiring high levels of mathematical proficiency. In this way, personally 
holding gender stereotypic views in regard to mathematics would be suf- 
ficient to produce gender differences in perceptions of mathematical com- 
petence and commitment to careers requiring mathematical proficiency. 
However, personally holding a stereotypic belief is not necessary for the 
argument I make. Instead, it is only necessary that individuals perceive 
that others hold these gendered beliefs with respect to mathematics, a less 
stringent assumption. In the next section, I explain why this less stringent 
assumption is sufficient for the argument I make and describe how cultural 
beliefs about gender and mathematics differentially influence the early 
career decisions of males and females. 


The Impact of Gender Beliefs on Judgments and Behaviors 


Gender beliefs can operate in different ways simultaneously to contribute 
to the reproduction of gender inequality. It is clear that children learn 
and internalize gender beliefs and that this internalization affects behav- 
ior. However, there is some variation in what is internalized. With respect 
to mathematics, one possibility is that an individual comes to personally 
believe that boys are better at math than girls. Holding stereotypic beliefs 
about activities, such as mathematics, has been shown to influence the 
attitudes and career aspirations of young people (Eccles et al. 1999). The 
other possibility is that an individual internalizes the belief that “most 
people” believe boys are more competent than girls at mathematics. 
Ridgeway (1997) argues that when gender beliefs are salient they shape 
behavior most powerfully by affecting people’s sense of what others expect 
of them. When males are widely thought to be more competent at a task
than females, both males and females in a situation unconsciously expect 
more competent task performances from men (Berger et al. 1977). This 
differential performance expectation has been shown to invoke the use 
of a more lenient standard for evaluating the performances of men in the 
situation compared to women (Foschi 1989). The use of a more lenient 
standard to judge male performances causes males to be perceived as 
having more task ability than females, even when males and females 
perform at the same objective level (Foschi et al. 1994; Foschi 1996). 
Thus, when a female enters a situation having internalized the belief that 
“most people” expect more competent performances from men, even if 
she does not personally endorse this stereotypic belief, she may still leave 
the situation with a lower assessment of her ability compared to a male 
performing at the same level, due to the biasing effect of others’
expectations.

Recent research by Steele (1997) and Lovaglia et al. (1998) further 
suggests that when individuals know others expect people of their social
category (e.g., women, African-Americans) to do relatively poorly on a 
task, this knowledge creates anxiety and actually leads to poorer per- 
formances. Steele and colleagues (Steele 1997; Spencer, Steele, and Quinn 
1999) experimentally manipulated the relevance of a gender belief asso- 
ciated with a task. When subjects were told that males performed better 
at the task, male subjects outperformed female subjects. However, when 
subjects were told that previous research had found no gender differences 
in performing the task, females and males did equally well. Even if sub- 
jects did not personally believe that males were better at the task, their 
awareness that others held this belief heightened their anxiety and had 
an impact on their performance. This leads to the conclusion that re- 
gardless of whether gender beliefs are personally endorsed or internalized 
as other people’s expectations, they often lead to biased self-assessments 
of ability. I now turn to describing how gender beliefs about mathematics 
bias perceptions of task competence and, thereby, influence career-relevant 
decisions. 


The Constraining Effect of Gender Beliefs 


My general argument is that widely shared cultural beliefs about gender 
and task competence differentially bias how individual males and females 
evaluate their own competence at career-relevant tasks. This bias may be 
the result of the internalization of a cultural belief about gender and 
mathematics into one’s gender identity, or it may be the result of the 
expectation of others causing males and females to invoke the use of 
different standards for evaluating their own mathematical success, or 
FIG. 1.—The impact of biased self-assessments on career-relevant decisions


both. The predicted outcome, however, is the same: males will overesti- 
mate and females will underestimate their own mathematical ability. If, 
for a given level of achievement, females are less likely than males to 
perceive that they are good at a task, they should be more likely to reduce 
their efforts and interests in activities requiring competence at the task, 
and therefore they should also be less likely to persist on a career path 
requiring task competence. 

It is important to note that gender beliefs are not rigid scripts that 
individuals are compelled to follow. Indeed, when individuals assess their 
own competence at a given task, their assessments should depend more 
on performance information (such as grades or test scores) than on cultural 
beliefs about gender differences in task competence. However, cultural 
beliefs provide a context of meaning that modifies or biases the more 
situationally relevant foreground information, such as the evaluations of 
task competence by others (Ridgeway 1997; West and Zimmerman 1987). 
At the individual level, this biasing effect allows individuals considerable 
variability in the perceptions of their task competence. But, at the ag- 
gregate level, it should be sufficient to produce systematic gender differ- 
ences in perceived task competence. 

Figure 1 provides a general sketch of my argument about the impact 
of these processes on persistence on educational and career paths. To test 
this model, I propose three hypotheses, which are described in more detail 
below. 


Gender Beliefs and Biased Self-Assessments


In order for a person to continue on a path toward a given career, I 
assume that she or he must adopt a personal conception of herself or 
himself as competent at the tasks believed to be necessary for that career
path. That is, while many factors certainly influence individual career- 
relevant decisions and preferences, as a minimum, one must feel com- 
petent at the skills or tasks necessary for a given career in order to
commit oneself to pursuing that career. I refer to this personal conception 
of task competence as a “self-assessment.” As depicted in figure 1, positive 
performance feedback in regard to a given task by legitimate others (such 
as teachers or supervisors) should increase one’s personal self-assessment 
of task competence. However, cultural beliefs about gender and task com- 
petence are argued to provide a framing context (Ridgeway 1997; West 
and Zimmerman 1987) that biases other information individuals use in 
assessing their own competence. 

In the case of mathematics, when males receive positive feedback about 
their ability, they should evaluate themselves as skilled at mathematics 
since a positive evaluation is consistent with both the feedback they re- 
ceived and with societal expectations about their mathematical compe- 
tency. Conversely, females who receive positive feedback about their 
mathematical ability should be less likely to perceive that they are skilled 
at mathematics since this perception is incongruent with widely shared 
beliefs about gender and mathematics. Research has shown that individ- 
uals are more likely to attend to and retain information that confirms 
stereotypes and to ignore information that contradicts expectations (Ham- 
ilton 1981). Further, Foschi (1996; Foschi et al. 1994) has demonstrated 
that when a cultural belief about a task advantages males (i.e., when 
males are widely thought to be more competent at the task), both males 
and females unconsciously use a more lenient standard to evaluate male 
performances compared to female performances. The use of a more lenient 
standard ensures that even when they are performing at identical ability 
levels, males are judged as being more competent or having more task 
ability than females. 

Since widely shared cultural beliefs include claims that males are more 
competent than females at mathematics, I expect that males are more 
likely than females performing at the same level to perceive that they are 
good at mathematics. 

HYPOTHESIS 1.—Males’ assessments of their own mathematical com-
petence are higher than females’ assessments, controlling for performance
feedback about mathematical ability. 

To test the idea that widely shared cultural beliefs about male math-
ematical superiority lead to gender differences in mathematical self-
assessments, it is useful to compare male and female self-assessments for
another set of tasks for which cultural beliefs do not advantage males. 
Verbal tasks are used for this purpose. While comparing male and female 
perceptions of task competence across a wide variety of tasks would pro-
vide a better test of this idea, the available data provide these measures
only for math and verbal tasks. Previous research is unclear on whether 
cultural beliefs about verbal tasks advantage females or are gender neu- 
tral, but they do not advantage males. Therefore, if male-advantaging 
cultural beliefs lead to higher self-assessments of mathematical task com- 
petence by males, in the absence of male-advantaging beliefs, gender 
differences in verbal self-assessments will either be smaller, nonexistent, 
or slightly favor females. Thus, I expect males not to make higher self- 
assessments of their verbal competence than females when both perform 
at the same level. This distinction is important because if males globally 
assess their competence higher than females at all tasks, regardless of the 
task’s gender association, then gender differences in self-assessments could 
not predict gender differences in persistence on any particular path. 


Performance Feedback and Self-Assessments 


For both males and females, I expect more positive performance feedback 
about task ability to lead to higher self-assessments of task competence. 
However, I expect gender beliefs to influence the extent to which per-
formance feedback affects self-assessments. Males who assess themselves
as competent at mathematics can base their assessment, at least partially, 
on societal expectations, rendering feedback about their competence less 
important to them than to females, who must base their assessments on 
something other than societal expectations. Societal expectations of their 
competence provide males with a sense of confidence in their mathe- 
matical ability that makes performance information, either supportive or 
contradictory to this expectation, less relevant to their self-assessments. 
In other words, knowing others expect you to do well at a task provides 
an insulating layer from the constant input of performance feedback com- 
mon in school and work environments. Conversely, self-assessments will 
be more contingent on performance feedback in a situation where societal 
expectations of task competence are lacking. Therefore, I expect that: 

HYPOTHESIS 2.—The effect of performance feedback about mathematical 
competence on mathematical self-assessment is larger for females than for 
males. 


Biased Self-Assessments and Career-Relevant Decisions 


As previously stated, both males and females must adopt a personal con- 
ception of themselves as competent at the tasks believed necessary for a 
specific career if they are to continue on a path leading toward that career. 
Therefore, while performance information about task competence un- 
doubtedly has an impact on decisions to move forward on a path leading 
to a given career (path “C” of fig. 1), self-assessments of task competence
will have an effect on career-relevant decisions above and beyond the 
effects of external indicators of ability (path “D” of fig. 1).5 Since math- 
ematical competence is assumed to be necessary for persisting on a quan- 
titative career path, I expect that for both males and females: 

HYPOTHESIS 3.—Higher self-assessments of mathematical competence 
increase the odds of persisting on a path toward a career in a quantitative 
profession. 

However, if males are more likely than their female counterparts of 
equal mathematical ability to believe they are competent at mathematics 
(hypothesis 1), and if more positive self-assessments increase the likelihood 
of continuing on the quantitative career path (hypothesis 3), then gender 
differences in self-assessments of mathematical competence should par- 
tially account for the disproportionately high numbers of males in the 
quantitative professions. 


DATA 


The data for this study are from the National Education Longitudinal
Study of 1988 (NELS-88). In 1988, a multistage probability sample of 
approximately 25,000 eighth grade students, their parents, teachers, and 
school administrators, from over 1,000 schools was surveyed. A subsample 
of the students from the base year was again surveyed in 1990, 1992, and 
1994, when most were sophomores, seniors, and two years beyond high 
school, respectively. The students were also given tests in mathematics, 
reading, social studies, and science, which were developed by the Edu- 
cational Testing Service. The data set contains over 6,000 variables on 
each student (see Ingels et al. [1992] for more information on this data 
set). 

While approximately 25,000 students were surveyed in the base year 
of the NELS-88 study, the subsequent waves were a subsample of the 
base year and were restricted to those students who continued to be 
enrolled in the same high school and who were not enrolled in some 
alternative certification program. Thus, the sample size available at each 
wave decreases. I utilize three different longitudinal subsamples to test 
the above hypotheses. The self-assessment hypotheses (hypotheses 1 and 
2) are tested using the eighth to tenth grade longitudinal subsample (sam- 
ple size = 17,424). The hypothesis about the effect of biased self-assess- 


5 Some might wonder why external evidence would have an effect at all if it is not
internalized into a self-assessment. However, noninternalized evidence could have an
effect in that it can affect how gatekeepers, such as teachers, channel students along 
career paths. 
ments on career-relevant decisions is tested on either the eighth to twelfth
grade subsample (sample size = 16,489) or on the eighth grade to two 
years past high school subsample, depending on whether the decision 
being made occurs during the senior year of high school or in college. In 
the latter case, I examine choice of college major, making it necessary to 
restrict analysis to those students who attended college (sample size = 
8,724). The cases are weighted using the appropriate NELS longitudinal 
sampling weights,7 and the analyses are conducted on cases for which
there is no missing data on variables included in the models.8 I further
restrict my attention to those students who identified themselves as Asian/ 
Pacific Islander, Hispanic, African-American, or white, approximately 
99% of the sample. 


MODELS AND MEASUREMENTS 


Five models are used to test the three hypotheses. The dependent variable 
in model 1 is mathematical self-assessment, a latent variable described 
below. The dependent variable in model 2 is verbal self-assessment, also 
a latent variable. While model 1 tests whether males assess their math- 
ematical competence higher than females, the verbal self-assessment 
model, model 2, evaluates whether males globally assess their competence 
higher than females do, regardless of a task’s gender association. Taken 
together, models 1 and 2 provide a test for the idea that widely shared 
cultural beliefs about gender and task competence have an impact on the 
perceptions males and females make of their competence at various tasks 
(hypothesis 1). Model 3 tests the second hypothesis that the effect of per- 
formance feedback about mathematical competence on mathematical self- 
assessments is greater for females than for males. To evaluate this hy- 
pothesis, separate models are estimated for males and females to test for 


6 A Heckman probit model is used to evaluate the bias introduced by this selection of
cases (Van de Ven and Van Praag 1981).

7 A different longitudinal sample weight was used for each longitudinal subsample:
F1PNLWT, F2PNLWT, and F3PNLWT for the eighth to tenth grade, eighth to
twelfth grade, and eighth to two years after high school subsamples, respectively. 

8 Another possible way of handling the decreasing sample size is to select those students
who attended college in the 1994 wave of the data set and use them for all analyses. 
This approach has the advantage of producing a consistent sample across each model. 
However, by selecting all of the cases available at each wave, I retain a
larger sample size and prevent the potential bias of selecting only students who went
to college, arguably a more academically oriented sample of students. For the sake of 
comparison, all models were also run on the smaller sample of only those students 
who were in college two years beyond high school. Results from models estimated on 
this alternative sample were virtually identical to those presented here. 
the interaction effect between gender and performance feedback.9 In all
other models, gender is included as a male dummy variable in order to 
assess the magnitude and direction of the gender effect on self-assessment 
and the career choice process. Models 4 and 5 evaluate the effect of biased 
self-assessments on career-relevant decisions at two different time points 
(hypothesis 3). All cases were weighted prior to estimating these models.10
In the next section, I describe how each of the concepts mentioned in the
hypotheses is operationalized and measured and how the models are 
estimated. 


Dependent Variables 


The latent variable “mathematical self-assessment” measures the extent 
to which students believe they are skilled at math. Likewise, the latent 
variable “verbal self-assessment” measures the extent to which students 
think they are verbally skilled. Three items serve as indicators of math-
ematical self-assessment: “Mathematics is one of my best subjects,” “I
have always done well in Math,” and “I get good marks in Math.” Like- 
wise, three items serve as indicators of verbal self-assessment: “I learn 
things quickly in English,” “I get good marks in English,” and “English 
is one of my best subjects.” Students were asked to agree or disagree on 
a six-point scale to these prompts during their sophomore year of high 
school, the only year in which these items were included in the survey. 
Confirmatory factor analysis provided support for a two-factor model, 
measuring distinct math and verbal self-assessment concepts. The math 


9 When models are estimated separately for males and females, the error variances are
allowed to vary between the two samples. The other possible way of modeling inter- 
active gender differences is to include one or more interactive terms (the product of 
gender with the variable of interest) in a model that includes a gender dummy variable
and is, therefore, estimated for both males and females. A model estimated with in- 
teractive terms constrains the error variances to be equal between the male and female 
samples. While both modeling schemes produce identical coefficient estimates, tests of
significance could vary due to the difference in the assumption regarding the error 
variances. For comparison, a model with a gender and evidence interaction term was 
estimated and no substantive differences in tests of significance were found. The models 
presented here, with males and females modeled separately, have the advantage of
allowing for easier comparison of the magnitude of the hypothesized gender difference 
in the effect of evidence on self-assessments. 


10 Some have argued that sampling weights are not necessary in multivariate analysis
if the weight is not a function of the dependent variable, and that weighting in mul- 
tivariate analysis, at least with the OLS estimator, actually produces inefficient esti- 
mates (Winship and Radbill 1994). All models presented in this article were also estimated
without weights, and the results are highly similar. I also followed the procedure 
advocated by DuMouchel and Duncan (1983) for assessing whether estimates from 
weighted and unweighted models are significantly different and found no significant 
differences. Results are available by request. 
items had standardized factor loadings of 0.89, 0.88, and 0.87, respectively.
The verbal items had standardized factor loadings of 0.78, 0.85, and 0.82, 
respectively. When the factor loadings were constrained to be equal in 
the male and female samples, the chi-square statistic increased insignif-
icantly by 6.63 with 4 degrees of freedom, indicating that the mathematical
and verbal self-assessment concepts have the same meaning for males 
and females. 
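
The invariance check just described can be reproduced from the two numbers the text reports. The sketch below is not the author's code. It assumes the reported increase of 6.63 is a chi-square difference on 4 degrees of freedom, and it uses the closed-form upper-tail probability that holds for even degrees of freedom. The function name `chi2_sf_even_df` is hypothetical.

```python
import math


def chi2_sf_even_df(x: float, df: int) -> float:
    """Upper-tail probability of a chi-square variable with even df.

    For df = 2m the survival function has the closed form
    exp(-x/2) * sum_{k=0}^{m-1} (x/2)^k / k!, so no external library is needed.
    """
    if df <= 0 or df % 2 != 0:
        raise ValueError("this closed form only covers positive even df")
    half = x / 2.0
    terms = sum(half ** k / math.factorial(k) for k in range(df // 2))
    return math.exp(-half) * terms


# Values reported in the text: chi-square increase of 6.63 on 4 df.
delta_chi2 = 6.63
delta_df = 4
p_value = chi2_sf_even_df(delta_chi2, delta_df)
print(f"p = {p_value:.3f}")
```

A p-value above the conventional 0.05 threshold is consistent with the text's reading that constraining the loadings to be equal across the male and female samples does not significantly worsen fit.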

Models with mathematical or verbal self-assessment as the dependent 
variable (models 1, 2, and 3) were estimated in a structural equation 
framework using maximum-likelihood estimation. Models 1 and 2 were 
estimated simultaneously, and the residuals for the math and verbal as- 
sessment latent variables were allowed to correlate.11 One advantage of
structural equation modeling is that equality constraints can be placed 
on regression coefficients for different groups, such as males and females, 
to assess whether statistically significant differences in the size of the effect 
of an independent variable on a dependent variable exist between the
groups. Importantly, the effect of performance feedback, in the form of 
math grades, on mathematical self-assessment can be constrained to be 
equal in the male and female samples to examine whether these constraints 
statistically improve the fit of the model, allowing for a test of hypothesis 
2. The AMOS and MPLUS statistical packages were used to estimate 
these models (Arbuckle 1997; Muthen and Muthen 1998).12
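
A rough way to see what a test of hypothesis 2 asks is to compare the two group-specific coefficients directly. The sketch below is a simpler stand-in, a z-test for the difference between coefficients estimated on independent samples. It is not the equality-constraint test run in AMOS or MPLUS. The function names and the input numbers are hypothetical and are not estimates from this study.

```python
import math


def coefficient_difference_z(b_female: float, se_female: float,
                             b_male: float, se_male: float) -> float:
    """z statistic for the difference between two coefficients
    estimated on independent samples."""
    return (b_female - b_male) / math.sqrt(se_female ** 2 + se_male ** 2)


def two_sided_p(z: float) -> float:
    """Two-sided p-value under the standard normal distribution."""
    return math.erfc(abs(z) / math.sqrt(2.0))


# Hypothetical inputs for illustration only; NOT estimates from the article.
z = coefficient_difference_z(b_female=0.50, se_female=0.02,
                             b_male=0.44, se_male=0.02)
print(f"z = {z:.2f}, p = {two_sided_p(z):.4f}")
```

With separately estimated models, a larger coefficient for females together with a small p-value would point in the direction hypothesis 2 predicts. The constrained-fit comparison described in the text reaches the same question through a change in model chi-square.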

Path persistence, the dependent variable in hypothesis 3, refers to 
whether or not students move into activities that require a certain level 
of mathematical competence and are relevant to careers in the quantitative 
professions. Path persistence is measured at two time points on the ed- 
ucational path leading to careers in science, math, and engineering. First, 
a calculus enrollment variable measures whether or not a student enrolled 
in calculus by her or his senior year of high school. This information was 
taken from students’ transcripts. Second, a quantitative major variable 
indexes whether or not those students who enrolled in a postsecondary 
educational institution selected a quantitative major in college. Students 
were asked in an open-ended question to list their college major. The 
National Center for Education Statistics (NCES) coded responses into
112 detailed categories. I created a dichotomous measure from this var- 
iable where all engineering majors, chemistry, physics, other physical sci-


11 Allowing the residuals for the two latent variables to be correlated improved the fit
of the model (χ² = 298; df = 1).

12 These models were also estimated using OLS and robust variance estimators, and
no substantive differences in estimates were found. That is, all three methods generated 
coefficients that were of similar magnitude, and the same coefficients were found to 
be significant by each method of estimation. Results are available on request. 
ences, computer programming, statistics, and mathematics were coded as
quantitative and all other majors were coded as nonquantitative. 

As the path persistence variables are dichotomous, logistic regression 
is used to estimate these models (models 4 and 5). Thus, the models 
estimate the effect of the independent variables, described below, on the 
likelihood of a student enrolling in high school calculus or choosing a 
quantitative major.13
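
In equation form, a logistic model of this kind can be sketched as follows. The notation is an assumption introduced here for illustration: $y_i$ indicates whether student $i$ persists (enrolls in calculus or chooses a quantitative major), $\text{Male}_i$ is the gender dummy, $\text{SA}_i$ is the mathematical self-assessment, and $\mathbf{x}_i$ collects the remaining independent variables.

```latex
\[
\log \frac{\Pr(y_i = 1)}{1 - \Pr(y_i = 1)}
  = \beta_0 + \beta_1\,\text{Male}_i + \beta_2\,\text{SA}_i
    + \mathbf{x}_i^{\top}\boldsymbol{\gamma}
\]
```

Under this parameterization, $e^{\beta_2}$ is the multiplicative change in the odds of persisting for a one-unit increase in self-assessment, which is the quantity hypothesis 3 concerns.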


Independent Variables 


The argument that gender affects how students assess their task com- 
petence hinges on comparing females and males who are otherwise equal 
in relevant ways. It is especially important to control for other factors 
commonly associated with differences in educational attainment. For this 
reason, all models control for race and parental education. Gender is added 
as an independent explanatory dummy variable in models 1, 2, 4, and 5. 
Females constitute the reference category for the gender dummy variable. 
Gender and race are taken from students’ responses on the survey in- 
strument. Race is recoded as a series of dummy variables representing 
African-Americans, Hispanics, and Asians, with whites serving as the 
reference category. Parental education was recoded into years of education 
from a categorical variable where one parent or guardian was asked to 
describe the highest level of education she or he attained. If applicable, 
the parent answered the same question for her or his spouse or partner. 
Responses were averaged if data were available for two parents or 
guardians.14
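
The recoding steps described in this paragraph can be sketched in Python. This is an illustration under stated assumptions, not the author's procedure. The category labels and the years assigned to them in `YEARS_BY_CATEGORY` are hypothetical, since the text does not list the recoding scheme, and all function names are invented here.

```python
from typing import Dict, Optional

# Hypothetical mapping from education categories to years of schooling.
# The article does not report its recoding scheme; these values are assumptions.
YEARS_BY_CATEGORY: Dict[str, int] = {
    "less_than_high_school": 10,
    "high_school_graduate": 12,
    "some_college": 14,
    "college_graduate": 16,
    "graduate_degree": 18,
}


def parental_education_years(parent: Optional[str],
                             partner: Optional[str] = None) -> Optional[float]:
    """Average years of education over the parents or guardians with data."""
    years = [YEARS_BY_CATEGORY[c] for c in (parent, partner) if c is not None]
    return sum(years) / len(years) if years else None


def race_dummies(race: str) -> Dict[str, int]:
    """Race dummy variables, with white as the omitted reference category."""
    return {
        "african_american": int(race == "african_american"),
        "hispanic": int(race == "hispanic"),
        "asian": int(race == "asian"),
    }


def male_dummy(gender: str) -> int:
    """Gender dummy variable, with females as the reference category."""
    return int(gender == "male")


print(parental_education_years("high_school_graduate", "college_graduate"))
print(race_dummies("hispanic"), male_dummy("female"))
```

Leaving the reference category out of the dummy set means each race coefficient is read as a contrast with white students, and the gender coefficient as a contrast with females.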

“Tracking” is the process whereby students are separated by ability into 


13 It might seem more elegant to analyze the entire set of models in a structural equation
framework, with the mathematical and verbal self-assessment concepts serving as 
intervening variables. However, with dichotomous dependent variables, the assump- 
tion of normally distributed disturbances is untenable. Muthen (1984) proposed the 
CVM estimator that can be used to analyze categorical dependent variables in struc- 
tural equation modeling. However, this estimator makes the strong assumption that 
the categorical variable reflects the individual’s standing on an underlying, normally
distributed latent variable. This is not theoretically reasonable in this study. That is, 
I conceive of the choice to pursue a quantitative or nonquantitative major as inherently 
categorical. (See West, Finch, and Curran [1995] for more on the CVM estimator.) 

¹⁴ The NELS-88 data set also provides a composite index of parent’s education,
F2PARED, constructed from the parent education variable collected when the student
was in the twelfth grade. To avoid causal time ordering problems, I opted to construct
my own parent's education variable using data collected from parents when the student
was in the eighth grade; i.e., I did not want to have a variable from the twelfth grade
predicting the self-assessment outcomes collected at the tenth grade. As parents’ ed- 
ucation levels are highly stable over the student's high school years, models run using 
F2PARED in place of my parent’s education variable produced identical estimates. 
different classes to be taught the same school subject (e.g., honors-level
algebra I, academic-level algebra I, regular algebra I).¹⁵ Tracking has been
shown to both reflect and reproduce class and racial inequalities in the 
United States (e.g., see Oakes 1985). Additionally, and relevant to the 
current study, it is likely that when students make self-assessments of their 
mathematical or verbal competence, they compare themselves to others 
in their own mathematics or English classes. In this way, academic tracks 
represent ability reference groups within which students compare them- 
selves when making self-assessments. This suggests that grades should 
have the same effect on self-assessments of competence within various 
levels of academic track placement. For example, a student in a lower 
track math class earning a B might assess his mathematical competence 
as high as a student earning a B in a higher track math class does, even 
though ability was supposedly used to determine track placement. Ac- 
cording to the data used in the current study, females are slightly more 
likely to be in tracks associated with higher academic ability (see table 
1). Consequently, females also are more likely to assess their competence 
in comparison to members of a higher-ability reference group. Gender 
differences in track placement could, therefore, contribute to gender dif- 
ferences in self-assessments of task competence. 

A series of dummy variables was constructed to control for the track 
of high school math and English classes in which students were enrolled. 
A dummy variable representing the honors track and one representing 
the academic track were created for math and English classes using 
teacher descriptions.¹⁶ The general high school track serves as the
reference category. While both honors and academic classes are considered


¹⁵ I use the term “track” to refer to classes students are required to take. The decision
to place students in one track or another is usually made by counselors and teachers.
Input from students is rare (Oakes 1985). I control for track of required classes when
predicting the decision to enroll in calculus, a nonrequired, or elective, class. While
being in an honors- or academic-level math track has been shown to increase the odds
of choosing to enroll in calculus (Seymour and Hewitt 1997), most students, even those
in higher math tracks, do not choose to take calculus. Thus, the decision to enroll in
calculus, while influenced by track, represents a choice that students make and one 
that has occupational consequences. 

¹⁶ The teacher data provide two items about the level of each student’s classes. The
first is a measure of either the student's science or math class. I used this variable to
create the math-level dummy variables, even though in about half the cases I had only
the student’s science level. In a similar fashion, the English-level variable was created
from an item from either the student's English or social studies class. In collecting the
variables this way, the National Center for Educational Statistics relies on the high 
correlation between level of math and science (or English and social studies) taken. I, 
too, assume that in the absence of knowledge of math or English track, science and 
social studies serve as adequate proxies. 
higher ability levels than the general high school math or English class,
honors is generally thought to be the highest math or English track. 

In comparing the perceptions students make of their ability to perform 
a task, it is important to control for actual task ability. Task ability (in 
this case, math and verbal ability) was measured by averaging the eighth 
and tenth grade scores on the math or verbal standardized tests admin- 
istered by the Educational Testing Service as part of the NELS-88 survey. 
Test scores are on a 100-point scale. I use the average of eighth and tenth 
grade to provide a more stable measure of ability over time.¹⁷ Test scores
from the eighth and tenth grades of high school represent the most recent
scores available that do not create time-ordering problems in the models 
estimated. Not surprisingly, there is a high correlation between eighth 
and tenth grade test scores (0.88 for the mathematics tests and 0.81 for 
the verbal tests). While many have argued that these types of tests are 
culturally biased to the advantage of white, middle-class males, to the 
extent that this bias exists, it means that females and minorities with 
scores equal to that of white males would, if the bias were removed, 
actually have more ability than their white male counterparts. Thus, any 
cultural bias would make it harder to find statistically significant differ- 
ences in self-assessments due to gender or race. 

Of primary importance to this study is the effect of performance feed- 
back on students’ self-assessments of task competency. Mathematical 
and verbal performance feedback is measured by using the average math 
and English grades students received in high school. The NCES created 
these average grade variables by converting grades taken from students’ 
transcripts to a continuous 13-point scale (with “1” representing the 
highest grade). This conversion allows grades to be comparable across 
the various schools attended by students in the NELS study.¹⁸ I
converted these values to the commonly used 4.0 grade point average scale
(“4.0” being the highest “A” grade) to aid in interpretability. Therefore,
a one-unit change in the grade variable represents a grade change of 
approximately one letter grade. While grading scales or requirements 
may differ from one school to another, this variation is not problematic 
in this study since grades are used as an indicator of the feedback 
provided to students about how others (their teachers) assess their math- 
ematical (or verbal) competence. In other words, it is less important 


¹⁷ The longitudinal effects of eighth grade test scores on tenth grade test scores and
on the other independent variables were also modeled in the structural equation model. 
The results were highly similar to the models presented in the article using the average 
of eighth and tenth grade test scores. Further, I estimated the effects of the eighth 
grade scores on the dependent variables in models 4 and 5; these effects were found 
to be largely mediated through tenth grade scores.

¹⁸ Students were drawn from over 1,000 different schools.

whether or not grades actually measure ability or competence, but rather
that they are available to students and accepted by them as a legitimate
external evaluation of their competence.¹⁹

English grades and test scores are included in the math self-assessment 
models, and conversely, math grades and scores are included in the model 
with the verbal self-assessment dependent variable. This is done to make 
all models as comparable as possible. Further, it is likely that students 
make relative comparisons of the performance feedback they receive in 
various classes when assessing their ability and making career-relevant 
decisions. If students did not make relative comparisons, then the only 
performance information that would influence their mathematical self- 
assessments would be their mathematical grades and scores. However, 
others have shown that students do make these comparisons, classifying 
academic subjects, college majors, and even students within college majors 
as math/nonmath, hard/soft, “techy”/“fuzzy” (Seymour and Hewitt 1997;
Montell 1992). If students do compare their math and verbal feedback, 
and especially if they perceive competency in these two areas to be in 
tension with each other, then as students receive increasingly higher verbal 
feedback they might see themselves as less skilled at mathematics, as less 
of a “techy.” Specifically, higher English grades would dampen the math- 
ematical self-assessments students would otherwise make if they instead 
based their mathematical self-assessments solely on their mathematical 
performance.²⁰ Therefore, it is important to control for English
performance in the mathematical self-assessment models.


¹⁹ One reviewer asked whether the grade and self-assessment items actually represent
distinct concepts. Confirmatory factor analysis indicates that a model where the three 
math self-assessment items predict the latent variable “mathematical self-assessment” 
and the math grade item predicts the latent variable “math grades” produces a better 
fit than a model where the math grade variable is allowed to be a fourth measured 
variable predicting the latent variable “mathematical self-assessment.” This is also true 
for the verbal self-assessment items and English grade variable. The correlations be- 
tween the individual math self-assessment items and the math grade variable range 
from 0.40 to 0.47, and the correlations between the individual verbal self-assessment items
and the English grade variable range from 0.35 to 0.47.

²⁰ One possible explanation for this relationship is that with higher English grades
students are less likely to believe that their math grades are the result of possessing
specific skill or competency in mathematics. For example, students who receive high
grades in both English and math might perceive that they are generically good at
school, rather than specifically skilled at math or English. Or, consider the scenario
where two students receive equal grades in mathematics, both receive higher math 
grades than English grades, but one student earns better grades in English than the 
other student does. While both students earned better grades in math than they did 
in English, the student with lower English grades is presented with more of a contrast 
between the feedback she or he received in these two areas. She or he, therefore, might 
make higher assessments of her or his own math competence, even though both had 
equal math grades. 
Finally, the self-assessment items are added as independent variables
in models 4 and 5 to test the path persistence hypothesis (hypothesis 3). 
That is, I argue that if self-assessment of mathematical competence differs 
by gender, this differential perception should be at least partially respon- 
sible for the gender differences in the movement of males and females 
into activities relevant to the quantitative professions. To test this idea, 
mathematical self-assessment is allowed to predict persistence on the 
mathematical career path. By adding the mathematical assessment var- 
iable as an independent variable in the path persistence models, I expect 
that the magnitude of the gender effect on path persistence will decrease. 
The verbal self-assessment variable is also added to these models since 
students likely use relative assessments of their competence when making 
career-relevant decisions. 


RESULTS AND DISCUSSION 


Table 1 provides descriptive statistics for the males and females in the 
sample for whom there is complete data for each stage of the analysis.²¹
The cases were weighted using the appropriate longitudinal probability 
weights provided in the NELS-88 data set. Not surprisingly, there is a 
large gap between the number of males and females who elected a quan- 
titative college major. Compared with only 4% of females, 12% of males 
have majors in engineering, mathematics, or the physical sciences.²² A
smaller gender gap was found in calculus enrollment, where 10% of fe- 
males and 11% of males had enrolled in calculus by their senior year of 
high school (see the model 4 column of table 1). The means of the math- 
ematical assessment items do suggest that males are more likely than 
females to believe they are competent in math. This pattern emerges even 
though math grades and math test scores are very similar for males and 
females. Males do not appear to globally assess their competence higher, 


²¹ Missing data is less than 10% for each variable, with the exception of the calculus
enrollment variable, which was missing for approximately 13% of the cases. This 
variable was taken from students’ transcripts, and there is no reason to believe that 
there should be a pattern to the type of student on which this information was available 
or missing. To check for biases due to missing data, mathematical assessment models 
were run on the entire sample, including those who had missing data on some of the 
variables used in the analysis, using full information maximum-likelihood estimation 
(see Anderson [1957] for a discussion of ML estimation in the presence of missing 
data). No substantive differences in the relative magnitude or significance of estimates
were found. 

²² Due to the large sample size, virtually all differences in means between males and
females are significant. For this reason, I discuss the magnitude of the differences in 
this section and reserve discussion of significance for the multivariate models to follow. 
as the means for the English assessment items are higher for females than
for males. Finally, gender differences in the parental education, ethnicity, 
and math and English class-level variables appear minor, as would be 
expected. I now turn to the models designed to bring evidence to bear on 
the hypotheses offered. 


Gender Beliefs and Biased Self-Assessments 


The first hypothesis is that males assess their mathematical competence 
higher than females who perform at the same ability level and who receive 
the same feedback about their mathematical competence. Model 1 pro- 
vides a test of this hypothesis (see table 2). First, and not surprisingly, 
higher math grades and test scores increase the level of mathematical self- 
assessment. Thus, those with more mathematical ability, as measured by 
test scores, are more likely to believe that they are skilled at mathematics. 
Further, the more positive the feedback from legitimate others, in the 
form of math grades, the higher the level of mathematical self-assessment. 
This is not surprising since evidence of task competence should be highly 
salient as students make personal assessments of their skill at a given 
task.²³ However, controlling for these differences in ability and
performance feedback, males were found to assess their task competence 0.25
points higher than their female counterparts. Consistent with hypothesis 
1, males are more likely to perceive that they are good at math than are 
those females with equal math grades and test scores. 

Since the argument presented suggests that widely shared cultural
beliefs about gender and task competence bias males’ and females’
self-assessments of task competence, it is useful to evaluate the same model
for a set of tasks that have different gendered beliefs associated with 
them. Model 2 is identical to model 1, except the dependent variable is 
verbal self-assessment. As with the mathematical self-assessment model, 
differences in ability and performance feedback are controlled. Note that 
the parameter estimate for the male dummy variable is significant, 
but negative, meaning that females were found to make higher self- 
assessments of their verbal ability. This indicates that males do not glob- 
ally assess their task competence higher than females, regardless of the 
gender association of the task. Instead, cultural beliefs associated with a 
particular task or field of study bias students’ perceptions of their abilities
in that field. 


²³ One reviewer asked how much of the variation in mathematical self-assessment
was explained by the grade variable alone. The r-square for the mathematical self- 
assessment model with math grade as the only independent variable is 0.245, compared 
with 0.346 for the complete model presented in table 2. 
TABLE 2
MAXIMUM-LIKELIHOOD ESTIMATES OF THE EFFECTS OF GENDER AND OTHER
VARIABLES IN A STRUCTURAL EQUATION MODEL OF MATHEMATICAL AND VERBAL
SELF-ASSESSMENTᵃ

                                      MODEL 1ᵈ                MODEL 2ᵈ
                                      MATH SELF-              VERBAL SELF-
                                      ASSESSMENTᵉ             ASSESSMENTᵉ

INDEPENDENT VARIABLES             Coefficient    SE       Coefficient    SE
Male                                 .245*      .025        −.119*      .023
Math grades                          .861*      .019        −.325*      .017
Math test scores                     .076*      .002        −.001       .002
English grades                      −.393*      .020         .754*      .019
Verbal test scores                  −.039*      .002         .026*      .002
Class level:ᵇ
  Math honors                        .184*      .042         .175*      .038
  Math academic                      .056*      .030         .141*      .027
  English honors                    −.096*      .038         .060*      .035
  English academic                   .001       .029        −.044*      .026
Ethnicity:ᶜ
  Asian                              .116*      .064         .158*      .058
  African-American                   .586*      .039         .440*      .036
  Hispanic                           .077*      .043         .286*      .039
Parent's education (in years)       −.037*      .006         .002       .005

Source: NELS-88.

Note: Fit statistics: χ² = 1,858 (60 df); ρ1 (RFI) = .949; IFI (Bollen delta square) = .983; CFI =
.982; RMSEA = .047; R² = .346 for math assessment and .281 for verbal assessment.

ᵃ Estimates are based on 6,877 females and 6,624 males.

ᵇ Math and English class level variables are a series of (0-1) variables, with “the general high school
track” serving as the reference category.

ᶜ Ethnicity is measured as a series of (0-1) variables, with “white” serving as the reference category.

ᵈ Models 1 and 2 were estimated simultaneously and the residuals for the two assessment variables
were allowed to correlate.

ᵉ Mathematical and verbal self-assessment variables are latent variables constructed from 3 math and
3 verbal self-assessment items. The math items have factor loadings of .89, .88, and .87. The verbal items
have loadings of .82, .78, and .85. See text for the exact wording of the items.

* P < .05, one-tailed test.
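
The significance flags in the table can be approximated from each coefficient and its standard error. The sketch below is an illustration only: it assumes a large-sample normal approximation for the ratio of coefficient to standard error, which the article does not spell out, and the function name is hypothetical.

```python
from statistics import NormalDist


def one_tailed_p(coefficient: float, standard_error: float) -> float:
    """One-tailed p-value, in the direction of the coefficient's sign,
    under a standard normal approximation (an assumption of this sketch)."""
    z = abs(coefficient) / standard_error
    return 1.0 - NormalDist().cdf(z)


# Model 1, "Math academic": coefficient .056, SE .030 (flagged in the table).
p_math_academic = one_tailed_p(0.056, 0.030)

# Model 1, "English academic": coefficient .001, SE .029 (not flagged).
p_english_academic = one_tailed_p(0.001, 0.029)

print(p_math_academic < 0.05, p_english_academic < 0.05)
```

Under this approximation, a coefficient is flagged when its ratio to the standard error exceeds roughly 1.645.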


While it is beyond the scope of this article to attempt to extend the 
argument presented to understanding racial or ethnic differences in career 
choice, it is worth discussing what may appear to be surprising results. 
Controlling for track of math and English class, grades, test scores, and 
parents’ education, Asians, African-Americans, and Hispanics were found 
to assess both their math and verbal ability higher than whites. The 
estimate for the coefficient for the African-American dummy variable is 
especially notable for its magnitude. Others have shown that African- 
American students often have more positive attitudes toward education 
than whites (Portes and Wilson 1976; Catsambis 1994) and have higher 
self-concepts and higher self-esteem (Coleman 1966; Rosenberg and Simmons
1971). Portes and Wilson (1976) have argued that positive attitudes
towards self and school represent a real advantage to blacks in facilitating
educational success. Espousing positive attitudes may be a strategy that
African-American children learn from their families as a partial way of 
countering the impact of discrimination. The higher self-assessment values 
for African-Americans found in this study, then, appear consistent with 
earlier research. 

The puzzle is how African-Americans are able to maintain high
self-assessments in the face of dominant cultural beliefs that undoubtedly
disadvantage them. In other words, why does the cultural belief approach
advocated in the current study appear to hold for gender but not for race? 
It is again worth noting that students from different racial backgrounds 
grow up mostly in racially homogenous families, compared to the mixed- 
gender families in which most male and female youth are raised. Further, 
whereas males and females largely attend school together, members of 
different racial or ethnic groups are often segregated from each other into 
different kinds of schools. Growing up in mostly same-race families allows 
for the family to serve as a site of resistance to dominant cultural beliefs. 
Families can teach children alternative beliefs to counter hegemonic be- 
liefs. African-Americans, in particular, have been highly successful in de- 
legitimating hegemonic beliefs (see, e.g., MacLeod [1987] 1995). Being 
isolated in racially segregated schools further reduces the amount of daily 
exposure students have to hegemonic beliefs compared to what they would 
confront in more racially integrated schools. In contrast, alternative gen- 
der belief systems, such as the present day “girl power” phenomenon, are 
pitted daily against hegemonic gender beliefs. Thus, even individuals who 
are exposed to alternative gender beliefs in their families likely have daily 
exposure to hegemonic gender beliefs. 


Performance Feedback and Self-Assessments 


The results presented thus far show that males assess their mathematical 
competence higher than females of equal mathematical ability. But, as 
stated in hypothesis 2, since cultural beliefs about mathematics advantage 
males, performance feedback about their task competence should be less 
important to them in making self-assessments of their mathematical com- 
petence. Conversely, performance feedback, in the form of math grades, 
should have a larger impact on females’ self-assessments of their math- 
ematical ability, since they must contend with lower societal expectations 
of their mathematical competency. To test this hypothesis, I estimate the 
effects of the independent variables on mathematical self-assessment sep- 
arately for males and females. 

Comparing the male and female regression coefficients for the math 
grade variable (see model 3, table 3), the larger coefficient for females,
0.97 versus 0.74 for males, suggests that females do rely more on per- 
formance feedback about their task competency in making self-assess- 
ments. By constraining the math grade coefficients to be equal in the male 
and female samples and by comparing the fit of this model with the fit 
of the model where this constraint is relaxed, the gender difference in the 
strength of the effect of math grades on self-assessment was found to
be significant.²⁴ As hypothesized, feedback about their mathematical
competence has a significantly larger effect on the mathematical self- 
assessments of females compared to males. 
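
The nested-model contrast can be checked by hand. The sketch below is illustrative rather than the original procedure: it takes the chi-square difference of 31.6 on one degree of freedom reported in the accompanying footnote and uses the identity that the upper tail of a 1-df chi-square variate at x equals erfc(sqrt(x/2)). The function name is hypothetical.

```python
import math


def chi2_upper_tail_1df(x: float) -> float:
    """Upper-tail probability of a chi-square variate with 1 degree of freedom.

    A 1-df chi-square is the square of a standard normal, so
    P(X > x) = P(|Z| > sqrt(x)) = erfc(sqrt(x / 2)).
    """
    return math.erfc(math.sqrt(x / 2.0))


# Chi-square difference between the constrained and unconstrained models.
p_value = chi2_upper_tail_1df(31.6)
print(p_value < 0.001)
```

A p-value this small indicates that forcing the math grade coefficients to be equal for males and females fits significantly worse than letting them differ.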

Model 3 also shows that higher English grades and test scores actually 
lead to lower levels of mathematical self-assessment for both males and 
females. Recall that the mathematical self-assessment models control for 
the feedback students receive about their verbal competence. The logic 
behind this control is that it is likely that when students assess their 
mathematical competence, they make relative comparisons of the feed- 
back they receive in various school subjects. As mentioned earlier, some 
have suggested that students perceive competency in math and verbal 
areas to be in tension with one another (Seymour and Hewitt 1997; Montell 
1992). The negative regression coefficients for the English grade variable 
in the mathematical self-assessment models, and the negative coefficient 
for the math grade variable in the verbal self-assessment model (model 
2), support the idea that students use performance information in relative 
ways when assessing their specific task competence. 

As with math grades, the effect of English grades on mathematical self- 
assessments is significantly larger for females. (However, the effect is more 
negative for English grades and was more positive for math grades). That 
is, while higher English grades lead to lower self-assessments of their 
mathematical ability for all students, controlling for math grades and test 
scores, the negative effect is larger for females. While the gender difference 
in the effect of math grades on mathematical self-assessment was hy- 
pothesized, the significant gender difference in the effect of English grades 
was not. One explanation for this unexpected finding is that when indi- 
viduals are in a situation where societal expectations of their task com- 
petence are lacking, they pay heightened attention to all performance 
feedback in the general environment, in this case the environment of 
school. Therefore, the process of making relative comparisons of various 
types of feedback when assessing one’s own competence at a specific task, 


²⁴ In SEM modeling, regression coefficients can be constrained to be equal, and a nested
model contrast can be made between the constrained and unconstrained model. In 
this case, the significant chi-square of 31.6 with one degree of freedom suggests that 
the more constrained model produces a significantly worse fit. 
TABLE 3
MAXIMUM-LIKELIHOOD ESTIMATES OF THE EFFECTS OF MATH GRADES AND OTHER
VARIABLES IN A STRUCTURAL EQUATION MODEL OF MATHEMATICAL SELF-ASSESSMENT:
A COMPARISON OF GENDER DIFFERENCES IN THE REGRESSION COEFFICIENTSᵃ

                                  MODEL 3: MATHEMATICAL SELF-ASSESSMENT

                                       Females                 Males
INDEPENDENT VARIABLES             Coefficient    SE       Coefficient    SE
Math grades                          .967*      .027         .739*      .026
Math test scores                     .078*      .003         .072*      .003
English grades                      −.490*      .029        −.282*      .028
Verbal test scores                  −.040*      .003        −.036*      .003
Class level:ᵇ
  Math honors                        .193*      .059         .178*      .059
  Math academic                      .016       .042         .087*      .041
  English honors                    −.129*      .052        −.059       .055
  English academic                  −.024       .041         .032       .039
Ethnicity:ᶜ
  Asian                              .107       .090         .128       .089
  African-American                   .623*      .056         .532*      .055
  Hispanic                           .033       .060         .142*      .061
Parent’s education (in years)       −.030*      .008        −.044*      .008

Source: NELS-88.

Note: Fit statistics: χ² = 505 (48 df); ρ1 (RFI) = .971; IFI (Bollen delta square) = .994; RMSEA
= .038; R² = .361 for females and .315 for males.

ᵃ Estimates are based on 6,877 females and 6,624 males.

ᵇ Math and English class level variables are a series of dummy variables, with “the general high school
track” serving as the reference category.

ᶜ Ethnicity is measured as a series of dummy variables, with “white” serving as the reference category.

* P < .05, one-tailed test.


a process which occurs for all students, is amplified under conditions 
where societal performance expectations are lacking. Widely shared
cultural beliefs about gender and mathematics create this type of
environment for female students.²⁵


Biased Self-Assessments and Career-Relevant Decisions 


Gender differences in self-assessments of competence at mathematics be- 
come important if they can be linked to gender differences in early career- 
relevant decisions, in this case, the decision to persist on the path leading 


²⁵ As further support of this idea, verbal self-assessment models were also estimated
separately for males and females, and no gender differences were found in the strength
of the effects of math or English grades on verbal self-assessments. Recall verbal tasks 
are not clearly associated with gender differences in societal expectations. While higher 
English grades lead to higher verbal self-assessments and higher math grades lead to 
lower verbal self-assessments, indicating again that students use performance feedback 
in relative ways, no gender difference was found in the strength of the effect of either 
type of grade on verbal self-assessment. Results are available on request.
to careers in the quantitative professions. In model 4a, I examine the
impact of gender on enrolling in high school calculus; and in model 4b, 
I add the self-assessment variables to assess their impact on the gender 
regression coefficient (see table 4). In model 4a, the coefficient for the male 
dummy variable is significant and positive. Converting the coefficient into 
an odds ratio, we see that the odds of enrolling in calculus are 1.23 times
as high for males as for their otherwise equal female counterparts. Importantly,
since mathematical ability is controlled, the gender difference in calculus 
enrollment cannot be attributed to superior mathematical ability that 
some claim males possess (Peng and Jaffe 1979; Rudisill and Morrison 
1989; Benbow and Stanley 1980, 1983; Kolata 1980). 
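
The conversion from a logistic regression coefficient to an odds ratio is a simple exponentiation. The sketch below is an illustration using the model 4a male coefficient of .211 from table 4; the function name is hypothetical.

```python
import math


def odds_ratio(logit_coefficient: float) -> float:
    """Convert a logistic regression coefficient (log odds) to an odds ratio."""
    return math.exp(logit_coefficient)


# Model 4a, male dummy variable: coefficient .211 (table 4).
male_odds_ratio = odds_ratio(0.211)
print(round(male_odds_ratio, 2))  # 1.23
```

Note that the result is a ratio of odds, not of probabilities; when enrollment is relatively rare, as it is here, the two are close but not identical.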

In model 4b, I add the self-assessment variables to the equation. As 
hypothesized, higher levels of mathematical self-assessment increase the 
odds of enrolling in calculus. For every additional point of mathematical 
self-assessment, the log odds increases by 0.39. The regression coefficient 
for the verbal self-assessment variable is negative, suggesting that stu- 
dents’ decisions to enroll in calculus are based on a relative comparison 
of their assessments of their math and verbal skills. That is, at any level 
of mathematical self-assessment, the higher the perception of verbal 
ability, the lower the odds that a student will enroll in calculus, and 
vice-versa. Since grades and test scores are controlled, self-assessments 
of task competence are shown to have an effect on career-relevant de- 
cisions over and above actual ability. Of primary interest, the coefficient 
for the male dummy variable is no longer significant, indicating that 
the effect of gender on calculus enrollment is, at least partially, the result 
of gender differences in perceptions of mathematical competence. Put 
another way, when males and females perceive themselves to be equally 
mathematically competent, they are equally likely to enroll in calculus.²⁶


²⁶ Since the gender gap in calculus enrollment has narrowed in recent years and is
smaller than the gender gap in some advanced elective science classes, such as physics
(AAUW 1992), one reviewer suggested that I model physics enrollment, rather than
calculus enrollment. I therefore reran models 4a and 4b, changing the dependent
variable from calculus enrollment to physics enrollment. I could not, however, change
the mathematical self-assessment variable to a science assessment variable because no
such variable exists in the NELS data. (The self-assessment items are only asked with
regard to math and verbal skills). The results of the physics enrollment models are
qualitatively similar. That is, higher mathematical self-assessment contributes to an
increased likelihood of enrolling in physics, and gender differences in mathematical 
self-assessment partially explain this difference. However, the magnitude of the effect 
of mathematical self-assessment on physics enrollment is smaller than for calculus 
enrollment. This is not surprising, as we would expect math self-assessment to have 
an impact on math enrollment more than it would on science enrollment. Even though 
the calculus gender gap is relatively small, it is especially important for women to 
enroll in calculus if they are to persist on the path to careers in engineering, math, 
and science, as will be shown in model 6. 
TABLE 4
MAXIMUM-LIKELIHOOD ESTIMATES OF THE EFFECTS OF GENDER AND OTHER
VARIABLES IN A LOGISTIC REGRESSION MODEL OF CALCULUS ENROLLMENTᵃ

                                      MODEL 4A                MODEL 4B

INDEPENDENT VARIABLES             Coefficient    SE       Coefficient    SE
Male                                 .211*      .105         .138       .121
Math grades                          .594*      .105         .195*      .118
English grades                       .344*      .097         .585*      .107
Math test scores                     .161*      .011         .143*      .011
Verbal test scores                  −.0011      .013         .011       .013
Class level:ᵇ
  Math honors                       1.11*       .186        1.12*       .187
  Math academic                      .478*      .187         .503*      .188
  English honors                     .627*      .141         .684*      .142
  English academic                   .370*      .152         .377*      .153
Ethnicity:ᶜ
  Asian                              .701*      .144      [illegible]   .146
  African-American                   .930*      .239         .877*      .242
  Hispanic                           .380       .263         .395       .263
Parent's education (in years)        .037       .023         .047*      .023
Constant                          −15.5*        .676      −16.5*        .705
Math self-assessment                                         .392*      .050
Verbal self-assessment                                      −.109*      .051

Source: NELS-88.

Note: Fit statistics: χ² = 1,208 (13 df) for model 4a and 1,245 (15 df) for model 4b; log likelihood =
−2,232 for model 4a and −2,183 for model 4b; likelihood ratio χ² test of model improvement (model 4a
vs. 4b) = 98 (2 df).

ᵃ Estimates are based on 5,876 females and 5,681 males.

ᵇ Math and English class level variables are a series of dummy variables, with “the general high school
track” serving as the reference category.

ᶜ Ethnicity is measured as a series of dummy variables, with “white” serving as the reference category.

* P < .05, one-tailed test.


This suggests that gender differences in the self-assessment of task com- 
petence are at least partially responsible for the differential movement 
of males and females along the path to the quantitative professions. 
To further test the path persistence hypothesis, I examine the impact 
of gender and mathematical self-assessments on selecting a quantitative 
college major (see table 5).* In model 5a, converting the male dummy 


™ The sample analyzed in model 5 is limited to those who attended college, thereby 
introducing the possibility of sample selection bias. To test for this potential bias, I 
estimated a Heckman probit model using version 6 of the Stata statistical software 
package. This model estimates maximum likelihood probit models with sample
selection (see Van de Ven and Van Praag [1981] for more on this model). No significant
difference was found between the model with sample selection and that without,
thereby indicating that the selection mechanism does not introduce bias into the
estimates produced (ρ = .180; P = .737). This model is presented in the appendix.
Comparing the probit coefficients for the model estimated with and without selection, 
TABLE 5
MAXIMUM-LIKELIHOOD ESTIMATES OF THE EFFECTS OF GENDER AND OTHER
VARIABLES IN A LOGISTIC REGRESSION MODEL OF CHOICE OF QUANTITATIVE MAJOR (a)

                                        MODEL 5A               MODEL 5B
INDEPENDENT VARIABLES              Coefficient    SE      Coefficient    SE
Male                                  1.35*      .129       1.26*       .128
Math grades                            .578*     .097        .318*      .109
English grades                        -.222*     .109       -.071       .120
Math test scores                       .062*     .010        .047*      .010
Verbal test scores                    -.018*     .0094      -.0096      .0099
Class level: (b)
  Math honors                          .367*     .186        .363*      .184
  Math academic                        .138      .151        .140       .152
  English honors                       .038      .167        .080       .167
  English academic                    -.057      .142       -.043       .142
Ethnicity: (c)
  Asian                                .229      .213        .228       .220
  African-American                    1.08*      .205       1.02*       .208
  Hispanic                             .040      .267        .056       .264
Parent's education (in years)         -.071*     .025       -.063*      .025
Constant                             -5.86*      .536      -6.22*    [illegible]
Math self-assessment                                         .274*      .057
Verbal self-assessment                                      -.093*      .052

NOTE: Fit statistics: χ² = 289 (13 df) for model 5a and 294 (15 df) for model 5b; log likelihood =
-1,599 for model 5a and -1,580 for model 5b; likelihood ratio χ² test of model improvement (model 5a
vs. 5b) = 38 (2 df).

(a) Estimates are based on 3,539 females and 3,085 males.

(b) Math and English class level variables are a series of dummy variables, with “the general high school
track” serving as the reference category.

(c) Ethnicity is measured as a series of dummy variables, with “white” serving as the reference category.

* P < .05, one-tailed test.



variable coefficient to an odds ratio, we see that males are 3.86 times 
more likely to choose a quantitative major than are their otherwise equal 
female counterparts. 
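
The conversion from a logistic regression coefficient to an odds ratio used here is simply exponentiation of the coefficient. The following is a minimal sketch in Python, not the author's own procedure (the analyses in this article were run in Stata); the function name odds_ratio is an illustrative assumption. Strictly speaking, the resulting ratio applies to odds rather than to probabilities.

```python
import math


def odds_ratio(logit_coefficient: float) -> float:
    """Convert a logistic regression coefficient (a log odds) to an odds ratio."""
    return math.exp(logit_coefficient)


# Coefficient for the male dummy variable in model 5a (table 5).
male_coef_5a = 1.35

print(f"{odds_ratio(male_coef_5a):.2f}")  # prints 3.86
```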

In model 5b, I add the self-assessment variables as independent vari- 
ables. As predicted in hypothesis 3, higher levels of mathematical self- 
assessment increase the odds of choosing a quantitative major. For every 
additional point of mathematical self-assessment, the log odds increases 
by 0.27. This suggests that all students need to develop a personal con- 
ception of themselves as skilled at mathematics if they are to move toward 
a career in a quantitative profession. As with the calculus model, higher 
verbal self-assessments decrease the odds of choosing a quantitative major, 


we can see that while some estimates change, the estimates for the variables of primary
theoretical importance to the current study (the male dummy variable and the
assessment variables) differ only slightly.
indicating that students use relative understandings of their competencies
when making career-relevant decisions. Further, the magnitude of the 
gender coefficient does decrease with the inclusion of the mathematical 
assessment variable, although the difference is small. This small effect, 
combined with the larger effect of mathematical self-assessment on cal- 
culus enrollment, suggests that gender differences in mathematical self- 
assessment contribute to the gender disparity in the decision to pursue a 
quantitative career.” 

In fact, others have argued that calculus enrollment might be especially 
important for females in choosing a quantitative major (Seymour and 


3 The regression coefficients for the African-American dummy variable may seem to 
contradict established findings about ethnicity and educational attainment. For ex- 
ample, model 4 indicates that African-Americans are more likely than whites to enroll 
in high school calculus. However, this effect is largely the result of the controls in the 
model. If math grades and test scores are removed from the model, African-Americans 
are less likely than whites to enroll in calculus (odds ratio of 0.67). This suggests that
the factors that continue to contribute to the lower grades and test scores African- 
American students receive are probably also behind their lower enrollment in advanced 
math classes. By contrast, when grades and test scores are removed from the model, 
males continue to be more likely than females to enroll in calculus (odds ratio of 1.16). 
Likewise, in model 5, African-Americans who attend college were found to be more 
likely than whites to choose a quantitative major. When all variables except for the 
race and gender dummy variables are removed from the model, the coefficient for the 
African-American dummy variable remains positive and significant, although its mag- 
nitude is decreased to 0.47. Model 5 was estimated for students enrolled in two- and 
four-year colleges. When the model was estimated only for those students who were 
attending a four-year college, the African-American coefficient was no longer found 
to be significant. However, the lack of significance still means that African-Americans 
are just as likely as whites to select a quantitative major. Others have found that, 
while in the past minority students were less likely to major in engineering or the 
sciences, this gap has closed in recent years (Brown 1994). Astin (1993) found that 
approximately equal proportions of black, white, and Hispanic students chose a science, 
math, or engineering major upon entering college, and the proportion of Asian students 
was significantly higher. In an extensive ethnographic study of students who were 
either majoring in science, math, and engineering or had switched out of one of these 
majors, Seymour and Hewitt (1997) found that children from working-class or minority 
backgrounds reported being encouraged to select careers that would provide them with 
a secure future. Engineering was seen as “synonymous with success” (p. 325). This 
study showed that the extra encouragement that minority children received to enter 
these fields from their families and teachers did contribute to their decisions to choose
a quantitative major. However, African-Americans and Hispanic students dropped out 
of quantitative majors at a significantly higher rate than whites or Asians, citing as a 
main reason that they had chosen a major “inappropriate” for them (p. 324). Race
often becomes more salient for African-Americans and Hispanics in college than it 
was in high school since their racial or ethnic group is usually more of a numerical 
minority in college than in high school. That the choice of an engineering major began 
to seem “inappropriate” only once in college suggests that cultural beliefs about racial 
or ethnic groups have an impact on career-relevant decisions, but the influence occurs 
later in the career-choice process when race and ethnicity are more salient. 
TABLE 6
MAXIMUM-LIKELIHOOD ESTIMATES OF THE EFFECTS OF
CALCULUS ENROLLMENT IN A LOGISTIC REGRESSION MODEL OF
CHOICE OF QUANTITATIVE MAJOR

                                         MODEL 6
INDEPENDENT VARIABLES              Coefficient    SE
Male                                  1.39*      .139
Math grades                            .273*     .099
English grades                        -.112      .107
Math test scores                       .028*     .0095
Verbal test scores                    -.011      .0086
Class level: (b)
  Math honors                          .138      .161
  Math academic                        .141      .129
  English honors                       .104      .142
  English academic                    -.083      .119
Ethnicity: (c)
  Asian                               -.038      .154
  African-American                     .839*     .165
  Hispanic                            -.225      .190
Parent's education (in years)         -.072*     .022
Constant                             -5.86*      .536
Math self-assessment                   .231*     .047
Verbal self-assessment                -.094*     .047
Calculus enrollment                   1.17*      .197
Calculus enrollment x male            -.352*     .172

SOURCE: NELS-88.

NOTE: [beginning of note illegible] ... ratio χ² test of model improvement (model 6 vs. 5b) = 46 (2 df).

(b) Math and English class level variables are a series of dummy variables, with “the general high school
track” serving as the reference category.

(c) Ethnicity is measured as a series of dummy variables, with “white” serving
as the reference category.

* P < .05, one-tailed test.


Hewitt 1997; McIlwee and Robinson 1992). If this is true, then gender 
differences in mathematical self-assessment, which at least partially ex- 
plained the gender gap in calculus enrollment, become even more con- 
sequential. To evaluate this argument with the data used in the current 
study, I estimated a logistic regression model of the effects of the calculus 
enrollment variable, the male dummy variable, and the interaction of 
these two variables on the likelihood of choosing a quantitative major 
(see model 6, table 6). All other independent variables from models 4 and 
5 were also included. The significant negative interaction term indicates 
that the effect of calculus enrollment on choosing a quantitative major is 
larger for females. Converting the calculus regression coefficient to an 
odds ratio, we see that females who enrolled in high school calculus are 
3.22 times more likely to choose a quantitative major than females who
did not take calculus. Males who enrolled in calculus are only 2.27 times 
more likely than males who did not take calculus to choose a quantitative 
major.” In sum, enrolling in calculus has a significantly larger influence 
on the decision of females to choose a quantitative major. However, since 
females make relatively lower self-assessments of their mathematical abil- 
ity, they are also less likely than their equal ability male counterparts to 
enroll in calculus. In this way, gender differences in mathematical self- 
assessment further widen the gender gap in quantitative college majors. 
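
The two odds ratios quoted above follow from the model 6 coefficients for calculus enrollment (1.17) and for the calculus enrollment by male interaction (-.352). The sketch below, written in Python with illustrative names that are assumptions rather than anything from the original analysis, shows the arithmetic: for females the interaction term drops out, while for males it is added to the main effect before exponentiating.

```python
import math

# Coefficients from model 6 (table 6).
calculus_coef = 1.17       # effect of calculus enrollment when male = 0
interaction_coef = -0.352  # calculus enrollment x male


def calculus_odds_ratio(male: int) -> float:
    """Odds ratio for calculus enrollment, for females (male=0) or males (male=1)."""
    return math.exp(calculus_coef + interaction_coef * male)


print(round(calculus_odds_ratio(0), 2))  # females: 3.22
print(round(calculus_odds_ratio(1), 2))  # males: 2.27
```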

In sum, the results presented provide support for the supply-side model 
presented. Gender beliefs about task competence bias the assessments 
individuals make of their own competence at mathematics, and these 
biased assessments differentially influence the decisions males and females 
make to persist on the path toward careers in science, math, and engi- 
neering. Since males tend to overestimate their mathematical competence 
relative to females performing at the same ability level, they are more 
likely to choose careers in the quantitative professions. Importantly, gen- 
der differences in the selection of activities leading to these careers are 
not the result of actual differences in ability or merit, but they are instead 
the result of biased perceptions of competence. 


SUMMARY AND CONCLUSION 


The purpose of this study was to develop and test a simple supply-side 
model about gender differences in early career-relevant decisions. To test 
the hypotheses generated, I focused on high school students’ perceptions 
of their mathematical competence and how these perceptions, controlling 
for actual ability, influenced decisions to persist on the path leading to 
careers in the quantitative professions. Since these professions have re- 
mained stubbornly male dominated (Hanson 1996), they represent an 
important site from which to examine the constraining effects of gender 
beliefs on career decisions. 

The results of this study show that males assess their own mathematical 
competence higher than their otherwise equal female counterparts. Males 
are more likely than females with the same math grades and test scores 
to perceive that they are mathematically competent. Males were not found 
to assess their competence at verbal tasks higher than females, demon- 
strating that males do not globally assess their competence higher for all 
tasks, regardless of the task’s gender association. Instead, widely shared 


P The inverse log of (1.17 — .352) is 2.27. 
cultural beliefs about gender and task competence bias the perceptions
individuals have of their own task ability. 

Further, the effect of performance feedback on self-assessments was 
found to differ by gender. Math grades had a significantly larger positive 
effect on the mathematical self-assessments for females than for males. 
Proof of their competence could be more important to females because 
they must contend with lower societal expectations of their mathematical 
ability. More generally, this suggests that the appraisals individuals make 
of their own competence at various tasks are more contingent on local 
evidence when societal expectations for success are lacking. 

Importantly, self-assessments of task competence were found to influ- 
ence career-relevant decisions, even when controlling for commonly ac- 
cepted measures of ability. For males and females, the higher they rate 
their mathematical competence, the greater the odds that they will con- 
tinue on the path leading to careers in the quantitative professions. How- 
ever, since males tend to overestimate their mathematical competence 
relative to females, males are also more likely to pursue activities leading 
down a path toward a career in science, math, and engineering. While 
standardized test scores and high school grades are certainly imperfect 
measures of ability, I contend that they are likely more accurate measures 
of ability than are the self-perceptions of high school students. Therefore, 
these results suggest that those who persist on a mathematical career path 
may not even be the best qualified for careers requiring mathematical 
proficiency. In other words, boys do not pursue mathematical activities 
at a higher rate than girls do because they are better at mathematics. 
They do so, at least partially, because they think they are better. 

This study’s major contribution is to highlight one mechanism by which 
cultural beliefs about gender constrain the early career-relevant choices 
of men and women. The model presented focuses on how gender beliefs 
bias self-perceptions of competence. This focus is important since pre- 
sumptions of competence often legitimate inequality in achievement- 
oriented societies. The model proposed was evaluated with respect to the 
quantitative professions. However, my eventual goal is to highlight more 
generally how gender beliefs associated with various career-relevant tasks 
bias individuals’ perceptions of their competence at those tasks and, consequently,
influence their commitment to different career trajectories. Anytime widely shared
cultural beliefs about a task advantage males, the self-assessments males make of
their competence in regard to the task should be higher, on average, than those of
females performing at the same objective level. If the task is also one that is
thought to be necessary for
movement along a particular career path, then males, acting on upwardly 
biased appraisals of their task competence, should be more likely than 
females to continue on the path leading to that career. Conversely, if a 
task belief advantages females, such as beliefs associated with nurturing
skills or abilities, females should assess their task competence higher than 
males. A gender difference in the self-assessment of nurturing ability, for 
example, might partially explain the gender gap in professions such as 
nursing. The model could also be used to explain gender differences in 
the choice of specialties within a field, assuming that tasks believed to be 
necessary to pursue various specialties are associated with clear and stable 
cultural beliefs about gender. For example, if it is widely believed that, 
to be successful, surgeons have to be able to maintain emotional distance, 
and men are thought to be better than women at maintaining emotional
distance, then the model could be employed to understand the continued 
male dominance of the specialty of surgery within the field of medicine. 
As these examples suggest, while the quantitative professions have been 
extreme in their resistance to the entry of women, there is no reason to 
think that they are the only professions that have skills that are both
thought to be necessary and have gender beliefs associated with them. 
These required skills might either be less “critical” for entrance into a 
given profession or less stereotyped than mathematical skills are for the 
quantitative professions. This, however, would only serve to dampen, not 
eliminate, the effect shown. Career-relevant decisions are made at many 
points throughout the life cycle. Therefore, even small gender differences 
occurring at decision-making junctures can serve to carry males and fe- 
males in substantially different occupational directions. 

The cultural belief approach advocated in this article can only explain 
differences in the supply networks of workers for occupations that have 
clear and stable gender beliefs in the culture. Since many occupations and 
specialties within occupations do have stable, widely available cultural 
beliefs associated with them, the argument presented here is relevant for 
understanding the continued gender gap in a wide array of occupations. 
However, newly emerging types of jobs quickly become gender labeled 
(Game and Pringle 1983), and the gender typing of particular kinds of 
work changes over time. The processes by which gender beliefs about 
tasks associated with different kinds of work emerge and change are not 
fully understood. Given the importance of these processes for reproducing 
gender inequality, this is fertile ground for future research. 

The results of this study demonstrate that widely shared cultural beliefs 
attached to various tasks affect not only how individuals are channeled 
into particular activities and subsequent career trajectories by others, but 
also how individuals “self-select” into occupationally relevant activities. 
This implies that the gender-segregated labor force will be reproduced 
partially through the different and seemingly voluntary choices men and 
women make. Any attempts to counter the effects of gender beliefs on 
gender segregation and inequality in the labor force, therefore, will require 
looking beyond how stereotypes are used by gatekeepers, such as teachers
and employers, and focusing also on how gender beliefs affect males’ and
females’ perceptions of their own abilities at crucial decision-making
junctures.


APPENDIX 


TABLE A1
A COMPARISON OF THE ESTIMATES OF THE EFFECTS OF GENDER AND OTHER
VARIABLES IN A PROBIT REGRESSION MODEL OF CHOICE OF QUANTITATIVE MAJOR,
ESTIMATED WITH AND WITHOUT SAMPLE SELECTION (a)

                                   MODEL 5B WITHOUT      MODEL 5B WITH
                                   SELECTION             SELECTION
INDEPENDENT VARIABLES              Probit Coefficient    Probit Coefficient
Male                                   .624*                 .615*
Math grades                            .158*                 .149*
English grades                        -.033                  .0013
Math test scores                       .022*                 .023*
Verbal test scores                    -.0065                -.0058
Class level: (b)
  Math honors                          .132*              [illegible]
  Math academic                        .072               [illegible]
  English honors                       .085               [illegible]
  English academic                    -.044               [illegible]
Ethnicity: (c)
  African-American                     .461*              [illegible]
  Hispanic                            -.093               [illegible]
Parent’s education (in years)         -.031*              [illegible]
Constant                             -2.90*               [illegible]
Math self-assessment                   .120*              [illegible]
Verbal self-assessment                -.050*              [illegible]

[SE columns illegible in source.]

SOURCE: NELS-88.
NOTE: ρ = .180 (χ² = .113 [1 df], P = .737).

(a) Model 5b “with selection” is a standard Heckman probit model which provides maximum-likelihood
estimates with sample selection. Model 5b “without selection” is simply a probit regression model. Model
5b without selection is based on 11,557 cases; 4,931 of these cases are censored in the model with selection.

(b) Math and English class level variables are a series of dummy variables, with “the general high school
track” serving as the reference category.

(c) Ethnicity is measured as a series of dummy variables, with “white” serving as the reference category.

* P < .05, one-tailed test.
This article examines women’s employment patterns during the
child-rearing period and the consequences of those patterns for earnings
later in life, in 12 industrialized countries. This study proposes
an analytic framework that combines “welfare regime” and gender- 
specific policies to explain country differences. The findings pre- 
sented here suggest that institutional arrangements mediate the costs 
to women’s part-time and intermittent employment. Within welfare 
regimes, employment continuity is highest among countries in which 
the state provides support for working mothers. Furthermore, this 
study finds that lower support for mothers’ employment is associated 
with higher wage penalties to employment discontinuity. 


INTRODUCTION 


Women’s employment patterns along their life cycle are related to family 
events and, in particular, to the presence and age of children. Many women 
tend to drop out of the labor force after giving birth and to return to full 
employment only later in life. In recent years, however, a growing number 
of young mothers have maintained continuous labor market activity, 
either by maintaining the jobs they held before giving birth or by shifting 


* Earlier versions of the article were presented at the ECSR Conference on Rational 
Action Theory, Stockholm, Sweden, October 16-19, 1997; the international conference 
on The Welfare State at Century's End: Current Dilemmas and Possible Futures, 
sponsored by Tel Aviv University and the Lavon Institute for Labour Research, Tel 
Aviv, Israel, January 5—7, 1998; and the American Sociological Association 

San Francisco, August 21-25, 1998. This research was supported by a grant from the 
G.I.F., the German-Israeli Foundation for Scientific Research and Development.
into part-time employment (Spain and Bianchi 1996; Main 1988; Stier
1998; Sundstrom 1992). 

The periodic separation of women from work is seen by proponents of 
human capital theory as the major determinant of the gender wage gap 
and of occupational sex segregation (Polachek 1975; Mincer and Ofek 
1982; Duncan and Prus 1992). This is mainly because intermittent em- 
ployment is assumed to result in skill atrophy, which, in turn, reduces 
productivity and thus wages. Workers who interrupt their employment 
are also perceived by employers as having lower commitment to work. 
Consequently, they are less likely to gain access to the high-paying, more 
attractive jobs (Gronau 1988; Stafford and Sundstrom 1996). 

Studies that focused on the consequences of women’s employment pat- 
terns along the life cycle for their labor market rewards have adopted, 
for the most part, a microlevel approach. While this approach explains 
many aspects of the labor market, it generally ignores institutional and 
normative arrangements that structure women’s work and that may me- 
diate the effect of intermittent or part-time employment on market out- 
comes. In recent years, a growing number of studies have focused on 
comparing women’s employment situation across industrialized countries 
(e.g., Rosenfeld and Kalleberg 1990; Gornick and Jacobs 1996, 1998; Gor- 
nick, Meyers, and Ross 1998; Hansen 1995; Blau and Kahn 1995). These 
studies document the variation in market behavior and market conse- 
quences in relation to gender issues and emphasize the importance of the 
institutional context within which women operate, for understanding 
labor market outcomes at the individual, as well as societal, level. 

Our study aims to probe the variations across countries in women’s
employment patterns along the life course and in the earnings consequences of those patterns.
We introduce an explanatory framework that links the country differences 
to structural arrangements associated with distinct welfare regimes. Com- 
parison of countries, which represent different welfare regimes and labor 
market arrangements, permits us to explore variations in market returns 
to women’s employment pattern. In particular, we focus on the effect of 
institutional settings on earning outcomes of part-time and intermittent 
employment during the child-rearing years. In the process, we also identify 
the social-political contexts in which “penalties” for reduced employment 
of women (i.e., working part-time instead of full-time during child-rearing 
years) are minimized. 

We start by outlining the variety of institutional contexts in which 
women are employed, as they are manifest in 12 industrialized countries, 
including Sweden, Norway, the Netherlands, Germany, Austria, Italy, 
Israel, Britain, Australia, New Zealand, Canada, and the United States. 
We then introduce our expectations concerning the pattern of women’s 
work and their outcomes in the different settings. This is followed by an 
analysis of women’s work behavior, the costs associated with the different
employment patterns, and the structural characteristics that are related 
to them. 


THE CONTEXT OF WOMEN’S MARKET ACTIVITY 


Several social institutions are said to influence employment in general and 
women’s employment in particular. Brinton (1988), for example, focused 
on the role of the family, the educational system, and the labor market 
in mediating processes of decision making and achievements at the in- 
dividual level. Others pointed to the importance of market regulations by 
state policies, or the interaction between the market and the welfare state, 
as the major institutions that affect employment outcomes (see, e.g., Blossfeld
1997; DiPrete et al. 1997). To the extent that these institutions and
their interrelationships vary from one country to another, the context in 
which women’s employment takes place will differ substantially. 

Addressing the institutional context in generalized terms, Esping- 
Andersen (1990) has proposed a typology of welfare regimes that pertains 
to country differences in social policies regarding citizen rights and the 
organization of work. The current study uses this typology as a point of 
departure. According to Esping-Andersen’s typology, three models of wel- 
fare regimes can be differentiated: the social-democratic welfare state, the 
liberal welfare state, and the conservative-corporatist welfare state. The 
first model is characterized by a universalistic approach to social rights, 
a high level of decommodification, and an inclusion of the middle class 
in social programs. The liberal model, at the other extreme, provides only 
limited social insurance. Its social programs are directed mainly toward 
the working class and the poor, and means-tested assistance is prevalent. 
In the conservative-corporatist welfare regime, social principles prevail 
in most areas, based, however, not on egalitarian standards but rather on 
eligibility dependent upon social statuses (mainly family, class, and relig- 
ion) and traditions. 

Each of the welfare models represents a distinct relationship between 
the state and the market. The liberal regime is based on the predominance 
of the market. Its basic ideology is that economic well-being is best 
achieved through the unfettered operation of the market (i.e., via market 
wages). State involvement takes place only when the market fails (Esping- 
Andersen 1990; Gustafsson 1994). In the social-democratic regime, on the 
contrary, the state is fully responsible for assuring the welfare of its citizens 
irrespective of market forces in general and the citizens’ own market 
activity in particular. Indeed, the social-democratic welfare state aims to 
buffer individuals from market uncertainties and to weaken the links 
between market and life chances. The conservative regime represents a
third model in which the state, the market, and other institutions—mainly 
the family and the church—share responsibility for citizens’ welfare. Wel- 
fare rights are not universal but rather depend on particularistic statuses. 

In his discussion of women’s employment, Esping-Andersen focused 
on the extent of female labor force participation and the magnitude of 
sex-segregation in the labor market. He argued that female labor market 
participation would vary across the three welfare regimes. The differences 
are related mainly to the variation in structure of the labor market and, 
particularly, the size of the public sector. More specifically, Esping- 
Andersen anticipated a high rate of female labor force participation in 
the social-democratic countries, a somewhat lower rate of economic ac- 
tivity for women in the liberal countries where the market controls the 
demand for labor, and a low rate of participation in the conservative 
countries, in which women are marginal to the economy. While Esping- 
Andersen did not discuss explicitly the employment behavior of women 
along their life course, some expectations can be derived using the un- 
derlying principles of the three welfare regimes. In the following section 
we outline these expectations, regarding three main topics: women’s par- 
ticipation in paid employment, part-time employment, and the pattern of 
work along the life course and its economic consequences. 


Women’s Participation in the Labor Market 


In all industrial countries, women still bear the major responsibility for 
child rearing, independent of welfare regime and specific family and child 
policies. In fact, none of the public policies (not even in the most egalitarian 
countries) has been effective enough to change the household division of 
labor between the genders (Sainsbury 1996). Thus, family obligations
and especially the presence of children restrict women’s involvement in 
the economy. The rates of women’s labor force participation, however, 
and especially their pattern of work, differ in accordance with welfare 
regime and are affected by specific family and gender-oriented policies. 
In a society based primarily on “market principles,” such as the United 
States, market criteria determine who will work and who will take care 
of the children. Having a stronger position in the market and being free 
of child-care responsibilities (normatively, as well as practically), men are 
expected to allocate most of their time to labor market activity. Women 
are constrained in their time allocation decisions and are compelled to 
weigh the costs and benefits of market activity against household re- 
sponsibilities and obligations (Gronau 1977; Becker 1981). Consequently, 
a strong selection of women into the market is anticipated, based on 
expected rewards and on their orientation toward work (Blakemore and 
Drake 1996; Hansen 1995). Child-care arrangements, which free women
to allocate more of their time to the labor force, largely depend in the 
United States on private initiatives and thus are subject to forces of supply 
and demand (Gustafsson 1994) and could further enhance the selection 
into paid employment. 

In a social-democratic welfare regime, the “social regulation” overrides 
market principles, and the state is committed to increasing equality among 
all citizens. Under this regime, there are basically two ways for women 
to gain equality with men. One way is to participate in the labor force 
in a similar manner to that of men. In such a case, the state assumes 
responsibility for the children, mainly by providing state-subsidized child- 
care facilities, and enforces gender-equality measures in the labor market. 
This is what Esping-Andersen (1999) defines as “de-familialization.” The 
anticipated consequence is a high participation rate of women, and es- 
pecially mothers, in the labor force. Nonetheless, a universal benefit sys- 
tem for parents, while aimed to enhance women’s labor force participation 
opportunities, may also promote the incorporation of less committed work- 
ers into the labor force (Hansen 1995). This has implications for the types 
of jobs women will choose and explains in part the higher concentration 
of women in the public sector and in female-dominated occupations (Han- 
sen 1995). 

A second route to improving women’s economic position and protecting
their economic independence is for the state to provide monetary remu- 
neration for child care (and housework). This is done through family- 
related policies such as maternity and paternity leave (for pay), child 
allowances, tax relief for children, and so on (Gauthier 1996). According 
to this latter strategy, women are not necessarily encouraged to participate 
in the market, but rather they are compensated by the state for the time 
invested in child care. 

The conservative-corporatist regime supports a traditional division of 
labor between the genders, both normatively and institutionally. Although 
a certain degree of decommodification does take place, there is no attempt 
on behalf of the state to eliminate gender inequalities. Rather, the expec- 
tation is that the traditional family, with the male breadwinner, will pro- 
vide for all family members (Esping-Andersen 1999). The state intervenes 
(through public assistance programs) when the family fails. Women in 
these countries are perceived as the main caretakers of children and fam- 
ilies, and family-related policies, including the tax regime, encourage 
women to withdraw from (or limit their involvement in) market activity, 
especially when they have young children. This portrayal is very much 
in line with Sainsbury’s (1994, 1996) “breadwinner model.” 


Women’s Part-Time Employment


Part-time employment is an important means of incorporating women 
into the market in all countries, irrespective of welfare regime. Part-time 
employment is perceived by women as a way to cope with their multiple 
roles as mothers and workers. Employers too see advantages in women’s 
part-time employment since it reduces absenteeism and increases their 
flexibility (Beechey and Perkins 1987; Duffy and Pupo 1992). Nonetheless, 
the three welfare regimes differ in the types of jobs open to part-time 
employees and in the public perception of part-time employment. 

Within the liberal welfare regime, part-time employment constrains 
women’s ability to gain access to the better jobs, which grant high earnings 
and opportunities for career advancement (Beechey and Perkins 1987; 
Duffy and Pupo 1992; Sundstrom 1992). This is mainly because part-time 
employment is concentrated in a limited number of occupations and in 
dead-end, temporary jobs. Furthermore, part-time employment is gen- 
erally taken to signal lower commitment to work than full-time employ- 
ment (Kishler and Alexander 1987). As a result, women who work part- 
time are often denied entry into positions of authority and responsibility. 
We expect, therefore, that in countries with liberal welfare regimes, where 
there is strong selection of women into paid employment, only a minority 
of women will be in part-time employment. 

The situation differs in other welfare regimes. The social-democratic 
welfare regime is committed to promoting women’s labor force partici- 
pation. In this framework, part-time employment is offered as a way to 
maintain the continuous involvement of mothers in the labor force. It is 
assumed that part-time employment during early motherhood is a tran- 
sitional stage, and women will return to full-time employment. Concom- 
itantly, employment conditions in part-time jobs are similar to those in 
full-time employment. This includes employment benefits, union protec- 
tion, access to good jobs, and an easy transition into full-time employment 
later in life (Sundstrom 1997). 

Part-time employment is of special significance under the conservative 
welfare regime. It is perceived as a solution to the incorporation of mothers 
into the labor force, without altering the gendered division of labor. Since 
in these societies women are expected to give higher priority to their 
parental role, part-time employment has a more permanent nature and 
is highly concentrated in female-type, secondary sectors of employment. 
Because of the dominance of social principles in the conservative regime, 
however, part-time workers enjoy employment benefits and union pro- 
tection similar to full-time workers. High rates of part-time employment 
among women, especially among young mothers, are therefore expected 
in these societies. 


Continuous Employment and Its Economic Consequences


The variation among welfare regimes in women’s labor force participation 
rates and their involvement in part-time employment has implications for 
their lifelong pattern of work and its economic consequences. Continuous 
employment is expected primarily in countries characterized by social- 
democratic welfare regimes, which are committed to enhancing gender 
equality through market work. Transition into part-time employment, 
during certain periods of life, is likely to have few negative consequences,
since such a mode of employment is fully supported by labor market
institutions and is not taken as an indication of a lack of commitment
to work. Thus, part-time employment is an alternative to employment
interruption, and it operates more as a “bridge” to full-time
employment later in life than as a “trap” in marginal employment (Natti 
1995). Indeed, work interruption is discouraged and is therefore expected 
to be penalized both in terms of women’s ability to return to their previous 
job and in terms of earnings. 

Employment patterns along the life course in countries with conser- 
vative welfare regimes are expected to differ from the patterns discussed 
above. First, higher rates of intermittent employment are expected, since 
women’s traditional roles are promoted and only limited market oppor- 
tunities (especially for young mothers) are offered. Those who will return 
later in life to market activity will most likely find part-time, secondary 
employment. Work interruption, however, may not be costly to women, 
since most mothers work in secondary jobs, which do not offer high 
economic rewards. Also, due to a structured labor market and the op- 
eration of social principles in market rewards, the penalties for noncon- 
tinuous full-time employment will be low. 

In the liberal welfare regime, market opportunities for women will affect 
their pattern of work. Those who have lower wages and higher costs of 
child care will interrupt their employment; those who can “afford” child 
care will continue their employment. In this case, part-time jobs provide 
an inadequate solution, as we argued above. Women who do not change 
their occupation concomitant with the move into part-time employment 
can preserve their skills and move later on to full-time employment. On 
the other hand, those who change occupations will be trapped in part- 
time (mainly secondary) jobs and will not be able to improve their position 
in the labor force in the long run. In general, in market-based political 
economies, work interruptions are expected to result in high economic 
costs, both because interruptions involve skill atrophy and because em- 
ployers interpret it as a signal regarding the employee’s commitment to 
work (Duncan and Prus 1992; Mincer and Ofek 1982). A stringent selec- 
tion of women into full-time employment is expected to contribute, as 
well, to inequality among women in the economic rewards associated with
work behavior. 


COUNTRY VARIATION IN FAMILY AND WORK POLICIES 


Esping-Andersen’s typology was criticized by feminist writers for its 
rather exclusive focus on the relationship between the market and the 
state, while the family and the gendered division of labor were addressed 
only partially (Daly 1994). It was argued that although his analysis was 
concerned with the employment trajectories of men and women, Esping- 
Andersen neglected to discuss the relationship between the state and 
gender-specific policies and orientations. Lacking, in particular, is a 
systematic treatment of the division of labor and women’s role in the 
family and how these affect women’s employment (Bussemaker and Kers- 
bergen 1994; Daly 1994; Sainsbury 1994, 1996; O’Connor 1996). In his 
recent writings, however, Esping-Andersen (1999) considers the state/fam- 
ily relationships more systematically by introducing the concepts of famil- 
ialization and defamilialization to differentiate between countries in which 
traditional family dependencies still prevail and those in which the role 
of the family is minimized. His main conclusion, however, is that this 
distinction largely corresponds to the threefold welfare regimes.²

The welfare typology notwithstanding, considerable country variation 
in women’s employment exists within each of the three regimes. Countries 
differ, for example, in the general orientation toward women’s work. 
Norms and attitudes will affect how women structure their life-course 
employment pattern, deciding whether to work when children are young 
and whether to take full-time employment. Similarly, economic factors, 
such as the cost of living and men’s level of earning (e.g., the ability of 
a household to maintain the aspired standard of living with only one 
paycheck) may affect women’s employment choices. In this article, we 
focus on another institutional aspect that affects women’s employment 
pattern, that is, family-oriented policies and practices. These do not nec- 
essarily overlap with the more general labor market policies enacted by 
states. Specifically, within each regime, countries differ with regard to 
policies concerning mothers’ employment, partly as a result of perceptions 
regarding female employment and partly because of different historical 
and contextual developments. 

In a recent comparative study of family policies, Gornick, Meyers, and 


2 According to Esping-Andersen’s (1999) analysis, social-democratic countries have
achieved a high level of defamilialization, while conservative countries preserved the 
role of the male breadwinner and the dependency of women on the family rather than 
on the state. 



Ross (1997) examined policies that support mothers’ employment in 14 
OECD countries. Their study demonstrated considerable variation in em- 
ployment-related policies concerning women within each welfare cluster. 
For example, based on several indicators of policy (i.e., maternity leave 
policies, child-care arrangements for various ages of children, school 
schedule, etc.), they categorize Sweden as having a highly encouraging 
policy for mothers’ employment, while Norway, another social-democratic 
country, is ranked at a lower level. A lower level of support for mothers’ 
employment is also apparent in the Netherlands, where the welfare state 
was modeled largely along the lines of the social-democratic regime (Pfau- 
Effinger 1998). Taking yet another example, Canada’s policies regarding 
female participation in the labor market are more extensive and suppor- 
tive compared to other countries with liberal welfare regimes. Differences 
are also evident within the conservative cluster, with France and Italy 
exhibiting a highly supportive policy for mothers’ employment and Ger- 
many and Austria providing a lower level of support. 

In this article, we adopt Gornick, Meyers, and Ross’s (1996) categorization
of countries according to the level of their employment-supportive
policy.³ We add this dimension to the welfare regime typology in order
to enhance our understanding of women’s employment patterns and their 
expected economic consequences. By way of summary, we present in table 
1 the distribution of the 12 countries included in our study along the lines 
of welfare regimes and employment-supportive policies. The two dimen- 
sions are clearly related but do not fully correspond to one another. Ac- 
cordingly, most countries with a liberal welfare regime have a low level 
of state support of mothers’ employment. Only one country in this regime 
(Canada) has an intermediate level of support, and no country falls into 
a category of liberal welfare regime and high support for mothers’ em- 
ployment. In countries with conservative or social-democratic welfare 
regimes, mothers’ employment receives intermediate or high support from 
the state. Correspondingly, none of these countries fall into the “low sup- 
port” category. 
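
The cross-classification summarized in table 1 can also be written as a simple lookup, which makes the empty cells easy to verify. The sketch below is illustrative only: it is written in Python, the names `TABLE_1` and `countries_in` are hypothetical and do not come from the original study, and the cell assignments are those described in the text.

```python
# Table 1 as a mapping: country -> (welfare regime, level of support
# for mothers' employment). Names are illustrative assumptions; the
# assignments follow the description in the text.
TABLE_1 = {
    "Sweden": ("social democratic", "high"),
    "Norway": ("social democratic", "intermediate"),
    "Netherlands": ("social democratic", "intermediate"),
    "Israel": ("conservative", "high"),
    "Italy": ("conservative", "high"),
    "Germany": ("conservative", "intermediate"),
    "Austria": ("conservative", "intermediate"),
    "Canada": ("liberal", "intermediate"),
    "United States": ("liberal", "low"),
    "United Kingdom": ("liberal", "low"),
    "Australia": ("liberal", "low"),
    "New Zealand": ("liberal", "low"),
}


def countries_in(regime, support):
    """Return the sorted list of countries in one cell of the table."""
    return sorted(
        country
        for country, (r, s) in TABLE_1.items()
        if r == regime and s == support
    )


# Example: the only liberal country with intermediate support.
print(countries_in("liberal", "intermediate"))
```

Querying a cell such as `countries_in("liberal", "high")` returns an empty list, which matches the observation that the two dimensions are related but do not fully correspond.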

Our study aims to explore this intraregime variation by examining 


3 According to their scheme, countries were categorized into three levels of employment- 
supportive policies: high, intermediate, and low. They constructed an index based on 
several indicators of public parental leave schemes (e.g., legislated job protection, weeks 
of paid maternity leave, wage replacement rate, coverage of policy, availability of 
extended leave, and paternity benefits) and child-care policies (including tax relief for 
child care, guaranteed child care for age groups 0-2 and 3—5, child care i 
percentage of children ages 0-2 in publicly funded child care, percentage of children 
3-5 in publicly funded child care, percentage of children age 5 in preprimary school, 
and percentage of children in public after-school programs). For a detailed discussion 
of this index, see Gornick et al. (1996, 1997). 


TABLE 1
CATEGORIZATION OF COUNTRIES BY WELFARE REGIME AND THE LEVEL OF SUPPORT
FOR MOTHERS’ EMPLOYMENT


                       LEVEL OF SUPPORT TO MOTHERS’ EMPLOYMENT
WELFARE REGIME         Low                Intermediate        High

Social democratic                         Norway,             Sweden
                                          the Netherlands
Conservative                              Germany, Austria    Israel, Italy
Liberal                United States,     Canada
                       United Kingdom,
                       Australia,
                       New Zealand


women’s employment pattern along their life course and the economic 
costs associated with interrupted employment and transition to part-time 
employment. Making use of the two dimensions—welfare-state regimes, 
on the one hand, and policies supportive of female employment, on the 
other hand, we derive hypotheses concerning these costs. Our main ex- 
pectations regarding the employment pattern of women in each institu- 
tional arrangement and the costs associated with intermittent employ- 
ment—either a transition to part-time employment or exit from the labor 
force—are outlined below. 

In general, we expect high continuity of employment in the social- 
democratic countries and a fast return to full employment. We also expect 
lower earning penalties to (long-term or short-term) shifts into part-time 
employment but higher penalties to employment interruptions (with- 
drawal from the labor force). In conservative countries, we expect a rel- 
atively higher level of employment transition to part-time or out of the 
labor force, since women are not generally encouraged to participate in 
the labor market when family demands are high. Consequently, the pen- 
alties for employment interruption and for a transition to part-time will 
be low. Finally, the highest level of continuous full-time employment
and the highest levels of penalty to deviation from this pattern are ex- 
pected in countries with liberal welfare regimes. 

Welfare regime notwithstanding, the variation in policies supporting 
mothers’ employment implies that within each welfare regime the em- 
ployment pattern and employment consequences will differ. In countries 
with high levels of support for mothers’ employment, the penalty asso- 
ciated with the transition into part-time jobs will be low, since this form 
of employment maintains women’s attachment to the labor market. Em- 
ployment interruption, however, is a less agreeable behavior in this context 
and will entail a higher level of penalty. An intermediate level of support 
suggests a lack of appropriate conditions that facilitate mothers’ employment
continuity. In this case, it is expected that women will either reduce
their involvement in the labor market (i.e., move into part-time jobs) or 
completely withdraw from employment when they have young children. 

Applying these expectations to the classification in table 1, we hypoth- 
esize higher continuity of employment in Sweden compared to Norway 
and the Netherlands and a faster return to full-time employment. This is 
so since these social-democratic countries differ in the level of institutional 
support given to mothers’ employment. We further expect a lower earn- 
ings penalty in Sweden for a transition into part-time employment com- 
pared to the two other countries. In the three countries, nonetheless, 
women will be penalized similarly for employment interruptions. 

In the group of countries with conservative regimes, German and Aus- 
trian women are expected to interrupt their employment (for a short or 
a long period) more often than Israeli or Italian women, since the former 
have lower institutional support for employment. However, German and 
Austrian women are less likely than women in Italy or Israel to face high 
penalties for their employment interruptions or for a transition to part- 
time employment. Variation in employment patterns and the penalties 
associated with them are expected also among the liberal countries. Since 
Canada has the highest support for mothers’ employment, Canadian 
women are expected to work continuously, either full- or part-time, and 
to have lower penalties for their work interruptions than women in other 
countries with liberal welfare regimes. 


DATA AND MEASUREMENTS 


For the purpose of the cross-national analysis, we use data obtained from 
the ISSP 1994 survey on “family and changing gender roles” (Zentralarchiv
fuer Empirische Sozialforschung 1997). The ISSP (International Social
Survey Program) is conducted as a collective effort by participating 
nations aimed at ensuring high comparability. There were 24 countries 
that participated in the “gender-role” module. Within each of the countries, 
a national representative sample of the adult population was drawn, and 
respondents were presented with identical questionnaires. Our analyses 
are based on 12 of the 24 countries included in the ISSP “family and 
gender-role” module. The selection of countries was based on theoretical 
and empirical reasons. We decided to exclude all former-communist coun- 
tries from the analysis because the relationship between family and 
women’s employment pattern was historically embedded in the socialist 
regime and is still in a state of flux. Japan and the Philippines were
also excluded because they do not easily fit into the welfare-regime typology.
Among the European countries, Spain and Ireland were not included
due to data problems: the Spanish data had no information on
either the extent of work (full- or part-time) or on weekly working hours. 
Thus it was impossible to include Spanish women in the analysis of wage 
penalties. In the Irish sample, too few women participated in the market. 

In addition to attitude items, respondents were presented with an ex- 
tensive set of questions on past and present family and work status. This 
unique feature of the data allows us to compare, in the most reliable way, 
women’s employment patterns and their consequences. To measure 
women’s employment pattern, we used information concerning women’s 
work activity during different stages of their family life cycle. These in- 
cluded information on work behavior after marriage but before having 
children, when preschool children were at home, when the children were 
in school, and when the children left home. 

In the present analysis, we focus on employment status during two 
periods that represent high family demand on women’s time: when women 
had preschool children and when they had school-age children. Accordingly,
we limited the sample to include only women who went through
these phases.⁴ For each of these periods, female respondents reported
whether they were employed full-time, part-time, or were out of the labor 
force. Based on this information, we constructed the employment-pattern 
variable. Since there are three possible states at every family life-course 
stage, there are theoretically nine possible employment patterns. Due to 
empty cells, we collapsed the nine possible combinations into five mean- 
ingful employment patterns: continuous full-time employment, continuous 
part-time employment, continuous nonemployment, those who worked 
part-time (including very few who did not work) in the first period but 
worked full-time in the second, and those who did not work in the first 
period and worked part-time in the second. This typology will be put to 
use, both as a dependent and as an independent variable, in the different 
parts of the analysis. 
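
The collapsing rule described above can be sketched as a lookup over the two periods. This is a minimal illustration in Python under stated assumptions: the labels and the function name `employment_pattern` are hypothetical, and the three combinations not named in the text (full-time to part-time, full-time to nonemployment, and part-time to nonemployment) are left unassigned and returned as `None`, since the article says only that empty cells motivated the collapse.

```python
# Employment states reported for each period (labels are illustrative).
FT, PT, NE = "full-time", "part-time", "not employed"

# (state with preschool children, state with school-age children)
# -> one of the five patterns named in the text.
PATTERNS = {
    (FT, FT): "continuous full-time",
    (PT, PT): "continuous part-time",
    (NE, NE): "continuous nonemployment",
    (PT, FT): "part-time to full-time",
    # The "very few who did not work" in the first period but worked
    # full-time in the second are grouped with the part-time starters.
    (NE, FT): "part-time to full-time",
    (NE, PT): "nonemployment to part-time",
}


def employment_pattern(preschool, school_age):
    """Return the five-category pattern, or None if the text assigns none."""
    return PATTERNS.get((preschool, school_age))


# Example: a woman out of the labor force with preschool children who
# worked part-time once the children were in school.
print(employment_pattern(NE, PT))
```

Of the nine theoretical combinations, six map onto the five patterns and three remain unassigned in this sketch.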

The analytic section of the article comprises two parts: first we examine 
women’s pattern of work, utilizing multinomial logistic regression models 
in which the employment-pattern serves as a dependent variable and the 
welfare and policy regimes serve as the main independent variables. The 
models control for women’s age and education. In the second part, we 
evaluate the effect of the employment-pattern along the life course on 
current earnings, within each institutional context. Earnings are measured 
in country-specific currency and in different units (e.g., monthly income, 


4 Women who are still in the second stage of child rearing (i.e., they have school-age
children at home) were also included in our final sample. 
annual income).⁵ To achieve comparability and to overcome the differences
in the unit of measurement, we standardized the earnings using a
measure of relative earnings. Accordingly, for each country, we calculated 
the relative distance of respondent’s income from work from the maxi- 
mum earning reported in the country-specific sample (for a discussion of 
this procedure, see Gornick et al. 1996). In addition to the employment- 
pattern, the wage equations include as control variables: education, age, 
hours of work, whether the woman is in a professional, managerial, or 
technical (PTM) occupation, and whether she has a position of authority 
at work. Due to the selective nature of women’s employment and es- 
pecially the differential selectivity across countries, we employ the Heck- 
man (1979) estimation procedure in order to control for sample selectivity 
when estimating the earnings of women in the labor force.⁷
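
Two of the steps described above lend themselves to a brief sketch. The Python code below is an illustration under explicit assumptions, not the estimation actually used in the study. First, the "relative distance" from the country-specific maximum is interpreted here as the simple ratio of a respondent's earnings to that maximum; the exact formula is not given in the text. Second, the function `inverse_mills` computes the standard correction term of the Heckman two-step approach, the ratio of the normal density to the normal distribution function evaluated at a fitted probit index. The probit fit itself is omitted, and all function and variable names are hypothetical.

```python
from statistics import NormalDist


def relative_earnings(earnings_by_country):
    """Scale each respondent's earnings by the country-specific maximum.

    Assumption: relative earnings = earnings / max earnings in the
    country sample, which removes currency and unit differences.
    """
    scaled = {}
    for country, values in earnings_by_country.items():
        top = max(values)
        scaled[country] = [v / top for v in values]
    return scaled


def inverse_mills(z):
    """Inverse Mills ratio, pdf(z) / cdf(z), for a fitted probit index z.

    In a two-step selection correction this term is added as a
    regressor in the earnings equation for women in the labor force.
    """
    standard_normal = NormalDist()
    return standard_normal.pdf(z) / standard_normal.cdf(z)


# Example with made-up figures in two different currencies and units.
sample = {"A": [10, 20, 40], "B": [500, 1000]}
print(relative_earnings(sample))
print(inverse_mills(0.0))
```

Because the scaling is done within each country, the resulting values are unit-free and can be compared across samples that report monthly or annual income.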


FINDINGS


Family Life Cycle and Women’s Employment


We begin our analysis with a description of women’s employment status 
at three points in time: (1) after marriage but before children, (2) when 
the children were young (preschool children), and (3) when the children 
were in school. Figure 1 presents the employment patterns for the 12 
countries included in the study. The first panel of figure 1 refers to 
women’s employment patterns before they had children. The striking 
feature of these data is the similarity across countries. With the exception 
of the Netherlands, and to a lesser extent Italy, the majority of women 
in all countries reported working full-time prior to having children. Swe- 
den and Canada exhibit the highest figures, with 76% and 81%, respec- 
tively. Dutch women, by way of contrast, have an exceptionally low level 
(44%) of full-time employment prior to having children. 

About a third of all women in Italy and the Netherlands, and over a 
quarter of the women in Germany, Australia, the United States, and 
Norway were not in the labor force before having children. The figures 


5 Germany and Israel were the only countries to provide information on net monthly
income. 


6 Education was measured as an ordinal variable including the seven ordered categories
provided by each country. Alternative formulations (e.g., dummy categories of school
level, years of schooling) yielded similar results. 

7 The probit equation for the selection model included age, education, marital status,
whether the woman ever worked while having children, and two attitudinal indicators.
The latter items were based on the extent of agreement with the following statements:
“A preschool child is likely to suffer if his or her mother works” and “Family life 
suffers when the woman has a full-time job.” 
for other countries, however, were substantially lower. Correspondingly,
only a minority of women in all countries held a part-time job. It is
important to note that country disparities in part-time employment and
no employment prior to having children do not necessarily indicate differences
in women’s disposition toward labor force participation. The
observed employment behavior could be affected by other factors, such
as school enrollment, which may be more prevalent at this stage of life
in some countries than in others. We will address this point subsequently.

Employment patterns when family demands are high are probably more
revealing with regard to the constraints on women’s labor force behavior 
than their experience early in the family life cycle. For this reason, we 
examine country differences in the employment patterns of women with 
preschool children. These are presented in the second panel of figure 1. 
There is a dramatic shift in women’s employment patterns with the arrival 
of preschool children in the family. This shift gives occasion to the emer- 
gence of large country variation. Over one-third of women with preschool 
children in Israel, Italy, the United States, Austria, and Canada (in that 
order) engaged in full-time employment. In all other countries, however, 
less than 20% engaged in full-time employment, with the lowest level 
found in the Netherlands (4.5%) and New Zealand (6%). Concomitantly, 
the Netherlands, Germany, Australia, and New Zealand have the highest 
levels of nonemployment when mothers have preschool children (over 
60%), followed closely by Britain, Norway, and Austria (above 50%). 

The lowest levels of nonparticipation were found in Israel (31%) and 
Sweden (40%). Sweden also had the highest rate of part-time employment 
(43%), followed by Norway and Israel. It is noteworthy that in five of 
the twelve countries (United States, Canada, Israel, Italy, and Austria) 
most women who participated in paid employment during this stage of 
the family life cycle worked on a full-time basis. With the exception of 
Austria, these are countries in which the overall rate of employment 
among mothers of preschool children remained rather high. 

For the sake of illustration, we contrast the employment distribution,
before and after having children, in the Netherlands, Sweden, and Israel. 
In the Netherlands, the proportion of women working part-time remained 
unchanged (comparing the top and the middle panels), whereas the pro- 
portion of women working full-time dropped to less than 5%. In fact, the 
overwhelming majority of women withdrew from the labor force when 
they had young children. By way of contrast, the most noticeable feature 
in the Swedish data is the decline in the proportion of full-time employment
and the substantial increase in part-time employment when women have
preschool children. A certain proportion of women is out of the labor 
force during this phase of the life cycle, but the proportion is not nearly 
as high as in the Netherlands. Israel exhibits yet another pattern. Although 
part-time employment grows when mothers have preschool children, a 
large proportion of women are employed full-time during this phase. Con- 
comitantly, only a small portion of mothers does not participate in the 
labor force. 

The finding that stands out in the third panel of figure 1 is the return 
of women to the labor force when children are of school age, albeit with 
a different mix of full- and part-time employment. If we compare the top 
and bottom panels, we find that the magnitude of nonparticipation is only 
slightly greater in the third phase, but the extent of part-time employment
is considerably higher in most countries. Nonetheless, country differences 
in women’s employment patterns remain substantial. When we compare 
the second and the third panels, we find that the shift into full-time 
employment is especially salient in the United States, where 54% of the 
women with school-age children worked full-time (compared to 35% in 
the previous phase of the family life cycle). No significant change, however, 
is evident in the rate of part-time work between the two time periods. 
Israel shows a very similar pattern—an increase in full-time but very little 
change in part-time employment. 

In Sweden, part-time as well as full-time employment increased as most 
women returned to labor market activity. A similar shift took place in 
most other countries (except for Italy), albeit with a varying proportion 
remaining out of the labor force. Italy is the only country in which the 
pattern of female employment remained as it was in the second period. 
Although in most countries, then, the labor force participation of women 
was higher when children were of school age, they clearly did not return 
to the level of economic activity they had prior to having children. 

The number of women out of the labor force during this third stage of 
the family’s life cycle was considerably lower than in the previous stage. 
Nonetheless, half of all Dutch and German women and over a third of 
Austrian, Italian, and Australian women were still out of the labor force. 
Swedish women show the lowest level of nonparticipation with only 12% 
of all mothers of school-age children being out of the labor force. In all 
other countries, the rate of nonparticipation ranged between one-fifth and 
one-quarter of all mothers of school-age children. 


Transitions and Emerging Employment Patterns 


To this point, we have considered the static features of female employment 
during different points in the family life cycle, from a comparative per- 
spective. Such a perspective is incomplete, however, since it fails to capture 
the dynamic aspect of actual shifts in labor market position, which is 
necessary for an understanding of different paths that women take. To 
address this point, we examine the shift or stability in employment states 
at the microlevel. We focus on two points in time—when women had 
preschool children and when the children were in school. 

As pointed out in the data and measurement section, there are five 
meaningful employment patterns: continuous full-time employment, a 
shift from part-time to full-time, continuous part-time employment, a shift 
from no employment to part-time employment, and continuous nonem- 
ployment. The distribution of women across the five-category employment 
pattern is presented in figure 2. The frequencies of the various patterns 
Fig. 2. Distribution of employment pattern along the life course by country 


are presented as a stacked bar for each of the countries, with the pattern 
of full-time employment in both phases at the bottom, followed by other 
employment arrangements, and ending with no labor force participation 
in either period at the top. 
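
The five-category coding described above can be sketched as a simple lookup on a woman's employment state in the two phases. This is a minimal illustration, not the authors' coding procedure: the state names ("full", "part", "nilf") and the function name are hypothetical, and combinations the text does not name (for example, full-time followed by part-time) are returned as unclassified because this section does not say how they were handled.

```python
from typing import Optional

# Category labels follow the row labels used in the paper's tables.
# Keys are (state with preschool children, state with school-age children).
PATTERNS = {
    ("full", "full"): "Full-Full",  # continuous full-time employment
    ("part", "full"): "Part-Full",  # shift from part-time to full-time
    ("part", "part"): "Part-Part",  # continuous part-time employment
    ("nilf", "part"): "NILF-Part",  # shift from no employment to part-time
    ("nilf", "nilf"): "NILF-NILF",  # continuous nonemployment
}


def classify_pattern(preschool: str, school_age: str) -> Optional[str]:
    """Return the employment pattern for the two family life-cycle phases.

    Combinations not named in the text return None (an assumption of this
    sketch, not a rule stated by the authors).
    """
    return PATTERNS.get((preschool.lower(), school_age.lower()))


print(classify_pattern("nilf", "part"))
print(classify_pattern("full", "part"))
```

Keeping the mapping in a dictionary makes the five patterns explicit and leaves any further combinations to be added once their treatment is known.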

Taken as a whole, the figures suggest that the relationship between the 
institutional arrangements (i.e., the welfare regime and the level of 
employment-supportive policies) and employment patterns is not 
straightforward. The highest level of full-time continuous employment is 
observed in Israel and Italy (two conservative countries with high support 
for mothers’ employment) and in the United States and Canada (liberal 
countries that differ in their level of employment support). They are 
followed by Austria, another conservative country, but with a lower level of 
employment-support policies. Women in the United States and Canada also return to 
full-time employment sooner than do other women. Thus, about half of 
all women in these two countries worked full-time by the time their 
younger child went to school. The lowest level of continuous full-time 
employment is noted in the Netherlands and in New Zealand, which, 
again, fall into two different categories of institutional arrangements. 

Continuous part-time employment is most prevalent in Sweden, Nor- 
way, Israel, and in two of the liberal countries—the United Kingdom and 
New Zealand. These countries also exhibit an early return from nonem- 
ployment to part-time (a pattern prevalent in the Netherlands and Aus- 
tralia as well). Separation from market work is highest in the Netherlands 
and Germany, followed closely by Austria, three countries with an 
intermediate level of support for mothers’ employment. The rate of 
long-term withdrawal from market work is also high in Italy, which has more 
supportive employment policies. Countries that represent the liberal model 
exhibit similar rates of female nonparticipation in the labor market. These 
rates are higher than in Scandinavian countries but lower than the rates 
common in nations with a conservative welfare regime. 

Although the variation across countries in women’s employment pat- 
terns shows only partial correspondence to the institutional contexts in 
which women’s work is organized, the analysis of change in employment 
states along the family life cycle reveals an important relationship between 
welfare regimes and women’s employment. In Israel, Italy, and, to a lesser 
extent, Austria—all with conservative welfare regimes—we find large 
proportions of women in continuous full-time employment and continuous 
nonemployment. These are mutually exclusive categories, which implies 
heterogeneity of the female population. This mirrors the institutional du- 
ality of these countries, where there is a strong family orientation but also 
social policies supportive of women’s work. This distinguishes Israel and 
Italy from Germany (and probably the Netherlands), where there is little 
support for female employment and indeed a very high proportion of 
women in continuous nonemployment. 

In the social-democratic countries—specifically, Norway and Swe- 
den—we find that the categories of continuous full-time employment and 
continuous nonemployment are rather small. This means that there is 
greater homogeneity in the population of women. Most women work part- 
time when they have young children, or they shift from one state to 
another, so that practically all women are employed either full- or part- 
time at one time or another. These patterns are also evident in Australia, 
New Zealand, and the United Kingdom—countries characterized by a 
liberal welfare regime—but continuous nonparticipation is higher in these 
countries, as might be expected in systems with weak family-support 
policies. 

Variation in employment patterns within institutional contexts may 
reflect differences in the composition of the female workforce (mainly its 
age and education). Our first task is, therefore, to test the hypothesis 
regarding the effect of institutional arrangements on the pattern of em- 
ployment along the life course, taking into account women’s socio- 
demographic characteristics. We hypothesized that the likelihood to be in 
each employment category will vary across welfare regimes and employ- 
ment support policies. More specifically, we expect higher rates of con- 
tinuous full-time employment under the liberal welfare regime compared 
to the other two welfare regimes, and in countries with higher support 
for women’s employment, within each regime. 

In order to test these hypotheses, we estimated a set of multinomial 
TABLE 2 
MODELS EXPLAINING WOMEN’S EMPLOYMENT PATTERN ALONG THE LIFE COURSE 

Model                                          −2 Log Likelihood     df 
1. Base model                                      16,180.78           8 
2. Model 1 + welfare regime                        15,807.12          16 
3. Model 2 + employment-supportive policies        15,671.69          24 
4. Model 3 + welfare regime × 
   employment-supportive policies                  15,607.92          28 
5. Model 4 + country-specific effects              15,188.17          52 

Comparison                                         χ² Change   df Change 
Model 2 vs. model 1                                  373.66*           8 
Model 3 vs. model 2                                  135.43*           8 
Model 4 vs. model 3                                   63.77*           4 
Model 5 vs. model 4                                  419.75*          24 




logistic regression models in which the employment pattern is the depen- 
dent variable (with continuous nonemployment as the omitted category). 
The analysis starts out with a simple model, which includes the age and 
education of women and then adds the welfare regime and characteristics 
of employment supportive policies. Model statistics for various specifi- 
cations are presented in table 2. Model 1, which includes age and edu- 
cation as explanatory variables, serves as the base model. This model 
assumes no variation across welfare and policy regimes in women's pat- 
terns of employment. Model 2 tests the effect of welfare regime, while 
model 3 adds the effect of family policy to the previous model. Model 4 
tests the full set of interactions between the two dimensions, welfare 
regime and policy level. In the final step, we fitted model 5, which assumes 
country-specific effects on the employment pattern. The top half of table 
2 presents the goodness of fit for each model, and the bottom panel shows 
the statistics for model comparisons. 

A comparison between model 2 and model 1 reveals that type of welfare 
regime significantly affects women’s employment pattern. The likelihood 
statistic (−2LL) decreased from 16,180.78 to 15,807.12 (χ² = 373.66; df 
= 8). The comparison of model 3 with model 2 reveals that 
employment-supportive policies also have a significant effect on the 
employment pattern. Model 3, which includes both welfare regime and 
employment policies, improves on model 2 (χ² = 135.43; df = 8), indicating 
that the two dimensions are important for understanding women’s employment 
pattern. Model 4 includes the full set of interactions between welfare 
regimes and policy level. The −2LL value associated with this model is 
significantly lower than in model 3 (χ² = 63.77; df = 4). This attests to the 
fact that the level of support for mothers’ employment affects women’s 
employment patterns differently depending on welfare regime. 

Last, we tested a model with an indicator for each of the 12 countries 
to capture the total variation across countries.’ The model statistics for 
the difference between model 5 and model 4 are significant (χ² = 419.75; 
df = 24). It is clear from these findings that there are important country 
idiosyncrasies and that country-specific institutions and norms uniquely 
affect the employment patterns of women. Nonetheless, the analysis un- 
derscores the fact that welfare regime and female employment policies go 
a long way to explain the differential patterns of women’s employment 
along the family life course. Consequently, model 4 is our model of choice 
for evaluating the effects of employment support policy and welfare re- 
gime on women’s employment patterns. The coefficient estimates asso- 
ciated with this model are presented in table A1 in the appendix. 
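
The nested-model comparisons reported in table 2 can be reproduced directly from the −2 log likelihood values and degrees of freedom, as the short sketch below illustrates. The function and variable names are ours, introduced only for illustration. The critical values are the standard chi-square cutoffs at the .05 level for 4, 8, and 24 degrees of freedom, taken from a conventional chi-square table rather than from the article.

```python
# (model label, -2 log likelihood, df) as reported in table 2.
MODELS = [
    ("1", 16180.78, 8),
    ("2", 15807.12, 16),
    ("3", 15671.69, 24),
    ("4", 15607.92, 28),
    ("5", 15188.17, 52),
]

# Standard chi-square critical values at the .05 level, keyed by df.
CRITICAL_05 = {4: 9.488, 8: 15.507, 24: 36.415}


def nested_comparisons(models):
    """Return (label, chi-square change, df change) for each adjacent pair."""
    rows = []
    for (a, dev_a, df_a), (b, dev_b, df_b) in zip(models, models[1:]):
        # The drop in -2LL between nested models is the likelihood-ratio
        # chi-square; its df is the number of parameters added.
        rows.append((f"model {b} vs. model {a}",
                     round(dev_a - dev_b, 2),
                     df_b - df_a))
    return rows


for label, chi2, df in nested_comparisons(MODELS):
    print(label, chi2, df, "significant at .05:", chi2 > CRITICAL_05[df])
```

Each difference matches the corresponding entry in the lower panel of table 2, and each exceeds its critical value.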

In order to facilitate the evaluation of the results and their bearing on 
our hypotheses, we calculated the expected probabilities of the various 
employment patterns for each of the six institutional contexts. The prob- 
abilities were derived from the multinomial logistic results of model 4 and 
were calculated for a woman age 40 with a high school education. The 
results are presented in graphical form in figure 3. In general, employment- 
supportive policies increase the probability of working full-time contin- 
uously while family demands are high and reduce the probability of with- 
drawing from market work. As evident from figure 3a, the probability of 
continuous full-time employment within each welfare regime is higher 
where employment supportive policies are extensive. For example, the 
probability of continuous full-time employment in the liberal context is 
0.30 for women whose employment is supported by the state and only 
0.18 for those with a low level of support. The pattern is similar in the 
conservative regime. The probability of continuous full-time work is 0.39 
where policies are highly supportive and 0.30 where support is low. The 
same is true for the social-democratic regime (0.17 vs. 0.10, respectively). 
When comparing across welfare regimes, we find that women in conser- 
vative countries have a significantly higher probability to work full-time 
continuously compared to women in social-democratic regimes. 
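
Expected probabilities of this kind follow from the standard multinomial logit formula, in which the omitted category (here, continuous nonemployment) has a linear predictor of zero. The sketch below shows only that conversion step. The linear predictors passed to it are hypothetical values chosen for illustration; they are not computed from table A1, because the coding of the education variable is not given in this section.

```python
import math


def multinomial_probs(linear_predictors, reference="NILF-NILF"):
    """Convert multinomial logit linear predictors into probabilities.

    `linear_predictors` maps each non-reference category to its value of
    x'beta. The reference category has a linear predictor of 0, so its
    probability is 1 / (1 + sum of exp(x'beta)).
    """
    exps = {k: math.exp(v) for k, v in linear_predictors.items()}
    denom = 1.0 + sum(exps.values())
    probs = {k: e / denom for k, e in exps.items()}
    probs[reference] = 1.0 / denom
    return probs


# Hypothetical linear predictors for one institutional context.
example = {"Full-Full": 0.2, "Part-Full": -0.8,
           "Part-Part": 0.1, "NILF-Part": -0.3}
for pattern, p in multinomial_probs(example).items():
    print(f"{pattern}: {p:.3f}")
```

In an application along the lines described in the text, each linear predictor would be the sum of the constant, the age and education terms evaluated at the chosen profile, and the coefficient for the institutional context.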

The probability of withdrawing from the labor force during the two 
time periods under study is higher in the conservative countries with 
intermediate level of support than in conservative countries with high 
support for mothers’ employment (fig. 3b). A similar pattern is found 


* When the dummy indicators for country effect are added to model 4, the terms for 
welfare regime and support levels (and their interactions) drop out. Hence, this model 
has 52 degrees of freedom associated with it. 
among the social-democratic countries (0.27 and 0.06, respectively, for 
intermediate and high support). Among the liberal countries, however, 
the employment policy level does not affect the probability of interrupted 
employment. A short-term work interruption followed by a transition 
to part-time employment is more prevalent in countries with a lower 
level of support for mothers’ employment, within each welfare regime 
(fig. 3c). 

The other employment patterns are less consistent with our expectations 
(fig. 3d and fig. 3e). Specifically, the probability of working part-time 
during the entire period is similar for most categories. It is higher only 
for social-democratic countries with high support for mothers’ employ- 
ment (0.44) and lower in the liberal regime with intermediate support. 
Finally, no meaningful differences are evident in the probability of work- 
ing part-time for a short period. 

It is important to keep in mind that, while representing well-defined 
points in the family life course, the work experience women reported may 
have spanned a long period of time. A 60-year-old woman and a woman 
in her 30s are likely to have had preschool children in quite different 
historical periods. In particular, it is not clear to what extent the welfare 
and employment-support classifications that pertain to rather recent cir- 
cumstances are pertinent to the labor market older women experienced 
years earlier. Although a historical examination of policy changes in each 
country is beyond the scope of the present study, we addressed this issue 
by reanalyzing the data separately for four birth cohorts. This detailed 
analysis did not reveal any systematic cohort effects. More important, the 
patterns described earlier were revealed with varying intensity in each of 
the birth cohorts.’ We conclude, then, that although policies have changed 
during the last 30-40 years in most countries, the underlying ideology and 
basic institutional mechanisms of employment-supportive policies for 
women were probably already present and did not change substantially 
over this period. 


Employment Patterns and Earnings 


The last issue to be examined is the effect of the employment pattern 
along the family’s life cycle on women’s earnings later in life. Our purpose 
is to test the hypotheses concerning the effect of welfare regime and em- 
ployment supportive policies on penalties to employment interruption. 


” The cut-off points for the four age groups were under 40, 41-50, 51-60, and above 
60. The patterns were generally more pronounced in the younger age cohort but were 
still significant in the oldest age group. The detailed results may be obtained from the 
authors upon request. 
TABLE 3 
MODELS EXPLAINING WOMEN’S WAGE RANK BY EMPLOYMENT PATTERN ALONG 
THE LIFE COURSE 

Model                                          −2 Log Likelihood     df 
1. Base model                                       3,642.36          16 
2. Model 1 + welfare regime                         3,027.00          26 
3. Model 2 + employment-supportive policies         2,678.06          36 
4. Model 3 + welfare regime × 
   employment-supportive policies                   2,665.90          41 
5. Model 4 + country-specific interactions          2,484.26          71 

Comparison                                         χ² Change   df Change 
Model 2 vs. model 1                                  615.36*          10 
Model 3 vs. model 2                                  348.94*          10 
Model 4 vs. model 3                                   12.16*           5 
Model 5 vs. model 4                                  181.64*          30 


For this purpose, we regressed earnings at the time of the survey on the 
pattern of past employment and the institutional context, controlling for 
age, education, hours of work, current occupation, and authority posi- 
tion.” To control for women’s selection into paid employment, we applied 
the Heckman selection procedure. Various specifications were examined, 
and the model comparisons are presented in table 3. The base model (col. 
1) introduces the individual-level variables. The second model adds the 
welfare regimes, the third model adds the employment-supportive policies, 
the fourth model includes the full set of interactions between welfare 
regime and the level of employment-supportive policies, and the last model 
(model 5) adds an indicator for each country. In the lower half of the 
table, we evaluate the contribution of various models using χ² tests for 
the difference in the log likelihood. 

The comparison of model 2 to model 1 reveals a large and significant 
improvement when welfare regime type is included (χ² = 615.36; df = 
10). The additive combination of the two institutional indicators further 
improves the model, indicating a unique significant effect of level of 
support (evident from the comparison of model 3 to model 2, where χ² = 
348.94; df = 10). Model 4 adds interaction terms for the combination of 
welfare regime and level of employment-supportive policies, which further 


The analysis of income consequences is limited to women who participated in paid 
employment at the time of the survey. This limitation results in a sample of 2,436 
women. 
improves the estimation, but only marginally so (χ² = 12.16; df = 5). 
The last model adds idiosyncratic factors, which are captured by a 
separate indicator for each country. The comparison of this model with the 
previous one (χ² = 181.64; df = 30) indicates that, in addition to welfare 
regime and employment-supportive policies, unique country characteristics 
also shape the effect of women’s employment pattern on their wage level. 

Since model 4 best captures the systematic effects of the association 
between employment patterns, welfare regime, and employment suppor- 
tive policies on earnings, we turn now to examine it more closely (table 
4). The first and second columns of table 4 show the coefficient estimates 
and their standard errors, respectively. The comparison category used in 
the model is continuous full-time employment in the liberal regime with 
low support, so that the coefficients measure the deviation of each social 
context from this baseline. 

Since we are interested in differences among employment patterns 
within each regime and employment policy type, we calculate the differ- 
ence between the coefficient for each employment pattern and the coef- 
ficients for full-time continuous employment, in each institutional context. 
These differences may be interpreted as the “penalties” for interruption 
of employment during the family life course, within the various social 
contexts. The results are presented in the third column. For example, 
under the liberal regime with intermediate support, the coefficient for 
continuous full-time employment was 0.181, and the coefficient for 
continuous part-time employment was 0.078. Hence, the calculated cost 
of continuous part-time work is 0.103 (0.181 − 0.078 = 0.103). 
Similarly, the cost of a long-term work separation in a conservative regime 
with an intermediate level of support equals 0.073 (0.182 − 0.109 = 0.073). 
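
The penalty calculation can be written out for any institutional context as the coefficient for continuous full-time employment minus the coefficient for each other pattern. The sketch below applies it to coefficients copied from table 4 for two contexts; the dictionary layout and function name are ours, and only coefficients that are clearly legible in the table are included.

```python
# Coefficients from table 4, keyed by institutional context and pattern.
COEFS = {
    "liberal, intermediate support": {
        "Full-Full": 0.181, "Part-Full": 0.179,
        "Part-Part": 0.078, "NILF-Part": 0.115,
    },
    "conservative, intermediate support": {
        "Full-Full": 0.182, "Part-Full": 0.214, "Part-Part": 0.105,
        "NILF-Part": 0.118, "NILF-NILF": 0.109,
    },
}


def penalties(context_coefs):
    """Cost of each pattern relative to continuous full-time employment.

    A positive value is a wage-rank penalty; a negative value means the
    pattern is associated with a higher wage rank than Full-Full.
    """
    base = context_coefs["Full-Full"]
    return {pattern: round(base - coef, 3)
            for pattern, coef in context_coefs.items()
            if pattern != "Full-Full"}


for context, coefs in COEFS.items():
    print(context, penalties(coefs))
```

The within-group differences printed in the third column of table 4 are the same quantities with the sign reversed (pattern coefficient minus the Full-Full coefficient).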

We then tested the statistical significance of these differences. With few 
exceptions, the figures support our expectation that high support for 
women’s employment minimizes the costs of employment interruptions 
and the transition to part-time. A reduction of wages due to work sep- 
aration is apparent in all welfare regimes with an intermediate level of 
employment-supportive policies. For example, in the liberal regime, with 
intermediate support for mothers’ employment, short-term work inter- 
ruption reduces the wage rank by seven percentage points. A long-term 
separation from paid employment reduces the wage rank by more than 
19 percentage points (see the third column of liberal—intermediate sup- 
port). The reduction is slightly smaller in the social-democratic countries 
(16 percentage points). Long-term work separation (out of the labor force 
continuously) entails lower costs in the conservative countries with 
intermediate employment support (seven percentage points). Interestingly, 
and somewhat unexpectedly, continuous part-time employment reduces 
TABLE 4 
EFFECTS OF EMPLOYMENT PATTERN ALONG THE LIFE COURSE ON WAGE RANK 

                                                        Within-Group 
                                  Coefficient     SE     Difference 
Age                                   .004*      .000 
Education                             .079*      .003 
Hours of work                         .005*      .000 
Whether supervise                     .039*      .006 
Whether white collar                  .034*      .006 
Liberal, low support: 
  Part-full                          −.018       .015      −.018 
  Part-part                          −.004       .015      −.004 
  NILF-part                          −.012       .015      −.012 
  NILF-NILF                           .004       .025       .004 
Liberal, intermediate support: 
  Full-full                           .181       .019       0 
  Part-full                           .179       .023      −.002 
  Part-part                           .078       .025      −.103* 
  NILF-part                           .115       .024      −.066* 
  NILF-NILF                          −.013       .032      −.194* 
Conservative, intermediate support: 
  Full-full                           .182       .019       0 
  Part-full                           .214       .029       .032 
  Part-part                           .105       .020      −.077* 
  NILF-part                           .118       .023      −.064* 
  NILF-NILF                           .109       .028      −.073* 
Conservative, high support: 
  Full-full                           .012       .016       0 
  Part-full                           .014       .026       .002 
  Part-part                           .020       .019       .008 
  NILF-part                           .023       .033       .011 
  NILF-NILF                           .056       .034       .044 
Social democratic, intermediate support: 
  Full-full                           .346       .022       0 
  Part-full                           .308       .020      −.038 
  Part-part                           .250       .017      −.096* 
  NILF-part                           .249       .016      −.097* 
  NILF-NILF                           .184       .030      −.162* 
Social democratic, high support: 
  Full-full                           .214       .028       0 
  Part-full                           .247       .029       .033 
  Part-part                           .164       .018      −.050 
  NILF-part                           .128       .022      −.086* 
  NILF-NILF                           .181       .048      −.033 
[illegible]                          −.018       .015 

NOTE. The model is based on the Heckman procedure using a maximum-likelihood 
estimator. Model χ² = 1,848.17; df = 41. 
* P < .05. 
the wage rank by 8-10 percentage points in all countries with an 
intermediate level of support for mothers’ employment. 

Contrary to our expectations, no wage penalties are observed in coun- 
tries with a liberal welfare regime and low employment support. It should 
be recalled, however, that the estimation models include type of occu- 
pation and authority position. When these attributes were excluded from 
the model (these analyses are not presented), effects of employment pattern 
on earnings did emerge.’ We infer from this that in the liberal market 
regime work is structured in a way that creates a strong linkage between 
employment pattern and type of occupation and position. These, in turn, 
largely determine economic rewards. How this is achieved cannot be 
determined from our data and requires additional investigation. 

When employment policies are aimed to encourage mothers’ partici- 
pation in paid employment, low costs or even no costs are associated with 
deviations from continuous full-time employment. Only in the 
social-democratic regime with high support for mothers’ employment are women 
penalized (by a reduction of nine percentage points) for a short separation 
from the labor force. Lower levels of support for female employment, 
however, do bring about penalties to work interruptions and to long-term 
transition to part-time employment. 


CONCLUSIONS 


Women’s low attainments in the labor market are commonly attributed 
to their employment patterns along the family’s life cycle. It has been 
argued that women’s interrupted employment while they have young 
children at home is largely responsible for their lower wages and their 
limited employment opportunities in the labor market. In this article, we 
focused on two institutional dimensions: Esping-Andersen’s typology of 
welfare regimes and the level of mothers’ employment-supportive policies, 
suggested by Gornick et al. (1997), to scrutinize their effect on women’s 
employment pattern during the period when family demands are high and 
the consequences of that employment pattern for earnings later in life. We 
anticipated variation in the level of employment interruptions and re- 
duction of working hours along the lines of welfare regimes and employ- 
ment-supportive policies. Moreover, we expected variations in the costs 
of discontinued employment and of the transition into part-time work 
among the institutional dimensions. 

The empirical analysis confirmed the complex relationship between 


* This was not the case in any of the social contexts except for the liberal regime with 
low employment support. 
institutional context and women’s employment patterns. We found that 
within all welfare regimes, employment continuity is highest among coun- 
tries in which the state provides support for working mothers. That is, 
policy aimed toward supporting mothers’ employment facilitates their 
attachment to the labor market by providing the necessary conditions to 
participate in full-time employment on a continuous basis. In countries 
where the family assumes most of the child-rearing responsibilities (i.e., 
countries with conservative regimes and those that do not highly support 
women’s employment), women are compelled to interrupt their employ- 
ment and rely on other sources of income—either through their spouses 
or through state mechanisms. 

By affecting the way in which labor markets operate, the state, through 
policies and regulations, reduces the costs of women’s part-time and 
discontinued employment. We found support for the proposition that 
institutions mediate the costs of employment interruptions. Both in 
conservative and in social-democratic countries with lower levels of support for 
mothers’ employment, part-time and discontinuous employment reduce 
women’s earnings in the long run. This, however, is not the case in most 
countries that encourage mothers’ participation in paid employment. 

Although many of the findings are in line with the propositions outlined 
at the outset, some inconsistencies emerged as well. For example, the rate 
of women’s full-time employment in conservative countries was higher 
than anticipated by the model. Likewise, the penalty to work interruptions 
(including transitions to part-time employment) in social-democratic 
countries with an intermediate level of support for mothers’ employment was 
relatively high. Moreover, no wage penalties were observed in liberal 
countries with low support for mothers’ employment. These deviations 
underscore the fact that idiosyncratic factors such as unique historical 
processes, or normative systems, operate at the country level. 

Our findings have several important implications. First, they underscore 
the importance of institutional context in affecting women’s employment 
patterns. We found that welfare regime and employment supportive policy 
explain much of the variation in women’s employment patterns across 
countries. Policies directed toward women’s employment have the poten- 
tial of encouraging or discouraging continuous full-time employment. 
These policies are related to, but do not necessarily coincide with, the 
welfare regimes (e.g., Sainsbury 1994, 1996; O’Connor 1996). 

Second, against the common argument that part-time work is detrimental to 
women’s market position, the cross-national comparison reveals a more 
complex reality. Societies have different means of incorporating women 
into the paid economy and rewarding them. Some countries developed strong 
mechanisms for integrating women into the labor market through highly 
supportive policies. This has a twofold effect: first, it results in a more 
heterogeneous female labor force, which is accommodated by diverse 
employment patterns and, second, since there is no “standard” employment 
pattern, no penalty is associated with intermittent or part-time work. By 
way of contrast, in countries that maintain the traditional gender division 
of labor and preserve women’s familial roles (providing only intermediate 
levels of support for their employment), continuous employment is less 
prevalent and the labor market penalties for “nonstandard” employment 
are the highest. 


APPENDIX 


TABLE A1 
LOG-LIKELIHOOD ESTIMATES OF EMPLOYMENT PATTERN ALONG THE LIFE CYCLE 

                                          Full-Full  Part-Full  Part-Part  NILF-Part 
Age                                         −.027*     −.022*     −.044*     −.018* 
                                            (.003)     (.004)     (.003)     (.003) 
Education                                    .486*      .447*      .354*      .155* 
                                            (.036)     (.041)     (.035)     (.036) 
Liberal, intermediate support                .457*     −.065      −.391*     −.270 
                                            (.163)     (.186)     (.187)     (.175) 
Conservative, intermediate support           .224      −.898*     −.171     −1.075* 
                                            (.125)     (.170)     (.127)     (.130) 
Conservative, high support                   .147*     −.436*      .136     −1.618* 
                                            (.126)     (.164)     (.132)     (.188) 
Social democratic, intermediate support     −.936*     −.632*     −.247*     −.227* 
                                            (.143)     (.13)      (.115)     (.102) 
Social democratic, high support             1.036*     1.020*     1.765*      .931* 
                                            (.223)     (.228)     (.197)     (.199) 
Constant                                    1.396     −1.553       .196       .143 
                                            (.268)     (.310)     (.260)     (.258) 

NOTE. N = 5,421; model χ² = 1,227.87; df = 24. 
* P < .05. 
To conserve space for the publication of original contributions to 
scholarship, the comments in this section must be limited to brief 
critiques. They are expected to address specific errors or flaws in 
articles and reviews published in the AJS. Comments on articles are 
not to exceed 1,500 words, those on reviews 750 words. Longer or 
less narrowly focused critiques should be submitted as articles. 
Authors of articles and reviews are invited to reply to comments, 
keeping their replies to the length of the specific comment. The AJS 
does not publish commenters’ rebuttals to authors’ replies. We reserve 
the right to reject inappropriate or excessively minor comments. 














COMMENT: WHY WE LOVE OUR CHILDREN¹ 


Role conflict is one of the oldest concepts in sociology (Stryker and Macke 
1978). It occurs when actors pursue two or more roles that are incom- 
patible with each other, given the actors’ limited resources. (If actors had 
unlimited time, energy, and other resources, they would never experience 
role conflict.) As an increasing number of women move into life-long labor 
force participation, the work/family conflict that these women experience 
when they are simultaneously full-time workers and mothers of small 
children has become a particularly important topic for sociological re- 
search (Moen 1992). At the same time, because an increasing number of 
men are now married to these women and carry out some of the child 
care responsibilities, more men experience greater work/family conflict 
than ever before (Gerson 1993). 

Given that their work roles place certain demands on them, employed 
parents would experience the work/family conflict to the extent that they 
desire to spend more time and energy on their roles as parents. Thus the 
question of the work/family conflict is partially a question of parental 
investment (as well as partially a question of work investment). Why do 


¹ I thank Paula England, Jeremy Freese, Rosemary L. Hopcroft, Matthew C. Keller, 
and Alan S. Miller for their comments on earlier drafts, and Cary Lee Coryell and 
Abigail Iris Coryell for the essential insight. Direct all correspondence to Satoshi 
Kanazawa, Department of Sociology, Indiana University of Pennsylvania, Indiana, 
Pennsylvania 15705-1087. E-mail: Kanazawa@grove.iup.edu 


Permission to reprint a comment printed in this section may be obtained only from 
the author. 
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some parents choose to spend more time and energy to take care of their 
children than others? 

The question of role conflict, and of why some parents choose to make 
greater parental investment into their children than others, points to a 
larger theoretical question: Why do actors choose to pursue what they 
pursue (be it work or parental investment or anything else)? Why do 
actors want what they want? If individuals do not pursue parenthood 
and employment simultaneously, they would not experience the dilemma 
of the work/family conflict. Since role conflict of any kind has almost 
entirely negative consequences, no rational persons would choose to ex- 
perience role conflict if they had a choice. The fact that role conflict is 
nonetheless very prevalent seems to suggest that the goals actors pursue 
when they play mutually incompatible roles are not consciously chosen. 
Where then do actors’ values and preferences (which define their goals) 
come from? 

I contend that evolutionary psychology can answer the question of 
where values and preferences come from (Kanazawa 2001) and can there- 
fore explain why some parents choose to invest more time and energy 
into their children than others. One evolutionary psychological theory of 
parental investment (known as the Trivers-Willard hypothesis; Trivers 
and Willard 1973), for instance, explains why wealthy parents value sons 
over daughters while poor parents value daughters over sons. However, 
a recent empirical test of the hypothesis (Freese and Powell 1999) finds 
absolutely no support for it, thereby throwing into question the relevance 
of evolutionary psychology for traditional sociological problems. In this 
brief comment, I will first present the fundamental principles of evolu- 
tionary psychology and point out one minor problem with Freese and 
Powell’s test of the Trivers-Willard hypothesis, which might very well 
have led to their null finding. I will then present my own empirical test 
of the hypothesis, using data from the National Survey of Families and 
Households, which supports the hypothesis. 


PRINCIPLES OF EVOLUTIONARY PSYCHOLOGY 


Evolutionary psychology explains human cognition and behavior in terms 
of evolved psychological mechanisms. An evolved psychological mecha-
nism is an information-processing procedure or decision rule that evo-
lution by natural and sexual selection has equipped humans to possess 
in order to solve a particular adaptive problem (a problem of survival or 
reproduction). Unlike decision-rules in microeconomic subjective expected 
utility maximization theory or game theory, however, psychological mech- 
anisms mostly operate behind our conscious thinking. 

Male sexual jealousy is an example of an evolved psychological mech-
anism (Daly, Wilson, and Weghorst 1982). Because gestation in human 
and most other mammalian species occurs inside the female body, males 
of these species can never be certain of the paternity of their mates’ 
offspring, while females are always certain of their maternity. In other 
words, the possibility of cuckoldry exists only for males. Men who are 
cuckolded and invest their resources in the offspring of other men end 
up wasting these resources, and their genes will not be represented in the 
next generation. Men therefore have a strong reproductive interest in 
making sure that they will not be cuckolded, while women do not share 
this interest. Accordingly, men have developed a psychological mechanism 
that makes them extremely jealous at even the remotest possibility of their 
mates’ sexual infidelity. The psychological mechanism of sexual jealousy 
attenuates men’s adaptive problem of paternal uncertainty. The same 
psychological mechanism often leads to men’s attempt at mate guarding, 
in order to minimize the possibility of their mates’ sexual contact with 
other men, sometimes with tragic consequences (Buss 1988; Buss and 
Shackelford 1997). 

While men and women are the same in the frequency and intensity of 
their jealousy in romantic relationships (White 1981; Buunk and Hupka 
1987), there are clear sex differences in what triggers jealousy. There is 
both survey and physiological evidence from different cultures to show 
that men get jealous of their mates’ sexual infidelity with other men,
reflecting their reproductive concern about cuckoldry. In contrast, women
get jealous of their mates’ emotional involvement with other women, 
because emotional involvement elsewhere often leads to diversion of their 
mates’ resources from them and their children to their romantic rivals 
(Buss, Larsen, and Weston 1992; Buss et al. 1999). 

Note that we do not consciously choose or decide to get jealous. We 
just get jealous under some circumstances, in response to certain pre- 
dictable triggers, but otherwise do not know why. However, what triggers 
jealousy is always understandable to others, and these triggers are also 
cross-culturally constant (Thiessen and Umezawa 1998); otherwise, no 
romance novels or romantic comedies would ever become international 
hits. Evolutionary psychology contends that evolved psychological mech- 
anisms are responsible for most of our emotions, and they are also behind 
most of our preferences and desires (Kanazawa 2001). Evolutionary psy- 
chology explains human cognition and behavior in terms of these evolved 
psychological mechanisms (and the preferences, desires, and emotions they 
produce in us). 

Evolutionary psychology contends that the entire human body (includ- 
ing the brain and its psychological mechanisms) evolved over millions of 
years during the Pleistocene epoch in the African savanna where humans 
lived during most of this time (Maryanski and Turner 1992, pp. 69-90).
This environment, the African savanna where humans lived in small
bands of 50 or so related individuals as hunter-gatherers,² is called the
environment of evolutionary adaptedness (EEA) (Bowlby 1969) or an- 
cestral environment, and it is to the EEA that our body (including the 
brain and its psychological mechanisms) is adapted. To the extent that 
the current environment in which we live is different from the EEA, 
evolutionary psychology predicts that our cognition and behavior are not 
necessarily adaptive (“fitness maximizing”). “The adaptive correspondence 
between present conditions and present behavior, to the extent that it 
exists, is contingent, derived and incidental to Darwinian explanation. It
depends solely on how much the present ontogenetic environment of an 
individual happens to reflect the summed features of the environment 
during recent evolutionary history, that is, on how different the present 
environment is from ancestral conditions” (Tooby and Cosmides 1990, p. 
378). 

Recall the example of male sexual jealousy as an evolved psychological 
mechanism. This psychological mechanism solved the adaptive problem 
of successful reproduction in the EEA by allowing men who possessed it 
to maximize paternal certainty and minimize the possibility of cuckoldry. 
Their sexual jealousy was therefore fitness maximizing in the EEA. How- 
ever, we now live in an environment where sex and reproduction are often 
separated (in that many episodes of copulation do not lead to reproduc- 
tion). There is an abundance of reliable methods of birth control, and 
many married women in industrial societies use the birth control pill. For 
these women, sexual infidelity does not lead to childbirth, and their mates 
will not have to waste their resources on someone else’s offspring. In other
words, the original adaptive problem no longer exists. Yet men still possess
the same psychological mechanism to make them jealous at the possibility
of their mates’ sexual infidelity and to compel them to guard their mates
to minimize the possibility of cuckoldry. No man would ever be comforted
by the fact that his adulterous wife was on the pill at the time of her
sexual infidelity.


2 As a first approximation, it might be useful to think of the EEA as the African
savanna during the Pleistocene era, because this is indeed where many psychological
mechanisms evolved. Technically, however, the EEA “is not a place or a habitat, or
even a time period. Rather, it is a statistical composite of the adaptation-relevant
properties of the ancestral environments encountered by members of ancestral pop-
ulations, weighted by their frequency and fitness-consequences” (Tooby and Cosmides
1990, pp. 386-87). In other words, the EEA might be different for different adaptations.
For instance, in order to pinpoint the EEA for male sexual jealousy as an adaptation,
we need to consider the entire period of evolution from the time when males did not
have sexual jealousy (this is likely to be long before they were human) until the time
when all human males had the psychological mechanism of sexual jealousy that they
currently possess. This is likely the period during which cuckoldry was a problem for
paternal certainty. Further, we must emphasize the period during which cuckoldry
was more prevalent (weight by frequency) and during which males with sexual jealousy
had particularly greater reproductive success than those without it (weight by fitness-
consequences).

Further, because our environment is so vastly different from the EEA, 
we now face a curious situation where those who behave according to 
the dictates of the evolved psychological mechanism are often worse off 
in terms of survival and reproduction. Extreme forms of mate guarding, 
such as violence against mates or romantic rivals, are felonies in most 
industrial nations. Incarceration, and consequent physical separation from 
their mates, to which such violence can lead, does everything to reduce 
the reproductive success of the men. Just as our taste for fats and sweets 
(which increased our chances of survival and reproductive success in the 
EEA) makes us obese and reduces our chances of survival and repro- 
ductive success today, uncritically following the emotions and desires cre- 
ated in us by our evolved psychological mechanisms often leads to mal- 
adaptive behavior. 

It is my contention that the evolved psychological mechanisms produce 
values and preferences in us. No matter what individuals’ values and 
preferences are, they are always in the brain. It is therefore important to 
figure out how the human brain works in order to solve the problem of 
values. It is the basic principle of evolutionary psychology that the brain 
consists of evolved psychological mechanisms. These psychological mech- 
anisms create values and preferences that motivate human behavior. Hu- 
man actors take these values and preferences as their goals and make 
rational decisions to pursue these goals in the context of structural and 
institutional constraints. I therefore contend that evolutionary psychology 
is key to any theory of values. 


PARENTAL INVESTMENT: THE TRIVERS-WILLARD HYPOTHESIS 
AND FREESE AND POWELL 


The Trivers-Willard hypothesis (Trivers and Willard 1973) is unique
within evolutionary psychology in that it makes class-based predictions 
of parental investment behavior and, in that sense, is particularly relevant 
to traditional concerns of sociologists. It begins with the observation, later 
confirmed by a large cross-cultural study (Buss 1989), that men’s repro- 
ductive success largely depends on their wealth and status (because 
women prefer to mate with wealthy, powerful men) and women’s repro- 
ductive success largely depends on their youth and physical attractiveness 
(because men prefer to mate with young, physically attractive women). 
In other words, men’s reproductive success hinges on factors that are 
closely associated with class, whereas women’s reproductive success
hinges on factors that are mostly orthogonal to class. For this reason, 
daughters from poor families are expected to attain higher reproductive 
success than their brothers, whereas sons from wealthy families are ex- 
pected to attain higher reproductive success than their sisters. Parents 
should thus be selected to favor (albeit unconsciously) daughters over sons 
in poor families and sons over daughters in wealthy families. The Trivers- 
Willard hypothesis has been supported by a large number of empirical 
studies on societies across history and throughout the world, including 
the contemporary United States (Betzig and Weber 1995; Gaulin and 
Robbins 1991). Cronk (1991) provides a comprehensive review of the 
empirical evidence in support of the hypothesis. 

Most recently, Freese and Powell (1999) subject the Trivers-Willard 
hypothesis to a rigorous empirical test. Freese and Powell’s article is highly 
commendable on at least three separate counts. First, unlike most soci- 
ologists, who tend to dismiss all biological explanations of human behavior 
on political and ideological grounds, Freese and Powell take evolutionary 
psychology seriously. Their article begins and ends with a sincere call to 
fellow sociologists to examine evolutionary psychology as a possible source 
of explanations of human behavior. Second, once again unlike most so- 
ciologists, Freese and Powell are enormously knowledgeable about evo- 
lutionary psychology. Their critique of it, unlike that by many social 
scientists, decidedly does not stem from their ignorance of what they
criticize. In particular, they are precisely correct when they state: “Once 
such a tendency [for wealthy families to favor sons over daughters and 
poor families to favor daughters over sons] has evolved, its influence on 
parental investment should persist even in evolutionary environments in 
which a Trivers-Willard effect does not contribute to greater fertility (e.g., 
in contemporary American society and others in which social status and 
number of offspring are not positively related).” In other words, Freese 
and Powell explicitly recognize that evolved psychological mechanisms 
(such as the one that produces the Trivers-Willard effect) need only be 
adaptive in the EEA, and not necessarily in the current environment. 
Third, they put evolutionary psychological hypotheses to rigorous em- 
pirical tests. They recognize that the merit of a scientific perspective does 
not rest on its ideological and political implications, but rather on its 
ability to explain empirical data. 

Despite their extreme seriousness, competence, and rigor, however, 
Freese and Powell find absolutely no empirical support for the Trivers- 
Willard hypothesis in their analysis of two large nationally representative 
data sets (the National Educational Longitudinal Study of 1988 and the 
High School and Beyond Study of 1980). They use a large number of 
measures of parental investment and predict, following Trivers and Wil- 
lard (1973), that the interaction term of class (measured either by income
or parent’s education) and the child’s sex should have a significant effect 
on the level of parental investment into the child. Only one of the 60 
interaction effects presented in their tables 3-5 is statistically significant 
in the predicted direction (far worse than expected by chance); nine are 
statistically significant in the opposite direction. From the overall pattern 
of the results, Freese and Powell conclude that there is no empirical 
support for the Trivers-Willard hypothesis. 
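
The phrase "expected by chance" can be made concrete with a little arithmetic. The sketch below is an illustration only, not part of the original analysis. It assumes that the 60 tests are independent and that each is a two-tailed test at the .05 level, so that under the null a coefficient is significant in a given direction with probability .025; neither assumption is stated in the source.

```python
from math import comb

N_TESTS = 60              # interaction effects reported in tables 3-5
P_ONE_DIRECTION = 0.025   # assumption: two-tailed .05 test, split by direction


def expected_hits(n, p):
    """Expected number of significant results in one direction under the null."""
    return n * p


def prob_at_least(k, n, p):
    """Binomial probability of k or more significant results out of n tests."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


expected = expected_hits(N_TESTS, P_ONE_DIRECTION)
p_nine_or_more = prob_at_least(9, N_TESTS, P_ONE_DIRECTION)

print(f"expected by chance in one direction: {expected:.1f}")
print(f"P(9 or more in one direction | null): {p_nine_or_more:.2e}")
```

Under these assumptions, the one effect in the predicted direction is close to the chance expectation, whereas nine effects in the opposite direction would be very unlikely to arise by chance alone.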

Why do they fail to find any empirical support for the hypothesis? I
believe their null findings are the result of one small error they commit, 
in an uncharacteristic moment of weakness when they fail to heed their 
own admonition to fellow sociologists. I believe that their problem lies in 
their choice of the dependent variables. Seven of the 12 dependent mea- 
sures of parental investment in their analysis of the National Educational 
Longitudinal Study and four of the six dependent measures in their anal- 
ysis of the High School and Beyond Study refer specifically to parental 
investment into the children’s education. These include measures of par- 
ents’ economic investment into the children’s education (whether the par- 
ents have begun saving for their children’s college, how much money they 
have saved for college, whether the children go to private school, and the 
number of educational items in the household, such as encyclopedia and 
atlas) and measures of parents’ involvement in children’s education (the 
frequency of conversation about school, the parents’ involvement with 
the children’s school, and the parents’ involvement in the parent-teacher 
organization). 

In the current environment, higher education, and the subsequent ca- 
reer opportunities that it affords, are equally important for sons and 
daughters. However, it is highly unlikely that our evolved psychological 
mechanisms, adapted to the EEA, would consider higher education as 
investment into sons and daughters alike. What education does is to in- 
crease the child’s future earnings and status. These are factors that in- 
crease only the sons’ reproductive success, not the daughters’. Our psy- 
chological mechanisms, adapted to the EEA, would consider (albeit 
completely unconsciously) investment into children’s education as paren- 
tal investment only for sons, not for daughters. I believe that the slight 
problem in Freese and Powell’s data analysis is that most of their measures 
of parental investment are specific to male offspring, and this minor error 
may have led to their null findings.³ Freese and Powell warn fellow so-
ciologists to beware of the differences in the EEA and the current en-
vironments, yet fail to heed their own advice in their selection of the
dependent variable.


3 Another potential problem is the sample selection bias. Their table 1 indicates that
the mean family income in 1987 among the respondents in the National Educational
Longitudinal Study is $41,600. This seems inordinately high. The mean family income
in the same year for respondents in the National Survey of Families and Households
is $29,100. Freese and Powell may therefore not have a sufficient variation in class
positions in one of their data sets.


Among their measures of the dependent variable, only one (monitoring
of child’s behavior) is specific to female offspring. One of the things par-
ents can do to increase the reproductive success of their daughters is to
limit or eliminate the sexual access men have to their daughters and
thereby to protect their daughters’ sexual reputation. Parental monitoring
of the children’s behavior achieves this goal. (Once again, this truly con-
tributes to the daughters’ reproductive success only in the EEA.) It is
interesting to note in this regard that this dependent measure consistently
behaves differently from all the other measures in Freese and Powell’s
analysis, so much so that it leads the authors to question whether this is
indeed a measure of parental investment (Freese and Powell 1999, p.
1728). I believe that this measure behaves differently from all the others
because it is a female-specific measure of parental investment, while most
of the others are male-specific ones. Freese and Powell (1999, p. 1729 n.
21) also state that poor parents monitor their daughters more closely
(thereby contributing to their daughters’ reproductive success) than
wealthy parents. This finding is also consistent with my interpretation.

I therefore regard Freese and Powell’s otherwise competent analysis to
be inconclusive with regard to the empirical status of the Trivers-Willard
hypothesis in the contemporary United States. I will use another large,
nationally representative data set to subject the hypothesis to an empirical
test once again. I will take particular care to select sex-neutral measures
of parental investment in my analysis.


ANOTHER EMPIRICAL TEST OF THE TRIVERS-WILLARD
HYPOTHESIS


Data


I use the first wave of the National Survey of Families and Households
as my data. A nationally representative sample of 13,007 respondents was
surveyed in 1987 and 1988, through a combination of personal interviews
and self-administered questionnaires. My sample includes all respondents
who have at least one biological child under age 18 in the household
(N = 6,034).


Dependent Variable: Parental Investment


In one part of the self-administered questionnaire, respondents who had 
at least one biological child in the household were asked a series of ques- 
tions regarding their activities with their children. Respondents who had 
children between the ages of 5 and 18 were asked how frequently they 
engaged in the following activities with their children: (a) in leisure ac- 
tivities away from home (picnics, movies, sports, etc.); (b) at home working 
on a project or playing together; (c) having private talks; (d) helping with 
reading or homework. Responses to each of these questions varied from 
0 (never or rarely) to 6 (almost every day). In my analysis of parents with
children ages 5-18, I use the sum of their responses to these four questions. 
The dependent measure for these analyses therefore varies from 0 to 24. 

Respondents who had children between the ages of 0 and 4 (but no
children between the ages of 5 and 18) were asked how frequently they
engaged in the following activities with their children: (a) an outing away
from home (at parks, museums, zoos, etc.); (b) at home playing together;
(c) reading to the child. Responses to each of these questions once again 
varied from 0 (never or rarely) to 6 (almost every day), and thus the sum 
of the three items varied from 0 to 18. Because the two mutually exclusive 
categories of parents (those with only children under 5, and those who 
have at least one child over 5) were asked different numbers of questions 
regarding their activities with their children, these two categories of cases 
cannot easily be combined. I therefore standardized both dependent var- 
iables before merging the two sets of parents and used the standardized 
score as the dependent variable in my analysis of all parents (those with 
only children under 5 and those with at least one child over 5). 
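
The standardize-then-merge step described above can be sketched as follows. This is a minimal illustration with made-up scores, not the author's code; the function name `standardize` and the choice of the population standard deviation are assumptions, since the source does not say which divisor was used.

```python
from statistics import mean, pstdev


def standardize(scores):
    """Convert raw scores to z-scores within one group of parents.

    Assumption: population standard deviation (divisor n).
    """
    m = mean(scores)
    s = pstdev(scores)
    return [(x - m) / s for x in scores]


# Hypothetical raw sums for illustration only.
older = [12, 18, 24, 6, 15]   # four items, possible range 0-24
younger = [9, 18, 3, 12]      # three items, possible range 0-18

# Each group is standardized on its own scale before the two are pooled,
# so a score of 0 means "average for parents asked the same questions".
merged = standardize(older) + standardize(younger)
print([round(z, 2) for z in merged])
```

Standardizing within each group first keeps the different numbers of items (and hence the different ranges) from driving the pooled scores.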

Note that none of the component questions in my measure of parental 
investment refer specifically to higher education. Nor are any of them 
sex-specific in any way. Both boys and girls benefit from engaging in these 
activities with their parents. I therefore believe that, unlike Freese and 
Powell's measures of parental investment, mine are sex-neutral. Of course, 
my claim is decidedly not that parents consciously calculate that taking
their sons or daughters to the zoo more often will increase their repro- 
ductive success by turning their children into more desirable mates in the 
future. As I stated earlier, conscious thinking is usually not involved in 
the operation of psychological mechanisms. My claim instead is that 
wealthy parents unconsciously value their sons more than daughters and 
poor parents unconsciously value their daughters more than sons, and 
the extent to which they value their sons or daughters will manifest itself 
as their desire to be with and take care of their children. The activities 
selected above are just some of the (sex-neutral) ways in which the parents’ 
desire to take care of their children, produced by how much they value
them, manifests itself. 


Independent Variables 


For each parent, I measure whether they have at least one girl between 
the ages of 10 and 15 and whether they have at least one boy between 
the ages of 10 and 15 (for both, 1 = yes). I follow Freese and Powell in 
focusing on sons and daughters at the onset of puberty. In the EEA, 
individuals began their competition for mates when they reached puberty, 
and this was therefore the crucial time for parents to invest in their 
children in order to maximize their reproductive success. There is evidence 
that parents value children at the onset of puberty more than at any other 
age (Crawford, Salter, and Jang 1989; Littlefield and Rushton 1986). 

In addition to these crucial predictor variables, I include controls for 
sex (male = 1), age, race (black = 1), education, and occupational prestige. 
Evolutionary psychology predicts that mothers will make greater parental 
investment than fathers for a couple of reasons. First, women have a 
much lower fitness ceiling than men do; men can potentially produce a 
far greater number of children in their lifetime than women can. Thus, 
even though reproductive success is equally important for men and 
women, each child is more valuable to a woman than it is to a man 
because it represents a greater share of a woman’s lifetime reproductive 
potential than a man’s. Thus we expect mothers to value each child more 
than fathers do. Second, mothers are certain of their maternity, whereas 
fathers cannot be certain of their paternity. For both of these reasons, we 
expect mothers to make greater parental investment than fathers, and 
this prediction is consistently supported throughout my data analysis be- 
low. However, this prediction is not unique to evolutionary psychology 
(as the Trivers-Willard hypothesis is), so it cannot shed light on the
empirical merit of evolutionary psychology relative to other theoretical 
perspectives. I will therefore not concentrate on the effect of sex in this 
comment. 


Social Class 

I divide my sample into rough quartiles according to their annual family 
income: (1) those with less than $10,000 (N = 934); (2) those between 
$10,000 and $25,000 (N = 1,323); (3) those between $25,000 and $40,000
(N = 1,143); and (4) those with more than $40,000 (N = 1,442), with
1,192 cases with missing values on family income. In my analysis, I com- 
pare the top and the bottom quartiles to maximize the contrast of class 
positions. 
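
As a quick consistency check on the grouping above, the sketch below encodes the four income categories and confirms that the reported group sizes, together with the missing cases, add up to the full sample. The function name `income_class` and the treatment of incomes falling exactly on a cut point are assumptions for illustration; the source does not specify how boundary values were assigned.

```python
def income_class(income):
    """Assign an annual family income (in dollars) to one of four rough quartiles.

    Returns None for a missing income. Boundary handling (for example, exactly
    $10,000 going to category 2) is an assumption, not taken from the source.
    """
    if income is None:
        return None
    if income < 10_000:
        return 1
    if income < 25_000:
        return 2
    if income < 40_000:
        return 3
    return 4


# Group sizes as reported in the text; None marks missing family income.
reported_counts = {1: 934, 2: 1_323, 3: 1_143, 4: 1_442, None: 1_192}
total = sum(reported_counts.values())
print(total)
```

The total matches the N = 6,034 reported for the analysis sample.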


TABLE 1
PARENTAL INVESTMENT, ALL CHILDREN AGES 0-18

                                         CLASS
                             Bottom Quartile   Top Quartile
                                  Beta             Beta
Predictors:
  Girls 10-15 ..............     .1217*           -.0525
  Boys 10-15 ...............    -.0136             .0526
Control variables:
  Sex (male = 1) ...........    -.2090[?]         -.1343
  Age ......................    -.0652             .1568[?]
  Race (black = 1) .........    -.0363             .0347
  Education ................     .2273[?]          .0845*
  Occupational prestige ....     .0701             .0300
R² .........................     .1193             .0641
N ..........................      348              1,161

* P < .05
** P < .01
*** P < .001
**** P < .0001
[?] = significance marker not legible


Results 


Table 1 presents the results of regression analyses of parents with children 
of all ages (0-18). Because the unit of the dependent variable has no 
intrinsic meaning (expressed as standard scores), I present only the stan- 
dardized coefficients in my tables. In the left column (bottom quartile), it 
shows that having pubescent girls significantly (P < .05) increases parents’ 
investment into the children, while having pubescent boys seems to de- 
crease it (albeit statistically nonsignificantly). This is consistent with the 
Trivers-Willard hypothesis that parents in poor families invest more in 
their daughters than in their sons. In the right column (top quartile), it 
shows that having pubescent girls significantly (P < .10) decreases parental 
investment while having pubescent boys equally significantly (P < .10) 
increases it. Once again, this is perfectly consistent with the Trivers-Wil- 
lard hypothesis that wealthy families favor sons over daughters. 
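
Why standardized coefficients are convenient when the dependent variable has no intrinsic unit can be seen in a small sketch. The example uses a single predictor and invented data purely for illustration; it is not the multiple regression reported in the tables, and the helper names `slope` and `standardized_beta` are hypothetical.

```python
from statistics import mean, pstdev


def slope(x, y):
    """Ordinary least squares slope of y on a single predictor x."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx


def standardized_beta(x, y):
    """Slope rescaled to standard-deviation units of x and y."""
    return slope(x, y) * pstdev(x) / pstdev(y)


# Invented data: x is a 0/1 indicator, y is an investment score.
x = [0, 1, 0, 1, 1, 0]
y_raw = [10, 14, 9, 15, 12, 11]
y_rescaled = [v * 10 for v in y_raw]   # same scores in a different unit

print(slope(x, y_raw), slope(x, y_rescaled))                           # unit-dependent
print(standardized_beta(x, y_raw), standardized_beta(x, y_rescaled))   # unit-free
```

The raw slope changes with the unit of the dependent variable, while the standardized coefficient does not, which is what makes betas comparable across analyses that use differently scaled outcomes.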

The empirical pattern is essentially the same in my analysis of a subset 
of the parents with at least one child between the ages of 5 and 18, 
presented in table 2. Even though the dependent variable here is not a 
standard score, I continue to present the standardized coefficients to fa- 
cilitate comparison across tables. In the left column (bottom quartile), 
having pubescent girls significantly (P < .10) increases parents’ investment 
into the children, while having pubescent boys decreases it (albeit statis- 
tically nonsignificantly). In the right column (top quartile), having pu- 
bescent boys statistically significantly (P < .05) increases parental invest-
ment, while having pubescent girls decreases it (albeit statistically
nonsignificantly). Once again, the overall pattern is perfectly consistent
with the Trivers-Willard hypothesis. Parents in poor families favor daugh-
ters over sons, while those in wealthy families favor sons over daughters.


TABLE 2
PARENTAL INVESTMENT, CHILDREN AGES 5-18

                                         CLASS
                             Bottom Quartile   Top Quartile
                                  Beta             Beta
Predictors:
  Girls 10-15 ..............     .1147            -.0389
  Boys 10-15 ...............    -.0614             .0752*
Control variables:
  Sex (male = 1) ...........    -.1512*           -.1077[?]
  Age ......................    -.2600[?]         -.2995[?]
  Race (black = 1) .........    -.0506            -.0412
  Education ................     .1800*            .1533[?]
  Occupational prestige ....     .0853             .0073
R² .........................     .1640             .1261
N ..........................      209               760

* P < .05
** P < .01
*** P < .001
**** P < .0001
[?] = significance marker not legible

Figure 1 compares the mean levels of parental investment between the 
bottom and top quartiles. The top panel (fig. 1a) compares the means of 
the standard scores among all parents. The dotted line is for respondents 
with pubescent sons, and the solid line is for respondents with pubes-
cent daughters. (Compare my fig. 1 to Freese and Powell’s [1999] fig. 1.)
The figure shows that, among the respondents in the bottom quartile 
(with the family income less than $10,000 in 1987), those with pubescent 
daughters make greater parental investment than those with pubescent 
sons (0.1450 vs. 0.0742). Among respondents in the top quartile (with 
family income more than $40,000 in 1987), those with pubescent sons 
make greater parental investment than those with pubescent daughters 
(0.1009 vs. 0.0161). 

The bottom panel (fig. 1b) compares the means of the sum of parental 
investment across the four activities for parents with at least one child 
between the ages of 5 and 18. The Trivers-Willard pattern is again evident 
here. Among the respondents in the bottom quartile, those with pubescent 
daughters make greater parental investment than those with pubescent 
sons (17.1784 vs. 16.8268). Among the respondents in the top quartile, 
those with pubescent sons make greater parental investment than those 
with pubescent daughters (16.9517 vs. 16.5286). Both figures exhibit the
pattern presented in Freese and Powell’s figure 1 (1999, p. 1710), depicting
the theoretical prediction of the Trivers-Willard hypothesis.


Fig. 1. Mean levels of parental investment into sons and daughters, by class:
(a) for all children ages 0-18; (b) for children ages 5-18


DISCUSSION 


The analysis of the National Survey of Families and Households provides 
support for the Trivers-Willard hypothesis. However, the observed effect 
is quite weak (often at P < .10). If the Trivers-Willard hypothesis is true, why
does the hypothesized psychological mechanism not produce a stronger 
preference for sons among wealthy parents and one for daughters among 
poor families? 

I believe this is because the necessity to choose one sex over the other,
an important scope condition for the Trivers-Willard hypothesis, is not 
urgent in the current environment.⁴ Relative to our ancestors in the EEA,
contemporary Americans, even those in poverty, are so materially rich, 
and have so few children, that the necessity to choose sons or daughters 
in whom to invest limited resources is often absent. We can sometimes 
invest maximally in both sons and daughters simultaneously, to insure 
the reproductive success of both. Although the psychological mechanism 
behind the Trivers-Willard hypothesis, being part of our innate human 
nature, is still there, and would operate when the circumstances require 
it, the contemporary United States, one of the wealthiest societies in the 
human evolutionary history, might not provide the required circum- 
stances. I would therefore expect to find a stronger effect of class on 
parents’ preference for one sex over the other in societies where resources 
are more scarce and fertility is higher. 

In the final analysis, Freese and Powell are right about one thing. The 
ultimate fate of a theoretical perspective should and does rest on its ability 
to explain empirical phenomena, rather than its political and ideological 
implications. The only way to evaluate a theoretical perspective is there- 
fore to continue to subject its theories to rigorous empirical tests and 
compare their performance to that of other theoretical perspectives. This 
comment is therefore “just another brick in the wall” (Buss 1997), building 
the empirical foundation of evolutionary psychology and its relevance to 
traditional sociological questions. 


SATOSHI KANAZAWA 
Indiana University of Pennsylvania 


1 I owe this insight to Rosemary L. Hopcroft. 
MAKING LOVE OUT OF NOTHING AT ALL? NULL 
FINDINGS AND THE TRIVERS-WILLARD HYPOTHESIS’ 


Kanazawa (2000, 2001; Kanazawa and Still 1999) has established himself 
as one of the leading enthusiasts of evolutionary psychology in sociology. 
In the preceding comment to our article, he succinctly introduces some 
of the basic principles of evolutionary psychology to a sociological au- 
dience that may be largely unfamiliar with this new program. His intro- 
duction also conveys the excitement that many evolutionary psychologists 
feel toward the enterprise. We too are interested in the application of 
evolutionary reasoning to sociological problems, but we are much more 
agnostic on the question of how valuable Darwinian approaches will 
ultimately prove for sociology. Kanazawa and we agree, however, that 
scholars should try to move these debates to empirical grounds whenever 
possible. 

In this spirit, our original study sought to test the application of a 
longstanding sociobiological hypothesis, the Trivers-Willard hypothesis 
(hereafter TWH), to parental investment as it has been commonly con- 
sidered by sociologists (Freese and Powell 1999). In doing so, we were 
pursuing various recommendations that sociologists should do more to 
incorporate Darwinian insights into their work (e.g., van den Berghe 1990; 
Ellis 1996) and following Trivers and Willard’s (1973) own conjecture 
that their hypothesis was applicable to contemporary American society. 
Using two national data sets and a broad range of sociological measures 
of parental investment, we found no evidence for TWH for investment 
in adolescents in contemporary American society.2 Kanazawa, however, 


1 We thank Satoshi Kanazawa for generously making available the data extraction 
files from his analysis of the National Survey of Families and Households. We also 
thank Simon Cheng for his assistance. The research was partially supported by NIMH 
grant (PHS-T32 MH14588) for Freese and NSF grants (SBR-9810435 and SBR- 
9912267) for Powell. Direct correspondence to Jeremy Freese, Department of Sociology, 
1180 Observatory Drive, University of Wisconsin, Madison, Wisconsin 53706. E-mail: 
jfreese@ssc.wisc.edu 

2 While these results fail to support the TWH, we disagree with the characterization 
of them as “throwing into question the relevance of evolutionary psychology for tra- 
ditional sociological problems” (p. XX). Adjudicating this matter requires much more 
empirical and theoretical consideration than any single test of a single hypothesis can 
provide. In addition, we are reticent about referring to the TWH as an evolutionary 
psychological hypothesis, because it predates the specific program of evolutionary 
argues that our original study is compromised by an erroneous selection 
of dependent variables. Using a different data set and a different measure 
of parental investment, Kanazawa finds results that appear to support 
the hypothesis, which he says provide “just another brick in the wall” of 
the empirical foundation of evolutionary psychology (p. 1774). 

While we applaud the effort to provide further testing of the hypothesis, 
we warn against slapping on the mortar just yet. Our reply first addresses 
Kanazawa's criticism of our choice of dependent measures. Then, we 
question Kanazawa's own selection of the sole dependent variable he uses 
in his analysis. Finally, we take another look at Kanazawa’s analysis and 
dispute that the data provide any persuasive support for the hypothesis. 


DOES KANAZAWA’S CRITICISM COMPROMISE OUR STUDY? 


Kanazawa’s criticism of our study is confined to one point: our choice of 
dependent variables related to education in testing the TWH.* The con- 
tested variables include measures directly related to higher education, like 
savings for college, as well as measures like how much parents talk to 
their child about school. Together, they comprise seven of the twelve 
variables that we examine in the National Educational Longitudinal 
Study (NELS), and four out of the six from High School and Beyond 
(HSB). We dispute his criticism below, but important to note first is that 
even if the criticism were correct, it still leaves seven dependent variables 
across the two large, national data sets in our study. Tests of these variables 
using two different measures of status yielded only one significant effect 
in the predicted direction—no more than what one would expect by 
chance—but three significant effects in the opposite direction. Consid- 
erable sensitivity analyses, reported in the text and footnotes of the original 
article, failed to yield any stronger evidence for the hypothesis. Thus, one 
could grant Kanazawa’s criticism of these variables entirely but still see 


psychology and because many of those who have done work on the hypothesis are 
associated more with sociobiology and Darwinian anthropology than with evolutionary 
psychology (see Cronk 1991; see Segerstråle [2000] for discussion of some of these 
distinctions among different programs). 

*In a footnote, Kanazawa offers another possible explanation for our null findings. 
Kanazawa suggests that our results may be compromised by “sample selection bias” 
because the mean reported family income of our estimation sample using NELS is 
$41,600, which he considers “inordinately high” (p. XX). In comparison, he notes that 
the mean family income of all NSFH respondents is $29,100. The comparison is not 
valid: instead, to be (roughly) comparable to NELS, the NSFH sample must be re- 
stricted only to those respondents who have an adolescent child. The mean family 
income for these NSFH respondents is $34,600, and, when we adjust by the sample 
weights, this mean increases to $43,456. 
our results weighing against the application of the TWH in contemporary 
society. 
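
As a rough check on the chance benchmark invoked above, the following sketch computes how many significant effects in the predicted direction one would expect under the null. It assumes 14 tests (seven variables by two status measures), a two-tailed alpha of .05, and independence among the tests; these figures are assumptions made for illustration, and the function name is hypothetical.

```python
def chance_expectations(n_tests, alpha_two_tailed=0.05):
    """Expected count and probability of 'significant' results in ONE direction.

    Assumes every null hypothesis is true and the tests are independent,
    which is a simplification (the same variables are tested twice here).
    """
    # A two-tailed test puts half of alpha in each direction.
    per_direction = alpha_two_tailed / 2
    expected = n_tests * per_direction
    # Probability that at least one test is significant in that direction.
    p_at_least_one = 1 - (1 - per_direction) ** n_tests
    return expected, p_at_least_one


# Assumed setup: 7 remaining variables x 2 status measures = 14 tests.
expected, p_any = chance_expectations(14)
print(f"expected in predicted direction: {expected:.2f}")
print(f"P(at least one in predicted direction): {p_any:.2f}")
```

Under these assumptions a single effect in the predicted direction is unremarkable, which is the sense in which the text treats it as consistent with chance.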

Nonetheless, we are skeptical of his criticism. Kanazawa objects to our 
using education variables because educational investment yields earnings 
and status, which he contends were only important for the reproductive 
success of males in our hunter-gatherer past. Consequently, he proposes 
that our evolved psychological mechanisms “would consider (albeit com- 
pletely unconsciously) investment into children’s education as parental 
investment only for sons, not for daughters” (p. 1767). In effect, this crit- 
icism asserts that the TWH is not directly relevant for “parental invest- 
ment” as the concept primarily has been used by sociologists, who have 
emphasized its connection to socioeconomic outcomes (see the references 
we provide for the measures in our original study). Instead, as his title 
suggests, Kanazawa’s proposal restricts the scope of contemporary ap- 
plicability of the TWH to measures that reflect a parental “desire to be 
with and take care of their children” (p. 1769), which he suggests is psy- 
chologically independent from what motivates parents to invest in 
education. 

If recognized as a post hoc conjecture about how the TWH might still 
be important for understanding contemporary American parental behav- 
ior despite our null findings, we certainly agree the proposal is worth 
testing. We object, however, to Kanazawa’s claim that the failure of the 
education measure is what proper reasoning from a sociobiological/ 
evolutionary psychological perspective would have predicted all along, and 
thus our including these measures in our test was a mistake (a “moment 
of weakness” [p. 1767]). We object because we think that plainly, had we 
found patterns consistent with the hypothesis for these measures, this 
would have been taken as support for TWH. Evolutionary psychological 
arguments supporting this proposal are easily devised: investments in 
a daughter’s capacity to gain resources and status could be interpreted as 
facilitating her acquisition of the best possible mate or promoting the 
welfare of her offspring. Or, because modern higher education has ab- 
solutely no analogue in the Pleistocene, investment in it could have been 
taken as governed by the same unconscious mechanisms of parental love 
as other behaviors, rather than being governed by a separate mechanism 
dedicated to status-linked investment. Or, even if educational investments 
were just partly governed by a desire to “take care” of one’s children, 
this could still be interpreted as implying the observation of the interaction 
effect that is predicted by the TWH, in addition to the main effect of a 
general sex-specific bias. 

Our position here is strengthened by Kanazawa’s treatment of the one 
variable in our study for which a significant effect in the predicted 
direction was observed: parental monitoring as measured in NELS. Kanazawa 
asserts that this measure is evolutionarily biased toward daughters. 
Rather than dismiss it as a “sex-specific” measure, however, he states 
that it too is consistent with his evolutionary interpretation because we 
find that “poor parents monitor their daughters more closely (thereby 
contributing to their daughters reproductive success) than wealthy par- 
ents” (p. 1768). Even if we set aside that the observed pattern was not 
robust,* and that Kanazawa’s description of it is not actually correct,’ 
one could still protest that it seems as though when an effect for a sex- 
biased measure is consistent with the TWH, it is interpreted as supportive, 
but when an effect for a sex-specific measure is inconsistent with the 
hypothesis, it is dismissed as irrelevant. 

Finally, Kanazawa’s criticism implies predictions of its own regarding 
parental investment in education that are not supported by the data. 
According to his argument, instead of testing whether a Trivers-Willard 
effect exists for measures related to education, we should have taken an 
evolutionary psychological perspective to predict that these measures 
should be strongly biased in favor of sons (as they are “investment only 
for sons, not for daughters” [p. 1767]). The analyses from our original 
study bear directly on this prediction. If we look at the effect of child’s 
sex for the 11 measures of education (the main effects models of tables 3 
and 5; see Freese and Powell 1999, pp. 1726-1727, 1734), three effects 
were significant in the predicted direction (bias toward sons), and three were 
significant in the opposite direction (bias toward daughters). Consequently, 
the results do not provide any more systematic support for Kanazawa’s 
evolutionary predictions in this sample than they do for the Trivers-Wil- 
lard hypothesis. 


DOES KANAZAWA OFFER A BETTER DEPENDENT MEASURE? 


Kanazawa presents results from the National Survey of Families and 
Households (NSFH) that are asserted to support the TWH. Although 
below we provide further analyses of these data that cause us to question 
the robustness of this finding, we have more fundamental misgivings 
about the dependent variable used in his analysis. Most important, Ka- 
nazawa’s lone criticism of our study is that we use education-related items 
to test the TWH, but the scale that he uses as his dependent measure 


‘The result was not robust across alternative model specifications, and the pattern 
also was not observed for our monitoring measure from HSB. 

* Our results did not indicate that poor parents monitor their daughters more than 
wealthy parents. Instead, wealthy parents monitored their daughters more than poor 
parents did, but the disparity is less than the difference in how much more wealthy 
parents monitor their sons. 
includes the amount of time parents spend “helping with reading or home- 
work” (p. 1769). We do not understand why this item is acceptably “sex- 
neutral,” while seemingly similar items from our study are dismissed as 
“male-specific.” In justifying why his measure is “sex-neutral,” he reports 
that “none of the component questions in my measure of parental in- 
vestment refers specifically to higher education” (p. 1769). But, if by this 
he means that the sticking point is whether an item specifically concerns 
college, then several more measures from our original study should not 
have been challenged—only two of our measures from NELS and two 
from HSB refer specifically to college. 

In other words, if the homework item is acceptable, then his critique 
of our article would seem substantially weakened, as then 14 of our 18 
dependent variables would not be questioned. If the homework item is 
not acceptable, then it compromises the only dependent variable that 
Kanazawa’s analysis examines. We should also point out that the cor- 
relation of the homework item with the other items in the scale is as high 
as the correlation of these items with one another. This would suggest 
that help with homework does not represent some different dimension of 
parental investment from the other activities that are included in the scale, 
but Kanazawa’s entire critique is predicated on just such a bifurcation. 

In addition, Kanazawa’s analysis has the ecological inference problem 
of attempting to test a hypothesis about investment in an individual child 
with a dependent variable that asks parents how much time they spend 
interacting with their children in various ways. His analyses examine how 
this measure is affected by the presence of daughters and sons ages 10-15 
in the household, but the measure does not distinguish time spent with 
these children from time spent with other children in the same household. 
Consequently, the measure bears only indirectly on the specific sex-by- 
child effect predicted by TWH. We should point out that, following other 
sociological work on parental investment, our study did examine two 
household-level measures of investment (number of educational items in 
the home and involvement in a parent-teacher organization). Unlike Ka- 
nazawa’s study, however, we look at many other measures of investment, 
and our conclusions thus do not rest exclusively on a household-level 
measure. 


DOES KANAZAWA’S ANALYSIS SUPPORT THE TRIVERS-WILLARD 
HYPOTHESIS? 


Even if we are to accept Kanazawa's dependent variable, we are not 
persuaded by the results of his analysis. For his regression analysis, Ka- 
nazawa does not use the modeling strategy that we used in our original 
study, in which the significance tests of the hypothesis were provided by 
interaction terms. Instead, his regression analyses compute separate mod- 
els for respondents in the lowest and highest quartiles, with dichotomous 
independent variables indicating the presence of daughters and sons ages 
10-15 in the household. In the regression of his investment measure on 
the low-income subsample, the coefficient for daughters is positive and 
that for sons is negative; while in the high-income subsample, the signs 
of these coefficients are reversed. This provides impressionistic evidence 
for the hypothesis, but the actual predictions of the hypothesis are about 
the relationship among these coefficients, not the difference of any or 
all of these coefficients from zero. Importantly, the pattern of coefficients 
could be as Kanazawa observes and yet the comparison might not be 
significant; alternatively, the coefficients could be all negative or all positive 
and yet a significant relationship could exist among them that is consistent 
with the hypothesis. The reason again is that the TWH concerns the 
interaction effect of child’s sex and parental status; testing only whether 
the four coefficients differ from zero confounds the test of the interaction 
with any main effects of sex (or, for that matter, child’s age) on parental 
investment. 
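
The distinction drawn here between the sign pattern of the four coefficients and the relationship among them can be made concrete with a small sketch. The coefficient values below are hypothetical and chosen only for illustration, as are the function names; the contrast is the low-minus-high difference for daughters minus the same difference for sons.

```python
def twh_contrast(lo_dau, lo_son, hi_dau, hi_son):
    """(low - high) difference for daughters minus (low - high) for sons.

    A positive value is in the direction the hypothesis predicts.
    """
    return (lo_dau - hi_dau) - (lo_son - hi_son)


def crossing_signs(lo_dau, lo_son, hi_dau, hi_son):
    """True when the signs match the 'impressionistic' pattern:
    daughters positive / sons negative in the low group, reversed in the high group.
    """
    return lo_dau > 0 and lo_son < 0 and hi_dau < 0 and hi_son > 0


# Hypothetical coefficients (not taken from any data set).
crossing = dict(lo_dau=0.02, lo_son=-0.01, hi_dau=-0.01, hi_son=0.02)
all_positive = dict(lo_dau=0.90, lo_son=0.30, hi_dau=0.20, hi_son=0.80)

for label, coefs in [("crossing", crossing), ("all positive", all_positive)]:
    print(label, crossing_signs(**coefs), round(twh_contrast(**coefs), 2))
```

In this made-up example the set with crossing signs has a much smaller contrast than the set in which every coefficient is positive, so the sign pattern alone says little about the size of the interaction, and nothing about its significance.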

Instead, under Kanazawa’s specification, if one subtracts the coeffi- 
cients from the regression on the highest quartile sample from the coef- 
ficients from the lowest quartile sample, the TWH predicts that the dif- 
ference for the variable “daughter age 10-15” will be greater (more 
positive) than the difference for the variable “son age 10-15.” To test the 
hypothesis, we respecified the two regressions as a single model with 
interactions of each independent variable by status group and then 
conducted a Wald test of the null hypothesis 
$\beta_{LO,DAU} - \beta_{HI,DAU} = \beta_{LO,SON} - \beta_{HI,SON}$. 
In reconstructing Kanazawa’s analysis, we discovered a few minor 
errors in coding, which affect the sample size. Our table 1 presents 
the corrected estimates of the key effects from Kanazawa’s original study 
for the analysis of children 0-18.’ The Wald test is significant at the 
P < .01 level. 
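
A minimal sketch of a test of this kind follows. It is not the pooled interaction model described above; it takes the daughter and son coefficients from two separately estimated regressions, assumes the low- and high-status groups are independent samples, and tests whether the low-minus-high difference for daughters equals that for sons. All numbers and names are hypothetical.

```python
import math


def wald_test_twh(b_lo, b_hi, cov_lo, cov_hi):
    """Wald test of H0: (b_lo_dau - b_hi_dau) == (b_lo_son - b_hi_son).

    b_lo, b_hi   : (daughter, son) coefficients for the low/high-status group.
    cov_lo, cov_hi: 2x2 covariance matrices of those two coefficients.
    The two groups are assumed to be independent samples.
    """
    # Difference-in-differences of the four coefficients.
    contrast = (b_lo[0] - b_hi[0]) - (b_lo[1] - b_hi[1])
    # Var(daughter - son) within each group; groups add because independent.
    var_lo = cov_lo[0][0] + cov_lo[1][1] - 2 * cov_lo[0][1]
    var_hi = cov_hi[0][0] + cov_hi[1][1] - 2 * cov_hi[0][1]
    wald = contrast ** 2 / (var_lo + var_hi)
    # Upper tail of a chi-square with 1 df equals the two-tailed normal tail.
    p_value = math.erfc(math.sqrt(wald / 2))
    return contrast, wald, p_value


# Hypothetical estimates with the crossing sign pattern.
b_lo, b_hi = (0.10, -0.05), (-0.04, 0.08)
cov = [[0.01, 0.0], [0.0, 0.01]]
contrast, wald, p = wald_test_twh(b_lo, b_hi, cov, cov)
print(f"contrast={contrast:.2f}  Wald={wald:.3f}  p={p:.3f}")
```

With these invented inputs the signs cross as the hypothesis would suggest, yet the contrast does not reach conventional significance, which is why the test is applied to the contrast rather than to the individual coefficients.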

Thus, although Kanazawa does not explicitly provide a significance test 


* Following Kanazawa and our earlier study, we use two-tailed tests, although of course 
those who prefer one-tailed tests for directional hypotheses may divide the P-values 
by two. We are also more strongly inclined toward two-tailed tests here because in 
some of our ancillary analyses on these data the results were not consistently in the 
predicted direction. 

” Because of space limitations, we present results only for households with any children 
ages 0-18 here, rather than also including analyses of households only with children 
ages 5-18. This effectively provides a more liberal examination of the robustness of 
the findings, as in Kanazawa's original analysis it was this larger subsample that had 
the seemingly stronger results. 
of the TWH, the results of his baseline model do appear to provide sta- 
tistically significant support for the hypothesis. At the same time, it might 
seem odd that the sample size for the top quartile in Kanazawa’s analysis 
is more than three times larger than the bottom quartile. This is mainly 
due to the listwise deletion of cases with missing values for occupational 
prestige. (We believe that including this control is difficult to theoretically 
justify anyway and that it should have been excluded.*) We expected that 
recovering these deleted observations would strengthen the statistical sig- 
nificance of the pattern of regression results, as increasing the sample size 
typically does. Instead, the second column of table 1 reveals that when 
the control for occupational prestige is excluded, the P-value of the test 
actually increases to where it is only significant at the P < .10 level. 
This is already not a strong finding, but we might be more persuaded 
by it if alternative specifications yielded at least similarly marginally sig- 
nificant results. We believe that examining the robustness of results is 
necessary to determine if a finding rests on analytic decisions that are not 
directly germane to the hypothesis. The remaining columns of table 1 
present the results of various sensitivity analyses that we conducted. Each 
represents an alternative way that the analysis could have defensibly been 
done. What if parental education had been used to measure status instead 
of household income, as we had also tested in our study? What if the 
sample had been divided into income sextiles instead of quartiles? 
(If the argument is that Trivers-Willard effects are only observed at the 
extremes, then using sextiles should result in larger differences than quar- 
tiles). What if we had run the analyses separately for mothers and fathers? 
What if the key age range examined had been children 13-16 years 
old—corresponding to the typical age of the eighth and tenth graders in 
our NELS and HSB samples—instead of 10-15 years?” What if respon- 
dents who had any coresident daughters ages 0-18 had been compared 


* Put briefly, if one has already divided the sample into high and low status groups 
by family income, what is the purpose of holding occupational prestige constant within 
each status group, especially if education also is included as a control in the model? 
* The issue of child’s age is especially important, and we looked in more detail at how 
the results of the analysis changed depending on the age group that one considered. 
Although our original study focused on adolescents, the Trivers-Willard hypothesis is 
not specifically a theory of adolescent parental investment; it is not 
age-specific (see Anderson and Crawford 1993). As we pointed out, studies by sociobiologists 
have looked at the TWH in the context of human behaviors toward offspring ranging 
from infants to adult heirs (Freese and Powell 1999, p. 1737). For example, in one 
model, we compared the age groups 0-4 years, 5-9, 10-15, and 16-18. When the 
relevant differences among coefficients were computed, two age groups yielded results 
in the predicted direction, and two age groups yielded results in the opposite direction, 
none of which were significant. Alternative groupings by age yielded similarly 
haphazard results. 
to respondents who had any sons? In none of these cases do the results 
provide significant support for the hypothesis. Also, note that in none of 
the alternative specifications do the four coefficients differ from zero ac- 
cording to the pattern that was taken as providing support in Kanazawa’s 
comment. We must make plain that this was not a matter of our “fishing 
around” for alternative specifications that did not work, but rather we 
conducted some of the more obvious sensitivity analyses to see if the 
finding is robust. 

The fragility of the results becomes even clearer when we re-examine 
the graph of mean differences that follows Kanazawa’s regression analysis 
(his fig. 1). The crossing pattern of lines may again provide impressionistic 
support, but no actual significance test is conducted. Our own calculations 
indicate that the relationship presented in this graph is at best marginally 
significant (P = .127; P = .122 for the other panel in Kanazawa’s fig. 
1),° but even this weak relationship is exaggerated by connecting the lines 
between the highest and lowest quartile means, which may suggest a 
linear relationship between income and investment. Panel a of our figure 
1 reproduces Kanazawa’s graph for households with children ages 5-18, 
while panel b provides a more fine-grained look at the relationship by 
dividing the data into deciles and calculating the means of the investment 
measure within each decile. This graph shows that the relationship be- 
tween income, child’s sex, and the dependent variable is far from straight- 
forward. Yet this still exaggerates the extent to which the data conforms 
to the predicted pattern of the TWH, because women are disproportion- 
ately represented among lower-income respondents (86% of the lowest 
quartile respondents are female, compared to only 59% of the highest 
quartile respondents) and because fathers and mothers both, on average, 
report higher levels of parent-child activities when they have a same-sex 
child ages 10-15. Panels c and d present separate graphs for mothers and 
fathers, which even impressionistically do not lend support for the hy- 
pothesis. Conducting the same test of conditional means as before but 
taking mothers and fathers separately, we also find that in neither case 
do the differences between lowest and highest quartiles provide significant 
support for the hypothesis (P = .51 for mothers; P = .49 for fathers). 
This further corroborates our concern about the robustness of the claimed 
support for the TWH. 
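
The decile breakdown described above can be sketched as follows. This is a minimal illustration only: the record layout (income, child group, investment score) and the function name are assumptions, and the cut points use the default method of Python's `statistics.quantiles`, which may differ from the procedure used for the published figure.

```python
from bisect import bisect_right
from collections import defaultdict
from statistics import mean, quantiles


def decile_means(records):
    """Mean investment by (income decile, child group).

    records: iterable of (income, group, investment) tuples, where group
    is a label such as "daughter" or "son" (hypothetical layout).
    """
    records = list(records)
    # Nine cut points split the income distribution into ten bins.
    cuts = quantiles([income for income, _, _ in records], n=10)
    cells = defaultdict(list)
    for income, group, investment in records:
        decile = bisect_right(cuts, income) + 1  # 1 = lowest, 10 = highest
        cells[(decile, group)].append(investment)
    return {cell: mean(values) for cell, values in sorted(cells.items())}


# Toy data: incomes 1..100, group alternates, investment equals income.
toy = [(i, "daughter" if i % 2 == 0 else "son", i) for i in range(1, 101)]
for cell, value in decile_means(toy).items():
    print(cell, value)
```

Running the same function separately on records for mothers and for fathers would give the kind of split described for panels c and d, since pooling the two can mix a sex-of-parent composition effect into the income comparison.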

Kanazawa acknowledges that the results reported in his article are 
“quite weak” (p. 1774), but our sensitivity analyses suggest that they are 
so weak that their statistical significance is highly contingent on the re- 


12 We tested the difference in conditional means by specifying a regression model with 
terms for the interactions of child’s sex and status (and no controls) and then testing 
the null hypothesis that $\beta_{LO,DAU} - \beta_{HI,DAU} = \beta_{LO,SON} - \beta_{HI,SON}$. 
searcher making a particular set of analytic decisions that are not directly 
related to the hypothesis (i.e., what age range of respondents to consider, 
how to divide the sample into upper- and lower-status groups). Consequently, 
if one regards statistical significance and the robustness of results as being 
important desiderata for hypothesis testing, then we question whether the 
results should be regarded as supporting the hypothesis. 


CONCLUSION 


Our original study attempted to apply the TWH to the sociological 
conceptualization of parental investment, but we found no support for 
the theory in our sample of contemporary American adolescents. Ka- 
nazawa challenges our measures as inappropriate and provides his own 
study, which he claims supports the hypothesis. Among other points in 
this reply, we argue that Kanazawa’s critique leaves several null findings 
from our original study unchallenged, implies its own evolutionary pre- 
dictions that are not supported by the data, and contradicts the con- 
struction of the dependent variable for his own study. We have also 
questioned the claim that it was a mistake for us to have tested the 
TWH for education measures in the first place by arguing that an ev- 
olutionary perspective would have easily accommodated supportive 
findings consistent with the TWH, had we found them. Additionally, 
we show that his results are weak and fragile at best and that they are 
statistically significant only within a very limited range of possible and 
largely arbitrary analyst’s decisions. Consequently, although we com- 
mend Kanazawa’s effort to subject the TWH to further consideration, 
his comment does not persuade us that the TWH is important for un- 
derstanding either why we invest in our children or why we love them, 
at least in the contemporary United States. 

What about other studies? Kanazawa is incorrect when he reports 
that the TWH has “been supported by a large number of empirical 
studies on societies across history and throughout the world” (p. 1766).” 
Of the two studies that Kanazawa cites regarding the contemporary 
United States, we raised concerns about one in our earlier article (Gaulin 
and Robbins 1991; see Freese and Powell 1999, p. 1713 n. 6), and the 
other is a study of U.S. presidents that actually finds no evidence for 


1 The “comprehensive review” by Cronk (1991) that Kanazawa cites is actually a 
discussion of cases of female-biased parental investment, not the TWH per se. Only 
13 selective groups are summarized (pp. 410-11), and of these only eight have patterns 
that are claimed to possibly fit the Trivers-Willard model. The only contemporary 
developed society among these eight is the United States (based on Gaulin and Robbins 
[1991]). 
the hypothesis in its more contemporary sample (Betzig and Weber 
1995). By the same token, recent analyses of other large data sets also 
do not provide support for the TWH among younger children. Using 
the Child Development Supplement to the Panel Study of Income Dy- 
namics, Keller, Nesse, and Hofferth (in press) find no support for the 
TWH in tests that include four dependent variables that would seem 
to reflect closely on parental love and that have nothing to do with 
education: two measures of the warmth of the parent-child relationship 
(parent report and interviewer assessment) and two measures of the 
amount of time parents spend with their child. We have also examined 
the applicability of the Trivers-Willard hypothesis to the investment 
measures in the recently available Early Childhood Longitudinal 
Study—some of which are related to education, and some not—and we 
also find no support for the hypothesis for these families of kindergarten- 
age children (Freese and Powell 2001). Taken together, these studies 
increase the number of null findings that weigh against the American 
application of the hypothesis. 

In trying to explain why weak effects might be observed in the con- 
temporary United States, Kanazawa suggests that the material wealth 
of the United States has reduced the “necessity to choose one sex or 
another,” which he identifies as “an important scope condition of the 
[TWH]” (p. 1774). We are skeptical of this specific conjecture, given that 
investment in potential offspring, especially in contemporary societies, 
is balanced against a host of other drains on adult resources (perhaps 
of greatest sociobiological interest, investment in other kin, status seek- 
ing, and efforts to secure additional mating opportunities). In any event, 
many of the tested measures concern uses of time, which contemporary 
parents do not obviously possess abundantly more of than their Pleis- 
tocene ancestors. 

Nevertheless, it is perfectly possible that the evolved psychological 
mechanisms suggested by TWH do exist but that the predicted behav- 
ioral patterns are not observed because of some aspect of the novelty 
of contemporary, developed societies. Before we put too much stock in 
this, however, we would like to see more systematic cross-cultural ev- 
idence that favors the TWH. As things stand now, we are concerned 
by the seeming presumption that mechanisms shaped by a Trivers-Wil- 
lard dynamic are “part of our innate human nature” (p. 14), and that 
null findings are merely elaborating the exceptional scope conditions of 
this default state of mind. Such a presumption needs to be closely 
grounded in evidence or else it only increases the impression that some 
sociobiologists, not unlike some advocates of other theoretical programs, 
assume the truth of propositions that they should be more judiciously 
testing. 


JEREMY FREESE 
University of Wisconsin, Madison 

BRIAN POWELL 
Indiana University 
On Sociology: Numbers, Narratives, and the Integration of Theory and 
Research. By John H. Goldthorpe. Oxford: Oxford University Press, 2000. 
Pp. viii+337. $65.00 (cloth) $24.95 (paper). 


Stephen Turner 
University of South Florida 


When the most distinguished empirical social researcher in Britain takes 
on the problem of the relation between theory and research, places the 
issues in their larger historical setting (based on wide and accurate reading 
in the historical literature), and also states the issues in current technical 
terms, and does so with both panache and bite, we get a book that is 
well worth reading. The question that Goldthorpe examines is this: Does 
the historical reality of conflict between theorists and researchers “reflect 
some inherent incompatibility between their concerns . . . or was their 
failure to come together a matter of various unfavorable circumstances, 
albeit of a long lasting kind?” (p. 26). Implicitly this is also an answer to 
an unstated prior question, namely, How is it possible to get around the 
negative expectations that rationally follow from the fact that all previous 
attempts to create a statistical scientific sociology have failed? 

What Goldthorpe thinks should have happened in the history of so- 
ciology is this: something along the lines of rational choice should have 
been married to something along the lines of Yule’s statistics so that 
hypotheses about mechanisms, meaning essentially mechanisms by which 
aggregate level results are produced by individual choices made in varying 
contexts, could be tested. This, Goldthorpe argues, is where the future of 
sociology lies, and the mystery is why it did not come sooner. Goldthorpe’s 
answer is unfavorable circumstances of a long-lasting kind, but the blame 
he places is intellectual and, as he notes, also ironic, for it falls in part 
on the 19th-century positivist conception of science and its lingering neg- 
ative effects on the development of probabilistic analysis. Durkheim, be- 
dazzled by Comtean ideas about science, failed to take advantage of Yule. 
Statistical sociologists frequently confused the establishment of statistical 
phenomena with their explanation, thus producing “laws” and regularities
that they imagined were scientific but which begged the question
“why?” Theorists failed too, and Goldthorpe’s bête noire is the kind of
theory that concerns itself with analyzing assumptions to the exclusion 
of considering empirical evidence, in the fashion of Jeff Alexander. 

Goldthorpe discusses these issues in the light of a major example: he 
examines research on the stubbornly continuing fact of class education 
differentials despite the changes in the cost structure, policy, and many 
other things, and he works through a detailed discussion of the theoretical
issues, showing how the data interact with and shape a rational choice 
analysis that pinpoints the choices and the contexts of choice that make 
for the continuing differentials. The force of Goldthorpe’s case comes 
down to whether one takes this example as he does, namely as a model 
for sociology and a case of successful application of theory to data and 
vice versa. He makes the argument in part in a chapter on ethnography, 
in which he challenges the claims of ethnographers to provide a meth- 
odological alternative that achieves something that large data set analysis 
cannot. He argues that given the general logic of testing of hypotheses, 
cases do not establish much at all, but he suggests that there might be a 
role for ethnography in testing hypotheses about “local causes,” that is, 
about mechanisms, operating in particular contexts and not others, that 
cannot be tested in large data sets. 

Does all of this convince? Certainly it is an advance on previous dis- 
cussions, in that Goldthorpe acknowledges some of the massive difficulties 
in generating causal conclusions from structural equation models and 
recognizes the sheer intellectual inadequacy of much of what passes as 
explanation in statistical sociology, though he could certainly have gone 
much further in this direction. It is also an advance in that it recognizes 
that claims that sociology is a “science” have gotten in the way of dealing 
with these and other methodological problems. But what Goldthorpe 
seems blind to is a recognition of the role of concepts and the study of 
concepts and their changes in social life and history. That humans form 
concepts, conventions, and practices, and that the changes in these change 
the form of human existence, and that understanding social life involves 
necessarily and perhaps largely an understanding of these things as they 
actually figure in human action, seems simply not to be a part of Gold- 
thorpe’s model of sociology. Calculation, or rational choice, occurs within 
understanding, but it cannot produce understandings, concepts, or prac- 
tices. This fact seems to be the source of an inherent incompatibility 
between the concerns of theorists and the concerns of quantifiers of pre- 
cisely the kind that he claims does not exist. And by the way, could the 
concept of rational choice itself have developed other than by analyzing 
assumptions to the exclusion of empirical evidence? 


The Social Construction of What? By Ian Hacking. Cambridge, Mass.: 
Harvard University Press, 1999. Pp. x+261. $16.95 (paper). 


Steven Shapin 
University of California, San Diego 


Consider two criticisms of the idea that scientific knowledge is socially 
constructed: the first is that the notion is misconceived or malevolently 
motivated; the second is that it is banal. We are these days more familiar 
with the first criticism. Defenders of science say, for example, that scientific
theories are real in the same sense as the rocks on the ground. If you 
think such theories are mere social constructions, then you should walk 
off tall buildings with insouciant disregard for what physicists have to 
say about your likely fate. 

The second criticism is both less silly and less frequently heard. When 
one says that “the pressure and the volume of a gas stand in inverse 
proportion,” this proposition, idea, and associated belief stands in a causal, 
but contingent, relationship to the state of affairs in nature to which it 
refers. It is a proposition in English, whose sense depends upon the stock 
of knowledge that you happen to have about the meanings attached to 
gases, pressure, and volume, and whose credibility is shaped by processes 
of socialization in cultural institutions. What else could it be but a social 
construct? The only really interesting task for the sociologist or historian 
is to give an account of the ways in which different sorts of ideas are 
socially constructed. 

That's a version of what the philosopher Ian Hacking seems to think,
and it is a pretty sensible thing to think. Whenever you hear someone 
say that X is socially constructed, first consider what kind of thing X is. 
Hacking judges that sociologists have been careless in specifying what is 
supposed to be socially constructed, and he means to sort out sensible 
from imprudent usage. The effect is rather like being a member of a class 
collectively summoned into the headmaster’s study. He is very wise, gen- 
erally benevolent, and good-humored; but just now he is rather disap- 
pointed in the class’s behavior. 

Should a sociologist say that Boyle’s law was wholly a social construct, 
and should she mean that the state of affairs in nature was socially con- 
structed, and that the law was not, therefore, a reliable generalization, 
she would be both mistaken and unjustified. The unverbalized natural
reality to which Boyle’s law refers is a causal element in the idea expressed 
by PV = k, though Hacking agrees that it is circular to use the truth of 
an idea as an explanation of why people believe it. Should a sociologist, 
however, say that the federal funds rate is wholly a social construct, she 
might be accused of vacuity, since there is nothing else that such a thing 
could be. Reality and consequentiality are not to be set against socially 
constructed status: businesses succeed or fail according to what Alan
Greenspan and his colleagues say the interest rate is.

Moreover, humankind terms—child abuse, for example—can actualize 
their objects, because people may be aware (as gases are not) of how they 
are classified, and behave accordingly. You must not say that the abused 
child is a social construct if by that you mean that children have not been 
hurt, but you may coherently say that the institutions mobilized around 
the idea of child abuse are, of course, social constructs, and you may even 
say that, by virtue of these institutionalized ideas and practices, the abused 
child becomes a hurt human of a specific sort: a real, socially constructed 
sort. 

I have only two objections to this generally well-intentioned and intelligent
performance. First, social studies of science are here represented
almost solely by Bruno Latour and Andrew Pickering. Hacking finds them 
worthy representatives because, like himself, they pitch their work on the 
terrain of metaphysics and epistemology, aiming to say what sorts of things 
the world contains and how we may know about them. But many other 
sociologists decline the role of metaphysician or epistemologist, and, by 
saying that science is a social construct, they refer mainly to the credibility 
of its ideas, beliefs, and propositions, both within and without the scientific 
community. Hacking happens not to be very interested in such projects, 
and that’s his right, but he happily concedes that the sociology and history 
of science done using the idea of social construction as a methodological 
maxim are far more rich, detailed, and philosophically resonant than what 
went before. 

Second, not all the sociologists that Hacking aims to sort out are quite 
as confused or as imprecise as he maintains. Disinterested readers might 
be surprised by comparing Hacking’s account of the differences between 
humankind and natural-kind terms with Barry Barnes’s important essay 
“Social Life as Bootstrapped Induction” (Sociology 17:524-45). But this,
and much writing in a similar vein by Barnes and David Bloor, is not 
mentioned, and Hacking makes glib generalizations about their views 
that bear little relationship to their actual work. Headmaster might know 
his pupils a bit better. 


Ideologies of Breast Cancer: Feminist Perspectives. Edited by Laura 
K. Potts. New York: St. Martin’s Press, 2000. Pp. xiv+213. $49.95. 


Julie Hepworth 
Queen Margaret University College 


Laura K. Potts’s edited collection of research on the meanings of breast 
cancer includes authors from the United Kingdom, the United States, and 
Canada whose perspectives draw on literary criticism, sociology, psy- 
chology, and cultural studies among others. The research employs various 
methodological approaches—for example, media analysis (Saywell et al.), 
autobiographical narratives (Potts), and analysis of social activism (Fish- 
man)—to elucidate the multiple dimensions and diversity of breast cancer 
experiences. The first of two parts, “Meanings of Breast Cancer,” presents 
the problematical relationship between biomedicine and women’s con- 
structions of breast cancer knowledge, the sexualized and maternalized 
breast in the print media about breast cancer, environmental risks to 
women’s health in the Bay Area of San Francisco, and women’s narratives 
of breast cancer and situating the self. In part 2, “Discourses of Risk and 
Breast Cancer,” examination of the discourses of prevention and risks to 
health are taken up in relation to breast cancer screening, the problem 
of prophylactic mastectomy for hereditary breast cancer, and environmental
activism.

“Ideology” and “feminism” are the two key conceptual frameworks that 
are revisited throughout the book to research women’s experiences. Potts 
defines these frameworks early in the book’s introduction, drawing on
Althusser’s claim that “‘lived’ experience is not a given, given by a pure ‘reality,’
but the spontaneous ‘lived experience’ of ideology in its particular rela- 
tionship to the real” (p. 2), and on Dorothy Smith’s work on a sociology 
for women which “preserves in its analytic procedures the presence of the 
subject as actor and experiencer” (p. 2). The articulation of the social and 
political structuring of experience is clearly accomplished throughout the 
book in the ideological and feminist analyses of the intersections of bio- 
medicine, social activism, popular culture, environmental awareness, and 
women making sense of disease and illness. 

The sexualized and maternalized breast in the print media analyzed 
by Saywell et al. in chapter 2 represents women’s narratives of breast 
cancer outside the popular media and also demonstrates the ways in which 
public narratives about breast cancer in the media situate women. For 
example, as in a quote from a newspaper, “Mother Turns Down Cancer 
Treatment to Save Unborn Baby,” women become depicted as saints and 
martyrs through “mother-centred stories” (p. 50). At times difficult to read 
due to the poignancy of women’s stories, chapters 2 and 4 tell us about 
the simultaneous dimensions of women’s realities of having a life-threat- 
ening disease yet bearing a new life, decision-making about chemoprev- 
ention, and the loss of femininity through chemotherapy and prophylactic 
surgery. 

Key concerns of social inquiry such as age, race, and class are particularly
highlighted in chapters 2, 5, and 1, respectively. In chapter 2, Saywell
et al. show how breast cancer reporting prefers youthful representations
of women. In contrast to the reality of epidemiological data in
which the majority of women who get breast cancer are 50 years old and 
older, they argue that young women’s bodies dominate coverage of media 
reports on breast cancer. Simpson in chapter 5 argues that the results of 
research on the preventative effects of tamoxifen, a major pharmaceutical 
drug for breast cancer, only apply to white women because of the sample 
characteristics of trial participants. Of all participants, only approximately
3% were African-American, Asian-American, Hispanic, or
other, providing extremely limited findings for women from different eth- 
nicities. The shifting nature of class related to breast cancer is taken up 
in chapter 1 by Fosket, who emphasizes the financial expense of breast 
cancer. For example, leaving work and high costs of cancer treatments 
mean economic hardships for women and class movements within a short 
period of time. 

The book makes an important contribution to social and health research 
about women’s experiences of breast cancer through its exploration of 
the sheer breadth of discourses, controversial issues related to prevention 
and risks to health, and how these coalesce in the construction of the
meanings for individual women as victims/survivors of breast cancer. For 
example, Klawiter argues that she approaches the terrain of breast cancer 
activism using theorization at the “juncture of culture and social move- 
ments” (p. 64). Using participant observation and a concept of “cultures 
of action,” Klawiter examines the interplays between embodiment, sub- 
jugated knowledges, and culture within sociological inquiry in a reaffir- 
mation of the value of contested meanings to amplify voices that were 
hitherto silenced. 

This book takes on analyses of the dynamic interplay of discourses in 
the social construction of meanings about breast cancer, above all achiev- 
ing clear arguments about the complexity of the ways in which “knowl- 
edges” and “reality” are socially constructed and mediated through social 
relations. Moreover, the book is firmly situated within the broader context 
of feminist perspectives on health in which women’s experiences are the 
basis for research. Laura K. Potts has brought together feminist research- 
ers who expose a litany of issues and problems with which the health 
field has to grapple in order to develop its understanding of women’s 
experiences of the prevention and treatment of breast cancer. The breadth 
of the book also makes it a key reading for a wide audience—from those 
in sociology, nursing, women’s studies, psychology, and medical science, 
to various health professionals, particularly medical specialists, radiologists,
and diagnosticians.


Venus on Wheels: Two Decades of Dialogue on Disability, Biography, and 
Being Female in America. By Gelya Frank. Berkeley and Los Angeles: 
University of California Press, 2000. Pp. xv+284. $50.00 (cloth); $19.95
(paper). 


Diane E. Taub 
Southern Illinois University at Carbondale 


The number of scholarly books and articles about women with physical 
disabilities has increased in the past several years. Most of these works, 
however, have been either commentaries or theoretical discussions about 
women with physical disabilities. Few efforts have focused on research 
that examines how the women themselves interpret their daily lives and 
define their social interactions. Venus on Wheels represents the type of
approach that incorporates the voices of women with physical disabilities. 

Gelya Frank’s work is exceptional in several respects, notably in its 
rigorous and feminist methodology, historical view of disability and the 
disability rights movement, and empowering and liberating portrayal of 
Diane DeVries, a woman born without arms and legs. This book encom- 
passes nearly a quarter century of the professional collaboration and per- 
sonal relationship between Frank and DeVries. The two women met in 
1976 at the University of California at Los Angeles. DeVries was 26 and
an undergraduate sociology major enrolled in a cultural anthropology 
course, for which Frank, at age 28, was a graduate teaching assistant. 
Frank is unabashed in revealing her first memories about DeVries, be- 
lieving that DeVries was probably dependent on her parents and living 
at home. Frank also assumed that DeVries would remain single and never 
have sex. 

Reflecting her anthropological training, Frank defines her research as 
cultural biography, a synthesis of ethnographic and life history methods. 
This work utilizes a variety of methodological approaches, including par- 
ticipant observation, DeVries’s unpublished autobiography, clinical re- 
cords, and letters written by DeVries, as well as formal and informal 
interviews with DeVries and with individuals who were salient in her 
life. Emphasizing empathy, or the “mirror phenomenon” (p. 4), Frank 
continually reflects on her own viewpoints and recognizes how
the interconnectedness between Frank and DeVries affects the represen- 
tation and interpretation of DeVries’s life. However, the frequent self- 
disclosures of Frank and discussions about her life are not always relevant 
and detract from the portrayal of DeVries. 

Throughout this work, Frank expresses her strong desire that DeVries, 
and not Frank, narrate and construct DeVries’s life. In discussions about 
their collaboration, Frank and DeVries agree to split any royalties fifty- 
fifty and to resolve any concerns DeVries might have about the material 
Frank includes. Frank is keenly aware of the power differential between 
the “ethnographer and ‘native’,” but underplays her own role when she 
claims that DeVries has “the ultimate power, appropriately, over me” since 
she could reject Frank’s work at any time (p. 108). 

DeVries was born in Texas in 1950 to white, working-class parents. 
During her childhood, she swims, becomes a poster child for the March 
of Dimes, and is resistant toward artificial arms and cosmetic legs. In 
adolescence, tensions escalate with her mother over her care, and DeVries 
moves to a rehabilitation facility. While in this environment, her embrace 
of disability culture, independent living, and disability rights is strength- 
ened. As an adult, DeVries participates freely in sexual relationships, gets 
married, becomes involved in religion, divorces an abusive and alcoholic 
husband, receives her bachelor’s degree in sociology, obtains employment, 
encounters job discrimination, gets fired, sues, and obtains a master’s 
degree in clinical social work. At last report in the book, DeVries lives 
in her own apartment and is employed as a social worker. 

Rather than focus on the individual nature or the physicality of 
DeVries’s disability, Frank highlights DeVries’s experiences in relation to 
the prevailing cultural climate affecting women and individuals with dis- 
abilities. Similar to other women during the 1960s and 1970s, DeVries is 
influenced by the women’s movement and strives for independence, ed- 
ucation, and a career. At the same time, she encounters social stigma 
because of her disability and confronts legal and institutional barriers. 

A drawback of this book is the extensive academic excursions in which
Frank indulges. The resulting distraction would be reduced with fewer 
and less detailed allusions to various academic debates. This work is 
supplemented with 57 pages of notes, providing in-depth discussion about 
such topics as cultural ethnography, cultural studies, and state and federal 
programs and laws concerning disability. In addition, the book contains 
38 pages of a comprehensive bibliography, along with a quite helpful 
index that includes both names and topics. Further, nearly two dozen
photographs and newspaper clippings depicting DeVries’s life are pre- 
sented in the center of the book. Individuals interested in either gender 
or disability studies will find this work instructive and thought-provoking. 

The reader is not led to feel sympathy for DeVries nor to consider her 
an anomaly. On the contrary, Frank and DeVries identify DeVries with 
the Venus de Milo. DeVries perceives herself as “‘a modern-day Venus’, 
not on a pedestal but in a wheelchair: Venus on wheels” (p. 162). Her 
story is one of empowerment and of recognition that disability reflects 
diversity rather than a condition of pathology or deviance. 


Gendering the City: Women, Boundaries, and Visions of Urban Life. Ed- 
ited by Kristine B. Miranne and Alma H. Young. Lanham, Md.: Rowman 
and Littlefield, 2000. Pp. x+229. $75.00 (cloth); $24.95 (paper). 


Lauren Costello 
University of Melbourne 


In this book, Miranne and Young have assembled a collection of articles 
building on the past two decades of work investigating women and the 
urban form. The book consists of 10 essays, emanating from the 1997 
Urban Affairs Association Conference, that examine the visible and in- 
visible boundaries women negotiate. In particular, the works focus on 
notions of boundaries, the ways that women are implicated in spatial 
boundaries, and examples of how women create new spaces within the 
built environment that reflect the diversity of women’s lives. The editors, 
in the introductory chapter, briefly outline the bodies of knowledge focusing
on women and the urban environment across a range of disciplinary
fields, and they set out a framework that acknowledges the multiplicity
of women’s identity and experiences.

The first two chapters begin
the task of problematizing the notion of boundaries and suggest how, in 
quite different ways, these boundaries are constantly shifting, both con- 
straining and enabling women in certain ways. Garber, for example, takes 
the notion of anonymity to be an example of a boundary that has been 
predominantly interpreted as a method for women and other marginal 
groups to participate in cities. Anonymity, she suggests, can enhance 
women’s interaction in cities, but it can also be limiting, as it assumes 
anonymity can be chosen or discarded at will. Garber concludes that, as 
a political strategy, anonymity fails to allow for difference to be acknowledged.
In the same way, Peters considers the silencing of First Nation
women of Canada and how they negotiate everyday urban spaces. She 
argues that spatial knowledges become boundaries that produce uneven 
understandings of women where not all women are meaningfully 
represented. 

In section 2 of Gendering the City, four chapters examine women’s 
multiple subjectivities and range of experiences. Gilbert employs an anti- 
essentialist framework to rethink geographical understandings of the in- 
tersection of gender, race, and poverty. Subban and Young use literacy 
as an example of how cultural knowledges are imbedded in language. 
Especially among African-American women, literacy is shown to be a 
powerful mediator of social relations, where political capacities can be 
diminished as a result of literacy levels. Spain outlines how black women’s 
influence on planning has been hidden, lost, or forgotten over time. She 
chronicles the contributions of black women and organizations to city 
planning and how their efforts mitigated some of the excesses of racial 
discrimination. In the final chapter, Miranne looks at the impact welfare 
reform has on further isolating and disempowering women. She argues 
that welfare reform creates uncertainty that places women in a powerless 
position. This essay outlines how women resist the ways that the state 
socially and geographically isolates women. 

Moving beyond descriptions including women in discourses of the ur- 
ban form, the final four chapters challenge modernist orthodoxy, arguing 
for the production of knowledges that reflect the multiplicity of women’s 
experiences. Hendler and Harrison critique Canadian planning history 
and illustrate the gendered bias that now constitutes planning theory. 
Adding to this theme on planning, Andrew takes one instrument of plan- 
ning, safety audits, to illustrate how they can result in either reinforcing, 
constraining, or dismantling boundaries for women in urban spaces. She 
concludes that planning tools like safety audits can have a multiplicity 
of impacts and can also aid in removing some boundaries for women. In 
contrast to using a planning tool, Ritzdorf critiques the ways that black 
families are constructed as “other” by the discourses of public policy. She 
shows how zoning becomes a method of physically creating boundaries 
and also how these discourses reinscribe normative uses of urban space. 
The final chapter outlines the meanings of home for women and children 
and the problems single mothers face in the metropolitan housing system. 
Cook, Bruin, and Crull suggest that planners and policy makers do not 
consider the meaning of home for women and instead focus on the pract- 
icalities of housing, thus ultimately diminishing single mothers’ choices 
in the housing market. 

The concluding chapter, by Milroy, returns to the political gesturing of 
this collection to reflect on the intersection of women and cities. Milroy 
reasserts how an antiessentialist framework is a feminist politics that is 
possible and worth pursuing. Gendering the City represents a contemporary
collection of essays that examine a diversity of women’s experiences
while successfully adhering to the main theme of visible and invisible
boundaries. The political positioning of the book both deconstructs
the silencing of women and provides refreshingly innovative ways of 
rethinking spatial knowledges of the urban environment. The collection 
contributes to the emerging body of work employing an antiessentialist 
framework while still situating women at the center of discourse. 


Legalizing Gender Inequality: Courts, Markets, and Unequal Pay for Women
in America. By Robert L. Nelson and William P. Bridges. Cambridge: 
Cambridge University Press, 1999. Pp. xvi+393. $59.95 (cloth); $19.95
(paper). 


Fiona M. Kay 
Queen's University 


In recent years, the pay equity issue has gained prominence in academic 
research, media commentary, and policy debates. Center stage is the chal- 
lenge to identify the source of wage differences between jobs held pri- 
marily by women and those held primarily by men within the same or- 
ganization. The dominant discourse holds that differences in wages are 
a product of the market, rather than the decisions of employers, and 
differentials are founded on considerations of efficiency. In this skillfully 
argued and well-documented book, Nelson and Bridges pose a formidable 
challenge to the core empirical claims of contemporary theories dominated 
largely by orthodox labor economists. Nelson and Bridges contend that 
a substantial portion of the pay differences between “male” and “female” 
jobs, especially in large organizations, cannot be attributed to the market 
nor to principles of efficiency. Rather, their research convincingly dem- 
onstrates that these differences derive from organizational processes for 
which employers should be held legally responsible. Their claims reignite 
the policy debate on pay reform. 

Legalizing Gender Inequality begins by tracing the evolution of statutory
and case law leading up to the Gunther opinion, a doctrinal turning point 
for wage discrimination claims under Title VII, and analyzes reported 
cases of sex discrimination in pay following this pivotal case. The authors’ 
reading of contemporary cases suggests the courts were heavily influenced 
by the dominant conception of between-job gendered pay differences. The
focus of the book is a detailed analysis of four empirical case studies of 
significant pay discrimination lawsuits litigated during the 1970s and 
1980s. Nelson and Bridges offer a distinctive emphasis on dimensions of
the employing organizations as defendants in the lawsuits, highlighting 
organizational data filtered into legal records, employment conditions giv- 
ing rise to legal disputes, and the consequences of legal decisions for 
organizations. In each case, the authors find that gender inequality cannot 
be adequately explained by market forces or efficiency arguments. The
patterns observed and their complexities demand a new theoretical focus 
on organizational dimensions of gender inequality, specifically the process 
through which organizations mediate market wages. 

Nelson and Bridges offer a significant contribution to the sociology of 
law and social inequality by explicitly challenging conventional theories 
of the relationship between market and organizational processes, and by 
painstakingly unraveling the mechanisms through which organizations 
reproduce gendered pay inequities. Their innovative sociological frame- 
work for the analysis of gender inequality is termed “the organizational 
inequality model.” Grounded in the classical Weberian conception of or- 
ganizations as systems of legitimate domination, the authors propose a 
framework that examines the process by which certain values become 
organizational practice and the consequences for employees and the 
organization. 

The most significant limitation of this study is the difficulty of drawing 
generalizations from only four cases to the larger and more complex terrain 
of organizational structures. Yet the authors discuss this limitation at 
length, as well as issues of selection bias in cases pursued to court. Another 
shortcoming is that the book does not explore racial inequality in organ- 
izational pay systems, although the novel approach to inequality clearly 
holds potential for the study of race-based pay discrimination. Overall, 
Nelson and Bridges have succeeded with a systematic investigation of 
efficiency, market, and organizational inequality models across public and 
private sectors. The rich data derived from the cases peel back layers of
organizational complexity to reveal the mechanisms that reinforce gender- 
based pay inequality within organizations. 

The authors extend their work beyond theoretical frontiers to recom- 
mend policy initiatives. The first is to address institutional barriers 
through the development of a “best practice model” of gender-neutral 
wage administration. The second strategy demands renewed efforts at 
achieving meaningful results through selective litigation and other forms 
of antidiscrimination regulation. The authors suggest that courts redirect 
their focus from the standard of culpability toward a standard of re- 
sponsibility, in which the touchstone of liability is systematic wage gaps 
that remain unjustified by market and efficiency considerations. In an 
interesting twist, the third tactic suggested is that of the market’s potential 
for transformation. While Nelson and Bridges challenge the link between 
markets and organizational pay systems, they argue that significant gains 
may be achieved through harnessing market forces. They argue that or- 
ganizations can more consistently and fairly incorporate market principles 
in their wage determination systems to foster progressive gender outcomes. 

Legalizing Gender Equality is essential reading for scholars of social 
inequality, gender, and sociology of law. The book breaks new ground in 
a highly polarized debate over pay equity. The “organizational inequality 
model” emphasizes gender in structuring employment relations and rein- 
troduces the firm to studies of gender inequality. The results of this com- 
pelling study testify to the importance of mapping variations in gender 
inequality across different market and organizational contexts. 


The Gender Division of Welfare: The Impact of the British and German 
Welfare States. By Mary Daly. Cambridge: Cambridge University Press, 
2000. Pp. x+273. $64.95 (cloth); $23.95 (paper). 


Betty Farrell 
Pitzer College 


Welfare states are both gendered and have an impact on gender relations 
in complex and unpredictable ways. Although no state welfare system 
has specifically defined gender equality as its primary goal, welfare pol- 
icies, practices, and the norms that underlie them can have both intended 
and unintended consequences for male/female income distribution and 
redistribution, poverty, and the kinds of life choices open to men and 
women in a society. Mary Daly’s densely packed, informative study of 
welfare in Britain and Germany in the mid-1980s sets out to add gender 
to the predominantly class-based analysis that has characterized the wel- 
fare state literature to date. She asks: What difference does it make to 
shift the analysis of institutionalized welfare policies from an exclusive 
focus on labor markets, wage-earning, and job-related economic risks to 
a model that considers the interrelationship between family, state, and 
market? One consequence is to bring to center stage the kind of care- 
taking responsibilities (for children, the ill, the elderly, the disabled, and 
for household maintenance) that have traditionally been located in fam- 
ilies as the (unpaid) work of women. 

This book addresses the extent to which welfare policies in Britain and 
Germany have shifted the traditional work of families from the private 
sphere to the public, and what impact such a shift has had on women’s 
and men’s life experiences. Considering welfare and tax policies through 
the lens of gender highlights different male and female risk universes—in 
particular, the privileging of male income security and the relative inat- 
tention to women’s patterns of family/work involvements and the life- 
long vulnerabilities these produce. 

The German and British welfare systems offer an important contrast. 
Germany has adopted a social insurance-based model of welfare in which 
cash transfers and tax benefits are provided to working men, with the 
assumption that these resources will be distributed through the family to 
dependent wives and children. The privileging of the male breadwinner 
household results in women having limited access to benefits on their 
own, except as widows who are well provided for under this system. The 
implicit bargain that German women make in this welfare system is to 
be cared for through marriage, rather than through their own labor force 
participation. One consequence is that there are large income gaps be- 
tween German men and women, and a high rate of poverty for female- 
headed households. 

In Britain, by contrast, the welfare state has operated as a flat-rate, 
minimal income replacement system that focuses more on support for 
individuals than for families. Some benefits, such as those for children, 
are directed to mothers, and alternative family forms, such as single- 
mother households, are far less marginalized in Britain than in Germany. 
British benefits are meager for everyone, but because transfers are not 
differentiated according to gender, the result is less gender-based income 
inequality than in Germany. The paradox is that, although the British 
welfare state has been more successful in reducing gendered income in- 
equalities, it nevertheless has produced higher rates of poverty in the 
society overall. 

Ultimately, Daly argues, both systems are flawed in terms of supporting 
or improving the lives of German and British women. The care-taking 
activities of women are either uncompensated or low-paid. Poverty re- 
mains high among female-headed households in both countries. Recent 
trends in Germany have been to reinforce the traditional family assump- 
tions and gender arrangements behind their welfare policies, even though 
there is evidence to suggest that these social relations are in flux. In Britain, 
there is new emphasis on reducing state support in favor of family-based 
care, even though high divorce and nonmarital childbearing rates have 
undermined the family’s capacity to provide traditional support services. 

This book is geared to a sophisticated academic audience in the fields 
of gender, stratification, comparative welfare politics, and family studies. 
At times, the author’s abstract analytic style obscures the thread of the 
argument, and one occasionally longs for a narrative of how these policies 
actually play out in people’s real lives. But Daly’s comprehensive, com- 
parative perspective offers many important insights. The book ultimately 
raises many intriguing questions that cannot be answered in the context 
of this study or through the methodology of multivariate analysis. How, 
for example, does the greater familism of German households play out in 
terms of gender power relations, compared to the higher degree of indi- 
vidualism that structures British gender dynamics? What are the prospects 
for women in both countries in the face of economic downturns and the 
tightening of state-supported welfare benefits? Sweden is briefly suggested 
as an alternative welfare model (pp. 218-19), but this tantalizing example 
must await another study. One wonders, as well, about the lessons these 
two welfare systems can provide the United States, where the debates 
continue uncritically about marriage and the family as the best private 
alternative to state-supported welfare. The comparative perspective offers 
insights we can learn much from, and this book provides an excellent 
starting point for bringing gender and family to the center of economic 
and political analysis. 


Welfare Regimes and the Experience of Unemployment in Europe. Edited 
by Duncan Gallie and Serge Paugam. Oxford: Oxford University Press, 
2000. Pp. xix+412. $74.00 (cloth); $24.95 (paper). 


Jan Culpitt 
Victoria University of Wellington 


Three general themes, each the analytical focus of separate chapters, are 
ones to which all the authors of this vital book return again and again. 
The first is an analysis of the characteristics of the 
particular welfare regimes that separate European countries used to in- 
stitute their policy prescriptions to counter unemployment. The second is 
how these regimes interact with and are affected by variations in patterns 
of family life and solidarity that, the authors acknowledge, are affected 
not only by different patterns of welfare but also depend upon “much 
longer-standing cultural traditions.” The third theme is to locate the pre- 
vious two within an analysis of the specific economic conditions of each 
of the European countries they studied (Belgium, Denmark, France, Ger- 
many, Greece, Ireland, Italy, the Netherlands, Portugal, Spain, and the 
United Kingdom) and to consider how this has been affected by different 
levels of economic development as well as by the nature of economic 
restructuring that these countries have put into place. 

With the intellectual “protection” of exhaustive qualitative and com- 
parative analysis, the authors deftly thread their way through some of 
the more intractable ideological thickets of contemporary discourses about 
unemployment. One example is the well-worn debate, originating in the 
Elizabethan poor laws, that generosity of welfare benefits affects attitudes 
to work. This is a magisterial study that effectively raises and returns 
policy analysis to the importance of national differences in their respective 
consideration of unemployment practices. It calls into question the po- 
tential glibness of those social policy analyses that have too easily sub- 
sumed these differences in order to promulgate more general theories such 
as globalization and comparative welfare state descriptions. 

The authors are to be commended for demonstrating the complexity 
of unemployment policy and for revealing the detailed and distinctive 
national characteristics inherent in unemployment policies. They propose 
a fourfold analysis of these differences in policy prescription, namely: 
subprotective, liberal/minimal, employment centered, and universalistic. 
While not assuming that any one nation can ever adequately be located 
within any one specific model, their analysis of European welfare regimes 
in the selected countries will provide policy analysts and academics in 
other countries with a schema for the analysis of their own sets of un- 
employment policies. It will be an important resource in this endeavor. 

The great benefit of this book is that it does return the issue of un- 
employment studies to an analysis of the particular. In this way, it is a 
specific and welcome challenge to the sloganeering responses to unem- 
ployment that have marked the public rhetoric in such countries as my 
own (New Zealand), which have so profoundly embraced the restructuring 
demands of an economic rationalist model and are only now beginning 
to consider what has been wrought. 

There is a strong strand of social policy analysis that reduces all of the 
complexity, so carefully portrayed and analyzed in this book, to various 
subsets of political discourse. While not entirely standing apart from an 
assessment of the role of ideological conflict in an approach to unem- 
ployment analysis, this volume does much to lay the groundwork for a 
return to a detailed policy analysis of difference and, I would suggest, 
inculcates a respect for these national differences. In this way, the authors 
expand on the concept of social exclusion, as though it had an independent 
and therefore effective explanatory frame, and they challenge those who 
would use this concept loosely to instead consider a more detailed analysis 
in which “dimensions of deprivation” are analyzed locally. They dem- 
onstrate that we have much more to gain from an analysis of these national 
differences than from blurring them in generalized and inadequate man- 
tras of the globalized new. This is an important work of scholarship that 
will provide a great wealth of material for academics, students, and policy 
analysts. 


Meritocracy and Economic Inequality. Edited by Kenneth Arrow, 
Samuel Bowles, and Steven Durlauf. Princeton, N.J.: Princeton University 
Press, 2000. Pp. viii+348. 


Hiroshi Ishida 
University of Tokyo 


The persistence of economic inequality in contemporary America has at- 
tracted public attention for a long time. Many Americans believe that 
social policies to reduce the level of inequality in the form of employment 
training or redistributive programs are either ineffective or unfair and 
that economic inequality is the result of differences in people’s cultural 
disposition and genetic ability, which are immune to societal intervention. 
This book, which contains original essays by leading scholars in the fields 
of economics, sociology, and the biological sciences, argues that economic 
inequality cannot simply be explained by individual intellectual ability 
and that social reforms can reduce the extent of inequality and improve 
the nation’s economic well-being. 

The book is divided into four parts. The first part, “Merit, Reward, 
and Opportunity,” contains chapters by Amartya Sen and by John E. 
Roemer, and takes up the issue of defining meritocracy. Sen claims that 
the notion of merit is underdefined and that, if it is properly defined, it 
depends on one’s view of what the just society should be. Roemer ad- 
vocates the importance of the conception of the “level playing field” in 
which opportunities must be equalized before the competition begins 
while individuals must be responsible for the outcomes after they enter 
competition. The second part, entitled “The Causes and Consequences of 
‘Intelligence,’” includes two chapters. Flynn’s chapter demonstrates a 
clear trend in IQ tests, in which each generation outscores the previous 
generation, and suggests that the increases in IQ scores are too large to 
be regarded as intelligence gains. A chapter by Feldman and others shows 
that the heritability of IQ has been overestimated in previous studies and 
that high heritability does not imply immunity to environmental change. 
Both chapters criticize the thesis presented by Herrnstein and Murray in 
The Bell Curve which emphasized the increased importance of genes in 
explaining economic inequality and the ineffectiveness of public policies. 

The third part, “Schooling and Economic Opportunity,” contains five 
chapters that deal with the relationship among cognitive skill, education, 
and socioeconomic attainment. Ashenfelter and Rouse’s chapter reports 
that workers who have similar genetic and family backgrounds but dif- 
ferent schooling do have different wages and that students from less ad- 
vantaged homes benefit from schooling as much as those from more ad- 
vantaged backgrounds. School thus has great potential in raising skills 
and income of the individuals from less well-off families, thereby reducing 
income inequality. The chapter by Bowles and Gintis argues that schooling 
increases earnings primarily by transforming individuals’ preferences, 
rather than by enhancing cognitive skills. They show that noncognitive 
individual traits, such as trustworthiness, willingness to work hard, and 
competitive attitudes toward fellow workers, are rewarded in the work- 
place because they contribute to attenuating incentive problems at work. 
Korenman and Winship reanalyze the data used in The Bell Curve. Their 
reanalysis shows that the apparent effect of intelligence is reduced sub- 
stantially by controlling for education and that a refined measure of family 
background exerts as strong an impact on economic attainment as 
intelligence. 

The chapter by Hauser and others reports that the effect of schooling 
is substantial and persistent even after controlling for mental ability, social 
background, and social psychological variables (such as parents’ encour- 
agement and friends’ college plans). They conclude that, although ability 
plays a crucial role, education and social psychological variables play 
even more important roles in occupational attainment. The chapter by 
Cawley and others examines a dynamic relationship between ability, 
schooling, postschooling investment in training, and earnings over the life 
cycle. Their simulations show that wage gaps between high school and 
college graduates differ by ability groups and that ability affects present 
wage levels as well as individuals’ decisions to invest in their skills which, 
in turn, influences future wage levels. 

The final part, entitled “Policy Options,” contains three chapters. Lund- 
berg and Startz propose new economic models of racial inequality, which 
emphasize the role of social externalities, including factors like imperfect 
information, use of race as a signal, racial segregation, and community 
effects (such as group reputation and social capital). They claim that it 
is possible to design effective policy interventions which encourage pos- 
itive feedbacks where individual behaviors affect community attributes 
and where community attributes affect incentives and opportunities of 
individuals. Loury’s chapter shows simple economic models of worker- 
employer interaction where quota-like antidiscrimination policies lead to 
reduced incentives for skill acquisition among the disadvantaged group. 
Finally, Benabou claims that policies promoting equality of opportunity 
are likely to increase social mobility and economic growth, while policies 
to enhance equality of outcome tend to reduce economic output and work 
incentives. 

In summary, this book shows that scientific studies can contribute to 
tackling one of the most pressing issues confronted by the American people: 
the persistence of economic inequality. It should be read not only by the 
educated public, who will gain a better understanding of the causes of 
inequality, but also by public policy makers who will learn a great deal 
about how to craft effective policies to reduce economic inequality. 


Becoming Adult: How Teenagers Prepare for the World of Work. By 
Mihaly Csikszentmihalyi and Barbara Schneider. New York: Basic Books, 
2000. Pp. xx+289. $27.50. 


George Farkas 
Pennsylvania State University 


This is the second in a series of volumes analyzing a new data set on 
American adolescents in the 1990s, collected with the support of the Sloan 
Foundation (the first was The Ambitious Generation by Barbara Schnei- 
der and David Stevenson [Yale University Press, 1999]). Across 12 sites 
and 33 middle and high schools in 1992-93, the researchers identified a 
“focal” sample of 1,215 students in grades 6, 8, 10, or 12, and a “cohort” 
sample of 3,604 students also in these grades. The focal students received 
all the data collection instruments; the cohort students only completed a 
questionnaire. Data were collected again from the sample members in 
1993-94, 1994-95, and 1996-97. 

This volume uses cross-sectional data from the first data collection wave 
to explore how teenagers are preparing for work and adulthood. Three 
sets of questions are addressed. First, what images and expectations do 
teenagers have about their future employment careers? Second, how do 
family, school, and friends affect the expectations, values, habits, and 
skills—the cultural capital—of adolescents? Finally, what sort of expe- 
riences are teenagers having as they transition out of high school? 

For the first set of questions, the authors find that teenagers are very 
optimistic about their future work careers. Approximately 70% expect to 
have jobs that pay well and that they will like. Further, the less advan- 
taged groups—African-Americans, ethnic minorities, and students whose 
communities rank lower in social class—tend to have the highest optimism 
scores. (This seems surprising, but at least for African-American adoles- 
cents, it has been reported before.) The downside is that this optimism 
may be unrealistic, leading to disappointment later. Certainly the per- 
centage of the sample expecting to achieve careers in the professions 
greatly exceeds the actual percentage of the labor force currently holding 
these jobs. The authors suggest a need for support systems to assist teen- 
agers with socialization into adult work roles. 

The second set of questions is addressed, in part, by using data collected 
via the experience sampling method (ESM). Students were given prepro- 
grammed wristwatches that signaled them at eight unpredictable times 
per day during a typical week. Students filled out a one-page form each 
time they were signaled. The respondent reported the activity engaged 
in, his or her location, and his or her thoughts and feelings at the time. 
In addition, data were collected via questionnaires and personal inter- 
views with the respondents, parents, and school staff. 

Results show that academic activities are seen as work, leisure activities 
as play, and while the former are seen as important to one’s future, they 
are not seen as enjoyable. Surprisingly, white children from educated 
families report disliking work more than do less advantaged minorities. 
However, a more negative social class finding is that economically dis- 
advantaged teenagers spend greater amounts of time than do more ad- 
vantaged teens in a state of disengagement—one that is neither like work 
nor like play. This is an unpleasant, unfocused state, often accompanied 
by low self-esteem. This demonstration of greater disengagement among 
teenagers toward the bottom of the social class hierarchy is one of this 
study’s major contributions, since such disengagement may play a central 
role in the lower educational and occupational achievement of this group. 
(For a suggestive ethnographic treatment, see Ain’t No Makin’ It by Jay 
MacLeod [Westview Press, 1995].) 

The opposite of disengagement is engagement in activities that combine 
high levels of challenge and skill. Teenagers reporting higher levels of 
such engagement also report a better overall quality of experience, are 
more involved in productive activities, are more motivated and optimistic, 
and have higher self-esteem. The authors find that families play a key 
role in providing an environment that is both supportive and challenging. 
The most successful families do this in a way that is spirited, cohesive, 
and purposeful. Such a family ethos can overcome economic disadvantage 
in preparing adolescents for adulthood. 

The authors also use their ESM data to explore students’ experiences 
in the classroom. They find that some of the most common classroom 
activities—teacher lectures and watching videos—are reported by stu- 
dents to lack both challenge and importance to their future goals. Students 
report a lack of engagement at these times. By contrast, the most engaging 
activities are individual work (including a test or quiz) and group activities 
(although, with the exception of science labs, the latter are relatively rare). 
Nonacademic subjects are reported to be more engaging than academic 
subjects. With its focus on individual work and tests, mathematics is 
reported to be the most engaging academic subject. The authors find that 
student engagement in schoolwork is largely driven by the activity format 
(lecture, group, individual, audiovisual, test) employed by the teacher. 
However, it may be important that in a further analysis of these data, 
Gad Yair (Sociology of Education, October 2000) finds that, net of activity, 
African-American and Hispanic students are significantly more preoc- 
cupied with external matters (less engaged with schoolwork), than are 
Asian and white students. 

Finally, the authors examine the experiences of the oldest sample mem- 
bers as they transition out of high school. They are not very positive about 
the assistance provided by guidance counselors or the foundation laid by 
academic schoolwork for these students’ futures in additional schooling 
or employment. The volume concludes with a set of recommendations 
for improvement that are consistent with these themes. Overall, this is a 
provocative study that will be of interest to anyone concerned with Amer- 
ican adolescents and their transition from school to work. 


Stepping over the Color Line: African-American Students in White Sub- 
urban Schools. By Amy Stuart Wells and Robert L. Crain. New Haven, 
Conn.: Yale University Press, 1997. Pp. xii+380. $40.00 (cloth); $18.00 
(paper). 


Meredith Phillips 
University of California, Los Angeles 


Nearly a half-century after the Supreme Court’s landmark Brown v. 
Board of Education decision, most children in America still attend racially 
segregated schools. An intriguing exception are those students who vol- 
untarily participate in an interdistrict transfer program in St. Louis that 
buses over 13,000, mostly poor, urban, African-American students to 122 
predominantly wealthy, white, suburban schools each day. Stepping over 
the Color Line is a thorough, well-written case study of this program. 
The book’s importance reaches far beyond this particular case, however. 
Its insights into the complexities of the school-choice process and the 
difficulties of true racial integration make its core chapters required read- 
ing for all sociologists interested in race, education, and inequality. 

The book has three sections. The first traces the history of racial sub- 
ordination in St. Louis over the past 150 years, setting the stage for the 
court settlement that led to the creation of the interdistrict transfer pro- 
gram. The second, and most provocative, section presents interview data 
from city students and their parents about why they stayed in the city 
schools, transferred to the suburban schools, or returned to the city schools 
after trying out the suburban schools. The third section describes the 
transfer program from the point of view of the white suburban educators, 
parents, and students. 

Wells and Crain’s interview data challenge common assumptions about 
how parents and students make educational decisions. Their results re- 
inforce the concern that school-choice programs may “cream” off the better 
students and the most vocal parents. In their data, students who trans- 
ferred to the suburbs tended to have supportive parents, resilient tem- 
peraments, or both. The parents of transfer students also tended to have 
more education and higher-status jobs than did city parents, even though 
Wells and Crain deliberately sampled transfer and city students from the 
same neighborhoods. 

Wells and Crain’s results also indicate that basic “push/pull” factors 
cannot explain educational choices. Neither dissatisfaction with the qual- 
ity of their neighborhood schools nor positive information about the trans- 
fer program—even from friends and relatives who participated in 
it—were sufficient to encourage some city parents and students to try the 
transfer program. Even though most city parents believed that the sub- 
urban schools were better, some chose to remain in the city simply because 
the environment was more familiar. Others settled for the path of least 
resistance, in essence choosing not to choose. Still others used a compli- 
cated decision calculus involving a careful assessment of their children’s 
academic preparation and personality in relation to the predicted demands 
of the suburban schools. As one parent, who decided to let her son, but 
not her daughter, transfer to the suburbs, put it: “I wanted to put her 
out there, too, but it would be too stressful for her. She is not as quick 
as Maurice. He is very quick; he just memorizes—has an excellent mem- 
ory. She is very active—won’t sit still to grasp it. If I put her out in 
Parkway [a suburban school], it would be very stressful. She can’t deal 
with it” (p. 209). 

An important strength of the book lies in the authors’ willingness to 
portray transfer students’ experiences in all their complexity, despite the 
fact that both authors are strong proponents of the long-term benefits of 
racial integration. Wells and Crain’s interviews reveal that many of the 
advantages of suburban schools, such as having access to a more chal- 
lenging curriculum, being exposed to all the forms of capital inherent in 
a wealthy, white environment, and escaping from one’s neighborhood, 
are also disadvantages. Transfer students faced obstacles to academic 
integration because they had weaker academic skills than their suburban 
counterparts and because they felt excluded from the tacit cultural knowl- 
edge that suburban teachers and students shared. They also found it 
difficult to become socially integrated with their suburban peers because 
they lived too far away to spend time with them after school and on the 
weekends. 

Despite the book’s contributions to our understanding of the school- 
choice process and the promises and pitfalls of integrated schooling, it 
has several limitations. It is too long, mostly because the authors did not 
synthesize the histories of residential and school segregation. More im- 
portant, its conclusions rely too heavily on its small N, cross-sectional 
interview methodology. Although the authors used some secondary survey 
data to support their findings, their basic descriptive claims about dif- 
ferences between the city, transfer, and return students would have been 
much more convincing had they administered their own survey. Moreover, 
the retrospective nature of the interviews makes it difficult to tell whether 
the informants are describing factors that actually influenced their choices 
or factors that simply help justify their choices after the fact. A prospective 
research design would have helped sort out these issues of cause and 
effect. Taken as a whole, however, Wells and Crain have written a first- 
rate book on the history, politics, and lived experience of a policy designed 
to erase the color line, one transfer student at a time. 


Urban Exodus: Why the Jews Left Boston and the Catholics Stayed. By 
Gerald Gamm. Cambridge, Mass.: Harvard University Press, 1999. Pp. 
xii+384. $39.95. 


Omar M. McRoberts 
University of Chicago 


In this highly readable and well-researched book, Gerald Gamm rewrites 
the story of neighborhood invasion and succession and makes a significant 
contribution to urban sociology and the sociology of religion alike. The 
events unfold in Boston’s Upper Roxbury and Dorchester—two formerly 
white and Jewish districts transformed in the 1960s and 1970s by black 
in-migration. Gamm begins with the contention that racist fears, manip- 
ulative real estate and lending practices, and antagonistic street-level in- 
teractions cannot fully explain why Jews left relatively quickly and peace- 
ably, while Catholics held on longer, fought with more vitriol, and pulled 
up stakes less often. 

Rather, rules governing membership, institutional rootedness, and re- 
ligious authority for synagogue and parish differed in ways that permitted 
Jews to leave readily and encouraged Catholics to hold fast. Membership 
in synagogues is entirely voluntary and has no territorial basis. Catholic 
parishes, by contrast, are membership boundaries delineated in the space 
around the church. Jewish ritual, meanwhile, is far less rooted in particular 
places than Catholic ritual, which is built around immovable altars in 
permanent, diocesan edifices. The Torah is highly portable and does not 
derive any degree of sacredness from any particular building or territory. 
Finally, Catholic rules of authority locate a great deal of power in officials 
located outside and above the congregation and parish priest. Synagogues 
and rabbis, however, are highly subject to congregational preferences and 
cannot appeal to powerful judicatories for monetary or moral support. 
The synagogue, then, would have little choice but to follow suit if con- 
gregation members staged a mass exodus toward the suburbs. 

In developing this argument, Gamm mobilizes an impressive array of 
statistical, archival, and anecdotal data. The book’s painstakingly con- 
structed thematic maps are particularly helpful. The maps, reminiscent 
of those in St. Clair Drake and Horace Cayton’s classic Black Metropolis 
(University of Chicago Press, 1945), show how racial and ethnic settlement 
patterns covaried with religious institutional locations over nearly a 90- 
year period. 

Urban Exodus is pioneering in that it takes local institutions as the key 
players in the invasion/succession process. In many previous studies, in- 
stitutions were considered reactors to, not engineers of, shifting residential 
patterns. Largely absent were accounts of institution-led relocation or 
institution-based resistance to displacement or the arrival of reviled others. 
Also absent were accounts sufficiently sensitive to the formative roles local 
religious organizations have played in the urban process. Urban Exodus 
begins to present those accounts and develop that sensitivity, thereby 
significantly enriching theoretical and empirical discussions of religion in 
urban settings. 

A minor problem with the book is its title, which suggests that white 
Catholics stayed in their neighborhoods while Jews left wholesale. The 
fact is, as Gamm reveals in the text, many white Catholics eventually left 
too. Catholic churches, nonetheless, remained and became religious homes 
for recently arrived Vietnamese, Latino/a, and Haitian immigrants. This 
points, nevertheless, to a deeper conceptual problem regarding Gamm’s 
uneven operationalization of the term “mobility.” He always describes 
Jewish mobility in terms of institutional and personal flight but sometimes 
describes Catholic rootedness in terms of an absence of institutional move- 
ment, without regard to the glaring fact of white Catholic flight. The 
insinuation is that Jews and Catholics really behaved oppositely. 

More generally, Gamm’s explanatory discussion is thicker for syna- 
gogues than churches, which makes Catholic cases appear as a backdrop 
against which to develop the complexity of Jewish cases. This is partly 
understandable since the book aims to refute Hillel Levine and Lawrence 
Harmon's Death of an American Jewish Community (Free Press, 1992), 
which foregrounds all of the impetuses for Jewish flight that Gamm down- 
plays. Still, Gamm demonstrates more convincingly that institutional Ju- 
daism did not impede Jewish flight than that Catholic canon law led 
whites toward belligerent parochialism. 

At times, Catholic rules of authority allowed the hierarchy to sidestep 
territorial rules of rootedness and membership. As Gamm acknowledges, 
Boston’s Catholic archdiocese sometimes shut down churches or exper- 
imented with nongeographic “ethnic parishes.” Importantly, though, nei- 
ther response indicates a rigid devotion to altars, buildings, or physical 
propinquity as the primary basis for membership. These actions point to 
the ways judicatorial whims, rather than iron cage–like rules, influenced 
local institutional behaviors. In any case, for a more sophisticated portrait 
of Catholic involvement in urban race politics, read Parish Boundaries 
(University of Chicago Press, 1996), by John T. McGreevy. 

Finally, Gamm may overstate the Jewish immunity to territorial con- 
cerns. If institutional Judaism were so unlike parish-based Catholicism, 
synagogues might not have needed to relocate in order to maintain mem- 
bership. Why did synagogues not remain in place, confident that members 
would commute back to the city for worship? Perhaps the Jewish faithful, 
like so many of their Catholic counterparts, preferred to live near their 
religious institutions, such that synagogues had to follow their members’ 
movements. Perhaps Boston’s Jews had parishes after all—portable ones. 


Bitter Fruit: The Politics of Black-Korean Conflict in New York City. By 
Claire Jean Kim. New Haven, Conn.: Yale University Press, 2000. Pp. 
xiv+300. $37.50. 


Francesca Polletta 
Columbia University 


The boycott of two Korean produce stores by black activists in Brooklyn 
in 1990 lasted almost a year, created a firestorm of publicity, and destroyed 
the reelection prospects of New York City’s first black mayor. Claire Jean 
Kim traces the history of the boycott in order to advance an ambitious 
set of arguments about race, power, and protest in post-1965 America. 
The press reported the story as one of racial scapegoating, with blacks 
lashing out against hard-working Koreans whose success showed up their 
own failure. Academic accounts of black-Korean conflict have not gone 
much beyond that interpretation, Kim argues. But they thereby “miss the 
forest for the trees” (pp. 12, 154), namely, the “racial order” structuring 
the contenders, stakes, and outcome of such conflicts. Kim draws on Omi 
and Winant’s notion of a “racial formation” to draw attention to the 
naturalized categories and understandings of race that shape people’s life 
chances. But she extends their concept in two ways. First, she locates 
Asian Americans in the post-1965 American racial order. Ranked below 
whites but above blacks on an axis of superiority/inferiority, they are 
positioned as apart from both groups on an axis of insider/foreigner, as 
“permanently foreign and unassimilable” (p. 16). They are a “model mi- 
nority” with admirable values of entrepreneurialism and self-sufficiency, 
but are also “ostracized from the body politic” (p. 17) and denied access 
to the social safety net available to other Americans. Second, Kim argues 
that the racial order shapes how subordinated groups experience and 
challenge their position, and with what consequence. Segregated in under- 
financed and politically underserved neighborhoods as the result of a long 
history of discrimination, black Americans have repeatedly drawn on a 
nationalist “community control” frame to fight the intrusion of exploitive 
landlords and merchants in their neighborhoods and to envision a geo- 
graphical base for more fundamental black power. 

When a Haitian woman was allegedly beaten by the Korean manager 
of a Brooklyn green grocery in 1990, black nationalist activists saw in 
the boycott an opportunity to mobilize black residents around the aim of 
community control. Their complaint was with particular merchants and 
with black powerlessness, not with Koreans. But in response to the boy- 
cott, Korean merchants and their supporters in the media invoked an 
integrationist rhetoric and, indeed, the legacy of Martin Luther King, Jr., 
to paint the boycotters as racist. Mayor David Dinkins, caught “between 
a rock and a hard place” (p. 189), alienated whites by hesitating to criticize 
the boycotters and then alienated blacks nine months later by crossing 
the picket line. Others began to cross the picket line after he did, but the 
boycott’s end did not reverse his political fortunes—he was later defeated 
by Rudolph Giuliani in an election which played up racial themes—nor 
secure protesters anything in the way of victory. Such a dénouement was 
inevitable, Kim argues, since the integrationist, “colorblind” rhetoric in- 
voked by the boycott’s opponents effectively reinterpreted black challenge 
as black criminality and “reverse” racism. In the end, “racial power in- 
evitably generates protest by subordinated groups, but it also names, 
interprets and ultimately silences that protest” (p. 219). 

Kim traces the unfolding of the boycott through in-depth interviews 
with black nationalist activists, city officials, Korean merchants and their 
advocates, and boycott participants. Her account is illuminating in show- 
ing the extraordinary pressures on black elected officials, and especially 
the mayor, to assuage the exaggerated racial anxieties of white business 
elites and media editorialists. It also depicts the complex relations of 
competition and mutual dependence that mark black activists’ interac- 
tions with black elected officials, and the reliance of both on the very 
media coverage that ended up undercutting their positions. 

The problem is that Kim’s analysis repeatedly strains against the limits 
of the racial power model she presents. Her thorough account of why 
Korean immigrants cornered the green grocery niche in poor black neigh- 
borhoods emphasizes more the fact that immigrants were often middle 
class and came with cash than the effects of a discursive racial formation. 
The racial power model provides no purchase on the relations between 
protest leadership and elected officials, which deserve generalization be- 
yond this case. Nor does the model account for what seems some change 
in Koreans’ status (if not that of blacks) as a result of the protest. Kim 
argues that protest in the end merely strengthens rather than changes the 
racial order. But media coverage of the boycott referred to Koreans flexing 
new political muscle, suggesting that they may have begun to successfully 
position themselves as legitimate political claimants—precisely the status 
that Kim argues is denied them in the current racial order. 

Still, the book deserves a wide audience. It bridges theorizing on race/ 
ethnicity and social movements, offers a too-rare picture of post-1965 
black protest, and joins a rich empirical analysis of interethnic conflict 
with an ambitious and provocative argument. 


Namaste America: Indian Immigrants in an American Metropolis. By 
Padma Rangaswamy. University Park: Pennsylvania State University 
Press, 2000. Pp. xviii+355. $65.00 (cloth); $19.95 (paper). 


Parmatma Saran 
City University of New York 


As a result of changes in the American immigration laws in 1965, there 
has been a constant flow of immigrants from India for the last 30 years, 
and it is still continuing. Initially, those who came in the 1960s and 1970s 
represented a high level of education and skills and were mostly profes- 
sionals, for example, doctors, engineers, scientists, and so on. This was 
the function of the immigration laws which only allowed professionals to 
come as immigrants. These are referred to as “primary” immigrants. 

Subsequently, Indians were coming to America as immigrants through 
sponsorship by relatives who were already here. However, this group, 
which came from 1980 to 1990, was not necessarily as well educated and 
lacked the professional backgrounds of previous waves of immigrants. 
They are referred to as “secondary” immigrants. Therefore, now we have 
a highly diversified Indian community in the United States in terms of 
social, cultural, educational, and occupational backgrounds. A strong In- 
dian community has emerged in the United States, with major concen- 
trations in metropolitan areas like New York, Chicago, Los Angeles, and 
San Francisco among others. 

The emergence of Indian communities in the United States has attracted 
academic attention and has also been the object of a spate of articles in 
newspapers and magazines. A number of books have been published by 
historians, sociologists, anthropologists, and others exploring different as- 
pects of the Indian immigrant experience in the United States. This book 
is the newest addition to the growing literature on this subject, and per- 
haps the most detailed, comprehensive, and articulate one. Padma Ran- 
gaswamy has captured the Chicago scene to the fullest extent possible 
and attempted to relate it to the larger scene both theoretically and meth- 
odologically. She has played her role extremely well as both “insider” and 
“outsider,” particularly since she had gone back to India for 10 years and 
again returned to Chicago in the early 1990s. This clearly gave her an 
advantage in conducting this research. 

At the microlevel, Rangaswamy has done an excellent job in portraying 
the Indian American community in the Chicago area. She has successfully 
used both quantitative and qualitative analysis in presenting her data. 
Her personal interviews and observations give a real sense of what is 
happening to the lives of her respondents. Unlike many other studies of 
Indian immigrants, which focus on social and cultural aspects of assim- 
ilation, she also introduces the role of cultural, religious, political, and 
economic enterprises and their impact on the Indian community. While the 
status of women has been an issue and has been dealt with by other 
scholars, Rangaswamy adds a new dimension to the study of Indian 
immigrants by particularly dealing with youth and the elderly population 
and concerns affecting them. These two issues are of immense value for 
their practical implications and must be addressed independently by fu- 
ture researchers. Her study also confirms that the experiences of Indian 
immigrants in Chicago are not any different from those living in New 
York and other parts of the country, nor, for that matter, much different 
from those living in different parts of the world. One common thread is 
their strong commitment to Indian identity. 

The history of Indians migrating to different parts of the world is not 
a new phenomenon, even though the circumstances and context of their 
migration may vary. Indian communities around the world are generally 
seen as “diaspora,” meaning thereby that while they have common roots, 
they are in a state of exile, as Rangaswamy rightly points out. Additionally, 
since they cannot return to the homeland, they are also disconnected from 
their roots in India. Rangaswamy therefore introduces an alternative term, 
oikumene, which connotes the formation of an extended household around 
a central oikos (a Greek word meaning “home”), and she suggests that 
this may be more appropriate to the study of Indians around the world 
since the central source or homeland, India, remains an important part 
of their identity. 

This model makes sense for the study of Indians who left India from 
1950 to 1960 for England, Canada, the United States, and Australia. 
Because of their better economic status, and advancements in transpor- 
tation, communication, and media, it has been possible for them to main- 
tain closer ties with India. However, those who left India earlier in the 
1800s for South Africa, East Africa, Fiji, and Trinidad really lost contact 
with India for practical purposes, even though Indian identity remained 
important for them. Hence this alternative model, oikumene, has some 
limitations and may not be useful in the study of the earlier wave of 
Indian immigration. However, in any case, it can be argued that we need 
to explore new paradigms for the study of Indian immigrants in these 
changing times. 

In conclusion, this book can be clearly seen as a serious scholarly con- 
tribution, providing a very comprehensive picture of Indian immigrant 
experiences at the microlevel and at the same time sensitizing its readers 
to the larger theoretical as well as methodological issues needed for the 
study of Indian immigrants at the macrolevel. This study is certainly a 
welcome addition to the growing literature on Indian immigrants and 
particularly valuable to scholars engaged in the study of ethnicity and 
migration. It is highly recommended for those studying Indian immi- 
grants. Finally, Indian immigrants will find this reading engaging as well 
as rewarding. 


Changing Race: Latinos, the Census, and the History of Ethnicity in the 
United States. By Clara E. Rodríguez. New York: New York University 
Press, 2000. Pp. xv+283. $55.00. 


Peter Skerry 
Claremont McKenna College and the Brookings Institution 


Appearing in the midst of the continuing controversy surrounding the 
2000 decennial census, this study is a timely addition to the growing 
literature on the political and policy implications of government statistics 
on race and ethnicity. Concerned specifically with the historical and con- 
temporary interactions between Latinos and the U.S. census, the author 
offers a competent, nontechnical overview of the issues concerning how 
our largest minority fits into this nation’s bipolar, black-white racial par- 
adigm. More to the point, Rodríguez examines how Latinos may be chang- 
ing that long-dominant paradigm. 

Working squarely within the well-established framework that race and 
ethnicity are social constructions, the author plows no new theoretical 
ground here. Similarly unsurprising are her findings that Latinos’ racial 
identities in the contemporary United States are, quite unlike those of 
black and white Americans, highly fluid and context related. For that 
matter, Rodríguez's research methodologies—which include intensive per- 
sonal interviews, simple correlational analyses of census responses, and 
thorough review of academic and government studies on responses to 
racial and ethnic survey questions—are hardly innovative. Nevertheless, 
she offers an adequate overview of the evidence on the correlates of 
shifting responses by Latinos to questions about their racial and ethnic 
identities. 

Particularly useful is Rodríguez’s inquiry into the tendency for Latinos 
to check off the “other race” category on the 1980 and 1990 censuses. 
Indeed, Rodríguez reminds us that in 1990 more than 40% of Latinos 
identified themselves racially as “other race” and that more than 97% of 
all those so identifying that year were Latinos. To the extent that Rod- 
ríguez has an overarching argument, it is that such responses to census 
and survey questions do not reflect, as is often averred, “confusion” among 
Latinos as to how they fit into the contemporary American racial para- 
digm. Rather Rodríguez maintains that such responses reflect the very 
different conceptualizations of race that Latinos bring with them from 
their countries of origin and that to some extent are maintained here in 
the United States. 

In a similar vein, Rodríguez presents some intriguing data on the effects 
of the race of interviewers and of the context of interviews on Latinos’ 
responses to questions about their race and ethnicity. Rodríguez also pres- 
ents an informative historical summary of the changing format of race 
and ethnicity questions on the decennial census forms. 

What this study does not do is deal with the policy issues now swirling 
about the census. For example, the author spends a good deal of time 
tracing how we arrived at the present regime of racial and ethnic self- 
identification on the census form. But she neglects to address the inter- 
esting and important exception to self-identification: observer identifica- 
tion of race and ethnicity, the method that civil rights enforcement 
agencies have come to rely upon. 

Toward the very end of the study, Rodríguez alludes to the persistent 
“conflict between providing recognizable categories that are relevant to 
respondents and needing to gather uniform, comparative data” (p. 176). 
But this is all she has to say on this important issue. Nor does Rodríguez 
consider the much more basic question of whether Latinos are funda- 
mentally challenging the United States’ bipolar racial paradigm, as she 
seems to suggest, or whether they are gradually becoming absorbed into 
a new version of the old paradigm that in law and public policy treats 
African-Americans, Latinos, and others as similarly situated racial 
minorities. 

As a result, this book will be most useful in undergraduate courses 
dealing with race and ethnicity. More advanced students and researchers 
will find it less helpful. 


Sharing America’s Neighborhoods: The Prospects for Stable Racial In- 
tegration. By Ingrid Gould Ellen. Cambridge, Mass.: Harvard University 
Press, 2000. Pp. vii+228. $39.95. 


Mary Pattillo-McCoy 
Northwestern University 


There is plentiful demographic evidence that U.S. cities are highly seg- 
regated by race. Yet this does not mean that all neighborhoods are seg- 
regated. Sharing America’s Neighborhoods takes as its focus the nearly 
20% of metropolitan neighborhoods that are racially integrated. Here, 
“neighborhood” means census tract, and “integrated” means 10%-50% 
black. The book concentrates on black/white integration and stresses the 
residential decisions of white households as determinative. It offers an 
instructive analysis of the mechanisms of racial integration and racial 
change and should be read by urban sociologists, demographers and plan- 
ners, and race and ethnicity scholars. 

Ellen lays out the landscape of racially integrated census tracts. 
Whereas the “social problems” approach predisposes many sociologists to 
tell a story of neighborhood racial instability, Ellen spins the data in the 
opposite direction. Turnover is not inevitable. The majority (57%) of in- 
tegrated neighborhoods in 1970 were still integrated in 1990. This is a 
service to the discipline as it counteracts the collective pessimism about 
racial integration. Having established the possibility for stable integration, 
the remainder of the book examines the reasons that neighborhoods 
change. 

Ellen uses censuses (1970-90) and geocoded survey data from the Amer- 
ican Housing Survey to build a theory of “race-based neighborhood ster- 
eotyping.” The theory challenges the notion that whites’ residential de- 
cisions are based on their aversion to living near blacks. Instead, Ellen 
argues that whites are more concerned about nonracial economic and 
quality of life criteria, such as housing values, school quality, and crime. 
However, whites’ assessments of neighborhood health are influenced by 
stereotypes about blacks. “It is not necessarily that white households dis- 
like living next to blacks per se; it is that many white households, rightly 
or wrongly (and even perhaps, with some regret), associate predominately 
black neighborhoods with diminished neighborhood quality and resil- 
ience” (p. 47). The fear that racially integrated neighborhoods are on their 
way to becoming all black is the primary threat to maintaining their 
stability. 

The author finds empirical support for the theory. When predicting 
white population decline (chap. 4), neighborhood satisfaction (chap. 5), 
and actual mobility (chap. 6), the coefficient for proportion black in the 
neighborhood is almost never significant, but the measure of black pop- 
ulation growth is frequently significant. Growth in the black population 
is positively correlated with white population decline and actual mobility 
decisions, and negatively correlated with white neighborhood satisfaction. 
This finding challenges pure prejudice theories that are based on white 
households’ simple antipathy for black neighbors. Such dislike, Ellen 
argues, should be captured by the measure of current racial composition. 
What matters instead is whites’ future expectations for neighborhood 
quality, informed by the trend in racial composition. 

Ellen also finds that unlike homeowners, white renters’ satisfaction and 
mobility are not affected by either current proportion black or growth in 
the black population share over time. If white residential decisions were 
based primarily on aversion to black neighbors, then one would not expect 
differences in attitudes and behavior among whites by housing tenure. 
The race-based neighborhood stereotyping theory rests on the investment 
decisions of white households. Thus, the financial, temporal, and social 
investments in neighborhoods made by homeowners and not renters make 
the racial stereotypes operative for homeowners in their housing decisions 
and evaluations. 

While the author contends that these findings strongly challenge pure 
prejudice models of neighborhood change, it seems instead that the results 
in this book are important qualifications of such models. It is consistent, 
for example, with a “tipping” hypothesis that growth in the black pop- 
ulation better predicts white behavior since “tipping” is based on breaches 
of the racial tolerance thresholds of successive waves of white residents. 
Yet the author resists pure prejudice as an explanation using a phrase 
that is becoming ever more prevalent in sociology: “It isn’t race per se.” 

What authors usually mean when they use this rhetorical expression is 
that it is not the “blackness” of blacks that occasions some outcome, but 
rather other characteristics that are so tightly connected with blackness 
that they are often indistinguishable by lay observers. Through statistical 
isolation, however, we can make the race coefficient insignificant and 
conclude, as Ellen does, that “households care less about the racial com- 
position of their neighborhood per se than about its quality of life” (p. 
131). I am unconvinced, however, that this is a helpful distinction. Surely, 
it can be informative to policymakers who wonder what things about 
blacks need to be “changed” in order to change an outcome, since changing 
skin color is not an option. But the important task of specifying the 
mechanisms does not necessitate using a language that obfuscates the 
primary connections between blackness and negative outcomes. Because, 
as Ellen points out relative to neighborhood quality, “any disparities that 
exist between black and white neighborhoods are to a significant degree 
rooted in negative racial attitudes and discrimination” (p. 156). 


From the Puritans to the Projects: Public Housing and Public Neighbors. 
By Lawrence J. Vale. Cambridge, Mass.: Harvard University Press, 2000. 
Pp. xiii+460. $45.00. 


Arnold R. Hirsch 
University of New Orleans 


In 448 well-researched, written, and illustrated pages, Lawrence J. Vale 
provides a history of public housing in Boston, focusing upon its national 
context and cultural origins. For Vale, public housing is not simply a 
bricks-and-mortar New Deal program. It is, instead, part of a historical 
debate regarding the nature of society’s obligation to provide decent hous- 
ing to those who cannot meet the market’s demands or the community’s 
accepted standard of behavior. Embracing such individuals collectively 
as “public neighbors” (p. 8), this book connects the prehistory of public 
housing to traditional attitudes and policies aimed at distancing as well 
as reforming the poor. 

Ambivalence and continuity constitute dominant themes. The former 
manifested itself in repeated attempts to separate the “deserving” from 
the “unworthy” in the distribution of public assistance. From the “warning 
out” of “strangers” to the institutionalization of “indoor relief,” the desire 
to withhold support from those of questionable character, or to ration it 
to those susceptible of being “cured” in a confined environment became 
clear. Progressive era settlements, model tenements, zoning ordinances, 
and building regulations ambiguously elaborated upon society’s respon- 
sibilities by suggesting the need to reform both the environment and the 
poor. And these actions contrasted sharply with the largesse bestowed 
upon upwardly mobile “worthy” citizens by the federal government’s 
distribution of public lands. Not only did such giveaways promote the 
ideal of the single-family home occupied by a hardworking citizen-owner, 
but they made its realization possible for many. Ironically, the ubiquity 
of the housing subsidies (later augmented by federally insured mortgages 
and breaks enshrined in the tax code) contributed to their invisibility and 
the ideology-mythology of the morally superior, independent property 
owner. 

The second half of the book furnishes a richly detailed case study of 
public housing in Boston that will be especially appreciated by specialists. 
It is here that the continuity with the program’s prehistory shines through: 
the exaltation of the single family home, distrust of the poor, the popular 
linkage of (im)morality and poverty, and the persistence of the “politics 
of ambivalence.” The “more privileged,” Vale concludes, “gained urban 
houses, frontier homesteads, or suburban plots,” even as “those public 
neighbors judged least deserving were warned out, walled-in, and left 
behind” (p. 158). 

Undergoing a multistaged transition keyed to legislative enactments 
through the 1930s and 1940s, public housing in Boston began as a series 
of highly “selective collectives” intended to reward the “deserving” poor 
(as late as 1950-51, 90% of the applicants were veterans and 95% were 
employed) only to emerge later as the leading option for impoverished 
nonwhites possessing the fewest choices. First seen as an engine of reform 
that promoted good citizenship by lifting its denizens out of surrounding 
slums, public housing lost its image as a “step up” when it became a means 
of facilitating private economic development. The Boston Housing Au- 
thority (BHA) subsequently managed 25 family projects built between 
1938 and 1954 on scattered sites. With a carefully screened tenantry, the 
system worked well until faced with the problems of concentrated poverty 
and race in the postrenewal and civil rights eras. In charting the BHA’s 
subsequent decline, Vale cites the usual explanatory suspects while adding 
an “underlying cultural unease” rooted in “ideological ambivalence” (p. 
333). That intellectual discomfort soon led authorities away from the 
construction of new developments and toward programs that utilize pri- 
vate housing and market forces. Vale concludes that vouchers represent 
the best hope for sheltering the nation’s public neighbors: more public 
housing, but without projects. 

From the Puritans to the Projects is an impressive rendering of an 
influential, if ambiguous, reform tradition and the BHA. However de- 
serving of intense scrutiny in its own right, though, Boston’s unique char- 
acter raises questions with regard to the weight placed on cultural or 
ideological explanations. Even holding Puritan legacies in abeyance, a 
city that remained 91% white in 1960, and whose public housing popu- 
lation nearly equaled that figure (85%) two years later, presents quite a 
different profile from that found in many major cities. Where 13 of 17 
Boston family projects built between 1949 and 1954 were on vacant, often 
outlying land, such suggested placement provoked bitter opposition else- 
where. Those cities that concentrated overwhelmingly black develop- 
ments in the urban core faced racial problems of a different order and 
such magnitude that the stigma placed upon their projects flowed from 
sources other than the ideological “ambivalence” described here. 

Finally, Vale’s call for more public housing without projects must be 
measured against Sudhir Alladi Venkatesh’s recent plea to halt the current 
wave of demolition in his American Project: The Rise and Fall of a Modern 
Ghetto (Harvard University Press, 2000). An ethnography of Chicago’s 
Robert Taylor Homes, Venkatesh’s alternative perspective led him to fear 
that the public commitment to shelter the poor will itself disappear with- 
out the projects’ tangible presence; Vale believes that visible presence is 
now an inescapable part of the problem. 


The Crime Drop in America. Edited by Alfred Blumstein and 
Joel Wallman. Cambridge: Cambridge University Press, 2000. Pp. 
xiii+318. $54.95 (cloth); $19.95 (paper). 


John H. Laub 
University of Maryland 


Criminologists were caught off guard by both the upsurge of violence 
during the mid-1980s and the precipitous decline during the mid-1990s. 
The magnitude of the decline in violence, for example, is truly startling, 
with national homicide rates at a 30-year low. The Crime Drop in America, 
edited by Alfred Blumstein and Joel Wallman, is an important collection 
of papers that systematically addresses various explanations for changing 
rates in violent crime in urban areas. The topics investigated can be 
categorized as follows: the effects of criminal justice institutions (John
Eck and Edward Maguire examine changes in policing and William Spel- 
man assesses the impact of prison expansion); the effects of situational 
components of violent events (Garen Wintemute examines the role of guns 
in violent crime and efforts to control guns, and Bruce Johnson, Andrew 
Golub, and Eloise Dunlap analyze changes in drug use, drug markets, 
and drug subcultures); and the effects of larger social forces (Richard 
Rosenfeld examines the age structure of homicide, especially the steady 
decline in adult homicide, declining rates of marriage and intimate partner 
homicide, and changing cultural norms; Jeff Grogger analyzes economic 
trends, in particular economic opportunity; and James Fox assesses the 
effects of demographics). 

Alfred Blumstein sets the stage for these chapters by reviewing trends 
of violence from 1980 to 1998. Blumstein carefully documents the steep 
rise in violence starting in 1985 and the equally steep fall in violence 
starting in 1992-93. Blumstein highlights the different trends in homicide
for different age groups and underscores the need to examine disaggre- 
gated rates of violence. From these data, it is clear that the increase in 
violence during the 1980s was due to juveniles, not adults. However, the 
decline is attributable to both age groups. Blumstein presents his thesis 
that the increase in juvenile violence was due to the diffusion of guns 
into the hands of juveniles coupled with the emergence of the crack co- 
caine market. What is less clear is whether this explanation can account 
for the large decline as well.

Although there are few surprises in this volume overall, some findings 
should be highlighted, especially for those interested in policy. For in- 
stance, the increase in incarceration played a relatively modest role in 
bringing down crime rates. More precisely, Spelman estimates that “the 
crime drop would have been 27 percent smaller than it actually was, had 
the prison buildup never taken place” (p. 123). Along similar lines, there 
appears to be no hard evidence that changes in policing alone (e.g., zero- 
tolerance policing drawn from “broken windows” theory) accounted for 
the decline in violent crime. Finally, it is apparent that demographic trends 
were not helpful in predicting either the upsurge in violence or its hefty 
decline. As Fox says, “Demography can be predicted with a high level of 
certainty, suggesting future changes in crime that might occur with the 
rather bold assumption that all else remains equal.” However, “the as- 
sumption of ‘all else equal’ is a rather problematic one” (p. 309). 

I wish the volume were more expansive than it is. Three things come
to mind. First, the focus on short-term trends in violence needs to be 
justified. We know that rates of burglary have declined dramatically over 
the last two decades. Do the same “suspects” for the decline in violence 
extend to property crimes and forms of problem behavior, like teenage 
birth rates, that have also declined? Moreover, what do these short-term 
fluctuations look like when examined in a longer time series of, say, 50
years? Second, there are several sociolegal trends that are not discussed 
in enough depth. These issues are of interest to sociologists, and they 
include crime prevention initiatives, welfare reform, attempts to 
strengthen families and schools, the controversial role of abortion legal- 
ization, and efforts of African-American religious groups like the Ten Point 
Coalition in Boston. Third, what can we learn from the experiences of 
other countries? In fact, an exclusive focus on the United States may limit 
our understanding of trends in violence. At a recent meeting of the Amer- 
ican Society of Criminology, Rosemary Gartner, a sociologist at the Uni- 
versity of Toronto, pointed out that while homicides in Canada have fallen 
markedly during the 1990s, there has been little change in drug demand 
or drug markets, incarceration rates have fallen, police practices have not 
changed in any systematic way, and overall the Canadian economy has 
not been particularly robust during this time period. This suggests that 
there is much that can be learned from a comparative study of crime 
trends in the United States, Canada, and Europe. 

These criticisms aside, this book is a “must read” for criminologists.
The questions examined are important, the research is carefully done, and 
the findings will not only help us sort out competing explanations for the 
current crime drop, but will also expand our general knowledge about 
crime causation and its control. 


Flag Burning: Moral Panic and the Criminalization of Protest. By
Michael Welch. New York: Aldine de Gruyter, 2000. Pp. viii+220. $59.95 
(cloth); $29.95 (paper). 


Mathieu Deflem 
Purdue University 


This book provides an account of the history of flag desecration and the 
efforts to criminalize it. Welch’s analysis is primarily directed at unrav- 
eling the course and outcome of attempts to outlaw flag desecration and 
the unintended consequences these had. The opening chapters provide a 
narrative of the main stages in the history of flag desecration. During the 
antebellum years of the 19th century, movements to protect the flag as a 
powerful symbol in American society had strong associations with nativ- 
ism and patriotism. The Civil War was a first important catalyst to launch 
a veritable movement against flag desecration based on the intimate con- 
nection between flag desecration and anti-Unionism. Not surprisingly, 
similar intensifications of the flag issue took place during World Wars I 
and II.

Mounting resistance against the outlawing of flag desecration did not 
take place until the protest era of the 1960s and 1970s, when various 
forms of flag desecration, especially flag burning, became central elements 
in a more general protest movement. During the 1980s, this generality 
made way for more isolated but highly publicized flag desecrations by 
specific social movements, such as the Revolutionary Communist Party. 
The intense nature of the controversy at that time led to convictions on 
the basis of flag protection statutes. These statutes, however, were quickly 
overturned when the U.S. Supreme Court in 1989 ruled that a Texas 
antidesecration statute violated the First Amendment of the U.S. Con- 
stitution. The debate shifted to the political arena, but a constitutional 
amendment to ban flag desecration failed. In 1989, Congress passed the 
Flag Protection Act, which in the following year, however, was ruled to 
be unconstitutional by the Supreme Court. 

The final section of Welch’s book analyzes the reactions against flag 
desecration in terms of a model of moral crusading. Welch identifies the 
criteria of the panic, the social control agents and institutions that played 
a prominent role in the crusade, and the themes that were used to sub- 
stantiate concerns for and against flag protection. Specifically analyzed 
are relevant congressional debates and the representations of flag desecration
and flag protection in the media. The author concludes that political
elites and the media have contributed to the idea that flag desecration
is a revolutionary force targeted at the very fabric of American
society. The media’s role is more ambiguous in that journalists are also 
highly protective of First Amendment concerns. This ambiguity is also 
shown in the ironic consequences that reactions against flag desecrations 
have had in contributing to the opposition to the control of flag 
desecration. 

Michael Welch’s book delivers a contribution to the sociological study 
of a fascinating and important social issue. Any sociologist interested in 
flag desecration issues has to start with Welch’s work: this book and the
author’s many related articles. The empirical sections of the study, es- 
pecially the identification of the various themes and players in the con- 
frontational battle between First Amendment rights and the protection 
of fundamentally held beliefs and values, make for an interesting read. 
However, I found the work to be generally much less convincing in the- 
oretical aspects. The book is largely indebted to a rather orthodox social 
constructionism, additionally relying on other sociological insights, par- 
ticularly Robert Bellah’s notion of civil religion, which is used to argue 
that the American flag has become a venerated object that demands 
special protective status. While occasionally useful, Welch’s theoretical 
model is not particularly illuminating. For one, the theory is spiced up 
with a fanciful terminology (e.g., “authoritarian aesthetic”) that is neither 
explained nor applied to any reasonable degree of intellectual sophisti- 
cation. The reliance on theories in the social control literature is at times 
careless, most clearly when Welch uses Gary Marx’s notion of the ironies 
of social control to argue that social control contributes to rule breaking 
(p. 179), whereas the original insight is that social control may contribute 
to deviance under specified circumstances, the conditions of which have 
to be carefully investigated. 

Betraying certain normative tendencies in the social construction par- 
adigm, Welch’s analysis focuses too exclusively on the strategies of state 
agents to monopolize the opposition against flag desecration. This “elite- 
engineered model” (p. 124) neglects the popular, grassroots dimension of 
the flag protection movement. Failing to disentangle these two dimensions 
of power and resistance—for instance, in terms of a Weberian distinction 
between state and nation—the one-sidedness of Welch’s perspective be- 
trays a highly partisan stance. Desecrations of the flag are seen as “crit- 
icism of the state” (p. 4) that target the “authoritarian aesthetic by at- 
tacking its symbols” (p. 50), while movements protective of the “symbolic 
value of Old Glory” (p. 49; note the different terminology to refer to the 
flag) are claimed to use images that are “fraught with contradictions” (p. 
9), relying on arguments that “not only fail basic ontological scrutiny, [but] 
also defy the underlying principles of the U.S. Constitution” (p. 12). I fail 
to see the scholarly grounds that could rationally permit such a priori
normativism to slip into our discipline. 


Culture: The Anthropologists’ Account. By Adam Kuper. Cambridge,
Mass.: Harvard University Press, 1999. Pp. xv+297. $31.00 (cloth); $17.95 
(paper). 


Norman K. Denzin 
University of Illinois at Urbana-Champaign 


This is an important, carefully argued book, although I disagree with 
much of what it says. Kuper, a South African anthropologist, details in 
eight closely argued chapters the uses to which the concept of culture has 
been put in postwar American cultural anthropology. His introduction 
surveys the 20th-century culture wars, showing how the uses and mean- 
ings of culture moved back and forth from the humanities to the social 
sciences. At mid-century culture was defined by Parsons, Kroeber, and 
Kluckhohn as a collective symbolic discourse about knowledge, beliefs, 
and values. This would be the term’s meaning for the newly emerging
behavioral sciences. 

The next generation of anthropologists (Lévi-Strauss, Geertz, Schneider, 
Sahlins) moved the discourse in two different directions at the same time, 
the structural or linguistic, and the humanistic or interpretive. Lévi- 
Strauss and his followers read culture as if it were a language. The post- 
structural Geertzians, in contrast, interpreted culture as a text. 

In part 1, elaborating the themes in his introduction, Kuper presents
two genealogies of the term. Chapter 1 surveys the continental discourses 
on culture, noting tensions among those who treat culture with art and 
civilization (Febvre, Arnold), as a way of life (Eliot), and as structures of 
feeling (Williams). Chapter 2 examines the call by Parsons, Kroeber, and 
Kluckhohn for an objective science of culture. Part 2 explores postwar 
experiments with the concept. Chapters 3, 4, and 5 respectively examine 
the works of Geertz, Schneider, and Sahlins. Geertz’s interpretive turn, 
which moved anthropology into the humanities, is contrasted to Schnei- 
der’s rigorous scientific structural approach to kinship, and Sahlins’s his- 
torical materialism. 

Chapter 6, “Brave New World,” offers an all-too-brief reading of the 
post-Clifford and Marcus writing culture project. For Kuper, the tenured
radicals of the post-Vietnam generation politicized culture, while criticiz- 
ing anthropology’s complicit relationship with American imperialism. 
This generation rejected functionalism (Parsons), structuralism (Lévi- 
Strauss), and the strict hermeneutic approaches to culture (Geertz). Lit- 
erary approaches to writing culture were developed. Rosaldo and others 
questioned the value of such terms as objectivity and impartiality. This 
generation was united on three themes: there has been a world-historical 
shift in the terms of cultural trade; objective accounts of other ways of 
life are not possible; and there is a moral obligation to celebrate cultural 
difference (p. 218). 

Kuper criticizes each of these assumptions, calling them romantic, postmodern
and relativist, and asking (without Spivak), who can speak for
the “other”? He believes that the postmodern turn has had a paralyzing 
effect on the discipline. And at this moment, Kuper’s values become most 
clear. “The postmodern, writing culture project has stifled young ethnog- 
raphers. It has become a source of ideological support for identity politics. 
Most importantly, this project denies the possibility of a cross-cultural, 
comparative anthropology” (p. 223). He wants to return to the good old 
days, before everything got so messy. 

Chapter 7 presents Kuper’s conclusions. He now has three generations 
of anthropologists interacting in his text. The Parsonian generation 
wanted an objective science of culture. Geertz’s generation pushed for a 
“detached, cerebral hermeneutics of culture” (p. 228). In the 1990s, culture 
theory would become political, and essays on cockfights in Bali seemed 
quaint (p. 229). 

Enter the enemy, the left and its cultural studies project. The humanities 
and popular culture triumph over high art. Culture is politicized, cultural 
difference and multiculturalism are valorized, the melting pot is dead, 
and cultural criticism becomes normative. We are suddenly confronted, 
he asserts, with a fragmented culture built on identity politics. This con- 
flicts with the liberal individualism of the earlier melting pot model. This 
assimilation model recognized difference but urged persons to find a place 
in the larger culture. Kuper liked this situation. In order to dismiss cultural 
studies, he charges that it presumes an essentialism that turns culture into 
a “politically correct euphemism for race” (p. 240). Surely Stuart Hall 
should have been allowed to confront this conservative allegation. For 
these are harsh words, and Hall is nowhere to be heard. 

Kuper’s conclusions are quick and brief. He wants to dismantle culture
and make it refer to several separate things, such as knowledge, art, or 
belief. He is quite firm, contending that “appeals to culture can only offer 
partial explanations of why people think and behave as they do... 
political and economic forces, social institutions, and biological processes 
cannot be wished away” (pp. x-xi). Thus does he morally object to cultural 
studies. It draws attention away from “what we have in common... 
across national, ethnic and religious boundaries” (p. 247). We are back to 
square one, back to Parsons and the good old boys, back to the unprob- 
lematic days of cross-cultural, comparative anthropology. 

As a partisan of cultural studies, I have several objections to Kuper’s 
project. Where are the women? The women who have written culture 
are not here. Culture is a site of the political struggle to define how life 
is lived and experienced. This site is deeply enmeshed in matters of power, 
ideology, and the media. Humans live in a second-hand world, one already 
defined by language and the culture industries. The anthropologist is part 
of this world, not above or outside it. There is no objective vantage point 
from which one can write. We are always already writing culture. This 
project will not disappear because of Kuper’s disapproval. Further, raising 
the red flags of cultural essentialism will not make the politics of identity 
disappear. Kuper cannot legislate what a nation and its peoples have in
common. The melting pot worked only as long as it was controlled by 
white males.

So this is an important book, and it must be contested. Historical read- 
ings are always important because they police disciplinary boundaries. 
Kuper’s readings clearly exclude certain versions and forms of culture 
and cultural studies. Because it is so well written, this book has the 
potential of becoming a canonical guide for a new generation of anthro- 
pologists. This could be unfortunate. Culture is an unruly term. Its mean- 
ings cannot be dictated. More deeply, the term must always exist within 
shifting political, historical, disciplinary and gendered terrains. Each gen- 
eration must have the freedom to take up the term and make it their 
own. If they take Kuper too seriously, the current generation will not do 
this. And that would be a great loss. 


A Matter of Taste: How Names, Fashion, and Culture Change. By 
Stanley Lieberson. New Haven, Conn.: Yale University Press, 2000. Pp. 
xvi+334. $29.95.


Wendy Griswold 
Northwestern University 


One of the most eminent sociologists currently active—a former president 
of the ASA—has written a book lambasting common sociological practice. 
Two practices, actually: he attacks both the common move of imputing 
causality from plausibility and the “iceberg fallacy” of assuming that every 
glittering new shard on the “cultural surface” indicates a structure of social 
change underneath. Stanley Lieberson will tolerate neither, and he has 
put together an impregnable demonstration of how to do it right.

The topic is taste, and the specific subject is taste shifts in the naming 
of children. Lieberson points out that naming offers an ideal window into
how taste works because it is one of the few fashion-driven practices
that are virtually untouched by marketing and the manipulation of con- 
sumption, for no one stands to profit from any particular naming decision. 
Moreover, the historical record is extremely rich; over the centuries we 
might not know if little Pamela or Pierre learned to read or lived beyond 
childhood or were socially mobile, but we do know what their names 
were. So mustering fabulous quantities of these data and subjecting them 
to rigorous analytic scrutiny allows Lieberson to see what motivates shifts 
in naming practices whereby Harry and Hilda go out of style and Lakeisha 
and Michael come in. 

Lieberson charges that the typical cultural sociologist (your reviewer 
blushes as she writes this) finds a cultural change, finds some social change 
that happened around the same time, theorizes a plausible connection 
between the social change and the cultural change, and asserts that the 
former caused the latter (or, more typically and less honestly, that the
latter “reflects” the former). With the cool eye of a scientist, Lieberson 
views this as simply hypothesis formation, and he demands that any such 
assertion be subject to rigorous testing to see if what is plausible is actually 
correct. It often is not. For example, while it might be plausible that 
biblical names reflect a religious spirit, in fact biblical names came into 
fashion in the 1960s, just as church attendance was dropping. Moreover, 
parents with intense religious feeling who go to services regularly are less 
likely to give their children biblical names than those who never darken 
the doorway of church or synagogue. 

Indeed the sociological impulse to assign an external cause to changes 
in a fashion system may be misleading. Lieberson suggests that internal 
causes produce most of the advances and declines in the popularity of 
names. Most important of these internal causes is the ratchet effect 
whereby (1) taste shifts are modest variations on existing tastes, and (2) 
these shifts do not oscillate back and forth but move steadily in one 
direction for a considerable period of time so that there can be no confusion 
in the short run between what’s fashionable and what’s out of date. Other 
internal mechanisms include expansion from a taste stem (Jane gives rise 
to Janet, Janice, Janis) and combinations of prefixes with previously more 
popular roots (among African-Americans, popular girls’ names of the 1970s
like Tonya and Tasha had given way to Latonya and Latasha by the 
1980s). For the past century or so there has been far more room for such 
innovation, because fashion has replaced custom in name selection, a 
change that Lieberson demonstrates was brought about by urbanization 
and individualism, not by the mass media. 

But if internal pressures produce some sort of shift, what moves the 
shift in certain directions and not others? External cues can sometimes 
be identified (the name Donald dropped out of fashion when Donald Duck 
appeared) but not always (the lasting popularity of Humphrey Bogart did 
nothing for Humphrey). Symbolic connections are difficult to untangle, and
while the fate of a few particular names (Adolf) can be attributed to 
symbolic contamination, external triggers from media or political leaders 
do not seem to have any consistent impact. Lieberson regards innovations 
as like mutations: they happen, and then the internal processes help carry 
on a particular innovation or not. For example, if someone with an unusual
Old Testament name emerges as a sports hero, that name would be more 
likely to catch on in the 1960s (when biblical names were hot) than in 
the 1940s (when they were not). 

Going beyond names, Lieberson argues that a culture should be re- 
garded as a “surface,” the result of various unconnected processes that 
have put elements there. I envision this cultural surface as something like 
a table set with different pieces of crockery, some old, some new, some 
wedding gifts, some from the second-grader’s art class. The analysis of 
the cultural surface involves three different activities: “the initial occur- 
rence and growth of each element; its continuation in the likely event that 
the initial causes no longer operate; and the forces that cause other earlier
elements to decline or disappear” (p. 258). The set table does not reflect 
some underlying iceberg, but instead is the result of different mechanisms 
(attitudes toward wedding gifts, involvement in one’s children’s ceramics,
tendency to break old things). To understand the surface and how it might 
change over time requires painstaking analysis, not an assertion of plau- 
sibility, but it can be done, and Lieberson challenges cultural sociologists 
to try to map and account for culture in this manner. 

There are certain books in cultural sociology that accumulate so much 
evidence and subject it to such painstaking scrutiny that one cannot help 
but be convinced by them; the thesis is not so much argued as established. 
Such books are usually written by full professors with lots of resources, 
immense patience, and high standards. Claude Fischer’s America Calling
was one such book; so was Orlando Patterson’s Slavery and Social Death. 
Now Lieberson has produced another of these rarities. These books are 
solid. More, they are scientific, not in the sense of an uncritical positivism 
but in the sense of a careful but relentless gathering and weighing of the 
evidence. Books like A Matter of Taste set the benchmark for sociological 
practice and remind us of how often we fall short. 


Fashion and Its Social Agendas: Class, Gender, and Identity in Clothing.
By Diana Crane. Chicago: University of Chicago Press, 2000. Pp. x+294.
$30.00. 


Gary Alan Fine 
Northwestern University 


We understand social order both through naked torsos and clothed ones. 
Social class, gender, and identity are inscribed in bodies and on what 
bodies are draped in. Sociologists, being professionally prudish, have been 
more likely to examine the latter than the former, and, as a consequence, 
there is now a growing quantity of scholarship that examines the symbolic 
meaning of fashion. Fashion in clothing seems particularly symbolic and 
“cultural” because it is evident that what people choose to wear often 
seems so voluntary. The rapidity and abruptness with which dress styles 
change serves to remind us that attire is an aesthetic choice, mediated, 
of course, by those technologies and those gatekeepers that set the options. 

Diana Crane has long been fascinated with aesthetic styles and cultural 
movements. In Fashion and Its Social Agendas she has woven together 
seven previously published articles to provide a lens on the selection of 
clothing in the 19th and 20th centuries. The origin of this volume leads
inevitably to a certain fragmentation, and the reader who expects to be 
presented with a consistent and linear analysis will leave disappointed. 
In short, this is not a work that aspires to present a fully formed argument 
about the nature and the dynamics of fashion. Crane’s detailed historical 
treatment sometimes downplays narrative facility with telling a grand
story, establishing proof based upon a preponderance of detail. Recog- 
nizing this limitation, there is still much to be learned from the depth of 
Crane’s scholarship. Crane’s intention is not to smooth over details, but 
to explain the complications of clothing use. 

To assert that fashion reflects class, gender, and identity is to make a 
claim that is far too simple. The interesting question is the limits of these 
connections and the semiotic subtleties of the connections. As Crane aptly 
notes, the fashion divisions among social classes are not as self-evident 
as they might otherwise appear. Drawing on the case studies of Frederic 
Le Play in 19th-century France, Crane demonstrates that individuals in 
distinct segments of the working class brought different orientations to- 
ward middle-class fashion, notably—and surprisingly—in regard to such 
“accessories” as ties and hats. Clothing represented a complex semiotic 
system in which individuals select items to incorporate in their wardrobe, 
especially on leisure occasions, often centered on Sundays and holidays. 

The differences in fashion attitudes between the provinces and Paris,
where the class divisions were less evident and imitation greater, are also 
sociologically significant. In the United States, too, the class divisions were 
less apparent than in the more traditional and stratified cultures of Europe. 
In her excellent and novel treatment of the “cultural politics” of hats in 
19th-century France (who would have thought there is so much to a 
beret!), Crane notes not only class and gender division, but differences 
among Paris, provincial cities, and rural areas, and points to variant 
meanings of different styles, even pointing to the donning of bowlers to 
blur class lines (p. 84). In light of the gendered meaning of clothing, Crane 
demonstrates the multiple ways that women in the 1890s used “fashion” 
as a form of nonverbal resistance to (or at least commentary upon) male 
hegemony. Within the clothing system, a set of alternative styles 
emerged—a multiplicity of styles that has become increasingly institu- 
tionalized within contemporary consumption systems. It is now possible 
for individuals—men and women—to create “texts” of identity by se- 
lecting body coverings, and, as Crane points out, at various times during 
the past two centuries social movements emerged that pushed various 
forms of clothing reform—bloomers, pant suits, unisex clothing, and the 
like. Most of these movements had as their goals to provide women more 
options to dress “like” men, but more significantly to permit them the 
opportunities to have the bodily freedom to do what men do, recognizing 
that clothing can be not only socially constraining, but physically con- 
straining as well. 

Perhaps most interesting is Crane’s discussion of 20th-century (French) 
couture as a global industry. Any student of occupations and of cultural 
organization will recognize within her discussion the process of globali- 
zation, now so widely evident. Fashion, it turns out, was an early industry 
in which national boundaries on acquiring resources, producing garments, 
and selling the product were undercut. While fashion (along with cuisine) 
served as a marker for the dominance of French society, this position of
centrality is only possible given the global reach of the French fashion 
industry. 

Crane persuasively demonstrates why fashion is a powerful image for
the sociologist. Precisely because dress seems to be so separate from the 
demands of function, it appears to be pure expression, an undiluted con- 
struction of segmental groups. The danger—a danger that Crane does not 
entirely escape—is to ignore the fact that fashion decisions are selected 
by individuals with their own motivations. To suggest, as does the title, 
that fashion has its own agendas is perhaps to make concrete fashion as
a social actor in its own right, rather than being part of a tool kit—a 
closet—from which agents select to clothe their public selves.


Creative Industries: Contracts between Art and Commerce. By Richard 
E. Caves. Cambridge, Mass.: Harvard University Press, 2000. Pp. ix+454. 
$45.00. 


William T. Bielby 
University of California, Santa Barbara 


In this book, Caves applies theories of contracts and industrial organi- 
zation in order to explain the economic and social organization of creative 
industries. His task, to explain “why art worlds are organized the way 
they are,” is an ambitious one. He begins by developing the argument 
that contracting and organizational arrangements are shaped by seven 
distinctive features that differentiate creative industries from other sectors. 
The first feature is nobody knows. Demand is uncertain, and whether a 
product will be a success cannot be known until after it reaches the market. 
The second is art-for-art’s-sake. Individuals working in the creative arts
differ from other workers in that they often care greatly about the at- 
tributes of their output and the circumstances under which it is produced. 
As a result, they are willing to work in creative jobs for compensation 
substantially below what they could earn in “humdrum” jobs, and the 
number of people seeking work in creative industries greatly exceeds the 
work available. The third is the motley crew phenomenon. Creative in- 
dustries often require inputs from individuals with diverse skills and ori- 
entations toward their work, which creates problems of coordination. The 
fourth is infinite variety. The products of creative industries are unique 
and highly differentiated. No two films, plays, books, paintings, musical 
compositions, and so on are exactly the same, and the number of possible 
products is infinite. The fifth is the A list/B list phenomenon. Skills are
vertically differentiated, and talent is strictly ranked. These rankings cre- 
ate differential rents: the amount consumers are willing to pay to enjoy 
the product of a top-ranked artist greatly exceeds the minimum compen-
sation required to elicit that artist’s services, and the amount of this “rent”
decreases dramatically with artists’ rank. The sixth principle is time flies.
Temporal coordination is very important, as is the prompt realization of 
revenues, and production delays are extremely costly. The final principle 
is ars longa—individuals are still willing to pay to enjoy a product long 
after it is produced. 

From here, Caves goes on to explain similarities and differences in 
organization across a wide range of art worlds: popular music, musical 
composition and performance, theater, television, motion pictures, book 
publishing, fine art, and even toys. We learn, for example, why the option 
contract is used so widely in creative industries, why nonprofit organi- 
zation is an efficient solution to high fixed costs in the performance arts, 
why talent guilds have tremendous bargaining power, why media con- 
glomerates continue to pursue merger strategies despite the illusory payoff 
of “synergy,” and how the durable and expanding stock of musical mas- 
terpieces constrains opportunities for contemporary composers. We learn 
why “payola” (paying a bribe to get a creative work into channels of 
distribution) is a natural and efficient response in industries in which 
“conventionalized” prices exceed marginal costs and, in an interesting 
aside, how Dick Clark elevated this stigmatized practice to an art form. 
Caves also explains how the recent acquisitions of television networks by 
film studios (e.g., Disney’s merger with ABC) can be understood as a kind 
of “capitalized payola.” 

Extending the work of Gans and others, Caves offers an interesting 
analysis of the distinction between highbrow culture and lowbrow culture 
that eschews differentiating the two based on aesthetic principles or the 
demographics of audiences. Instead, he argues that what we usually call 
“high culture” is an art world where there is both a high ratio of “buffs” 
(who invest in acquiring capital that enhances their appreciation of the 
product) to casual consumers and a high level of fixed costs per consumer. 
Thus, in his view, “each art realm has its natural turf and scale of op- 
eration, a balance of demand and supply forces” (p. 187). 

None of what appears in this book is based on Caves’s original research. 
Instead, he draws upon his extraordinary command of just about every 
serious scholarly work on creative industries written by cultural sociol- 
ogists, and on a rapidly growing body of theoretical and empirical work 
in the field of cultural economics. Some of what is covered in the book 
will be familiar ground to cultural sociologists, such as the discussion of 
organizational arrangements that attempt to manage the conflict between 
art and commerce in creative industries. And, as an economist, Caves is 
quick to embrace a functionalist logic (“In the world of commerce, market 
distortions tend to generate their own repairs” [p. 144]) and puzzles over
the persistence of organizational arrangements that seem to defy economic 
logic (e.g., stable customary royalty rates in music publishing). Yet Caves 
approaches his task with a humility that is uncommon among the more 
zealous practitioners of the dismal science, and I came away from this 
book more convinced of the applicability and utility of economic models
of contracting than I have from any other writing by mainstream econ-
omists on social organization and work.

Readers of this book will be surprised to find that it contains only 
passing reference to the impact of electronic commerce. There is no explicit 
discussion of online auctions, music exchanges, or Internet book retailers.
Caves argues convincingly that the rapid expansion of electronic com- 
merce is not evidence of a fundamental transformation of creative in- 
dustries. Instead, the Internet’s impact can be understood by applying 
the same principles from theories of contracts and industrial organization. 
Indeed, a productive approach for using this book in the classroom would
be to require students to extend Caves’s analysis of auction houses to 
eBay, his analysis of the distribution of popular music to Napster, and 
his analysis of changing organizational concentration in the publishing 
industry to Amazon.com. 

Caves’s book effectively communicates sophisticated economic con- 
cepts with a minimum of formalism and technical jargon. It is clearly the 
work of an individual who cares deeply about art, culture, and the creative 
process. It should be read by anyone who teaches or does research on the 
production of culture or the economics of organization. And anyone who 
aspires to be a rock star, best-selling author, or filmmaker (not uncommon 
career goals among sociology majors these days) would be well advised 
to read this book to better understand how money is made and creativity 
is managed when art and commerce intersect. 


Life Has Become More Joyous, Comrades: Celebrations in the Time of 
Stalin. By Karen Petrone. Bloomington: Indiana University Press, 2000. 
Pp. x+266. $39.95.


Dominic Boyer 
University of Chicago 


This informative and enjoyable new social history of Soviet celebrations 
in the Stalinist era offers novel insight into the role of state-sponsored 
ceremonials in national identity formation. Petrone explains that, faced 
in the 1930s with the problem of perduring localized social imaginaries, 
the bureaucrats of the Soviet Union sought to instrumentally utilize mass 
celebrations to cultivate standard currencies of national belonging and 
national imagination. The narrative of the book skillfully juxtaposes the 
Soviet state’s quest for legitimacy in a harmonious national imaginary 
with the pragmatic complexities, tribulations, and not occasional fruitions 
of engineering symbolic meaning on a mass scale. 

The book’s introduction is succinct and deftly positions the author’s 
project as an investigation of Soviet celebrations as political instruments 
of mass identity formation significantly mediated by the contexts of their 
production and reception. The next several chapters each focus on a
particular celebration complex (ranging from physical culture parades to
the glorification of Soviet aviation and polar exploration to the 1937 Push- 
kin centennial) and analyze in detail both the state’s intended logic for 
the celebration and the problems the state encountered in maintaining 
the transcontextual purity and integrity of its desired meanings of national 
culture as it created and disseminated them. To her credit, even though 
she explicitly aims to complexify contemporary interpretations of Soviet 
propaganda, Petrone never downplays the power of state ideology for 
cultivating and accrediting particular strategies of national identification 
as emblematic of “New Soviet” men and women. Yet, neither does she 
take “state ideology” as an undifferentiated given, instead consistently 
exposing the heteroglossic effects of the factional interests of different 
groups of propagandists and of the indigenization of state discourse in 
the Soviet peripheries. Finally, the great methodological strength of the 
book is Petrone’s juxtaposition of private documents like diaries with 
official discourse to reveal the uneven incorporation of state symbologies 
into the formation of local and personal social identities. Here, we en- 
counter both triumphs of state intention—for example, in tears shed by 
an old collective farmer for the heroism of Soviet explorers—and blanket 
failures of official narratives of citizenship and identity to suffuse local 
conceptions of selfhood. 

The case studies, taken together, admirably support Petrone’s central 
argument that “the identities created by Soviet celebrations were highly 
variable, contingent, and constantly in the process of being reshaped. 
Soviet celebrations both succeeded and failed in their goal of creating 
New Soviet Men” (p. 204). In this, the book makes an important contri- 
bution to mass culture studies by showing how celebrations were equally 
rituals of establishing categories of national sacredness (in a Durkheimian 
sense) and objects of estrangement. Petrone’s insistence on the plurality 
of vectors in Soviet identity formation—including accredited state dis- 
course, critical intellectual subversions, competing ethnic-national affili- 
ations, and a host of other local adherences—is a welcome reminder that 
state-sponsored mass culture cultivates but does not guarantee mass iden- 
tity formation. Indeed, the book reveals that this is not only a problem 
of differential local reception but indebted to a fundamental heterogeneity 
at the production end of state narratives. The book’s historical analysis 
vividly illuminates the pluricentric institutional forces as well as the in- 
dividual agents who constitute the artisans of any national imaginary. 

Fresh insights into center-periphery relations in the Soviet Union, into 
the social life of propaganda, and into the hybridity of Soviet public 
culture are among the text’s more significant contributions both to Soviet 
studies and to mass culture research more generally. The text does have 
certain weaknesses, however. Even taken as a whole, the several case 
studies do represent a somewhat limited sample, and one might wish for 
more sustained theoretical consideration of how Soviet celebrations should 
be placed within research on state manipulation of public culture more
generally. Here, comparisons with other socialist contexts might have
helpfully illuminated what was distinctive to the Soviet case and what 
strategies were more generally shared by other party-states’ mass cultural 
regimes. The subtitle of the work is also misleading since the case studies 
focus almost exclusively on the period between 1932 and 1937, with some 
connections to the period of the first Five Year Plan but with only sketchy 
connections to the celebration complexes of the World War II and postwar 
periods. Finally, this reviewer found the narrative choppy in places and 
the exegesis of particular celebrations repetitive in others. 

Despite these relatively minor flaws, this book should be warmly re- 
ceived for its timely reconsideration of the role of mass celebrations in 
national identity formation and, particularly, for its striking illumination 
of the complex and often unsuccessful negotiations of local and translocal 
knowledge that are implicit in any state effort to “engineer” (using Stalin’s 
own phrase) a national soul. Petrone describes with understated irony 
how genuine collective rapture emerged in these celebrations almost de- 
spite the backroom calculations oriented to achieve it. 


From Mutual Aid to the Welfare State: Fraternal Societies and Social 
Services, 1890-1967. By David T. Beito. Chapel Hill: University of North 
Carolina Press, 2000. Pp. xvi+320. $55.00 (cloth); $24.95 (paper). 


Charlotta Stern 
Stockholm University 


David T. Beito tells a constellation of fascinating stories about the fraternal 
societies of ordinary Americans, 1890-1967. With strong empirical sup- 
port, he argues that, far from being odd social clubs with strange rituals, 
fraternal societies were at the center of working-class and ethnically based 
living and were creatures of material and moral necessity. Using an ide- 
ology of self-reliance and mutuality, fraternal societies provided members 
with social services framed as rights. The mutuality meant that today’s 
recipient could be tomorrow’s donor, and vice versa. The stigma of hi- 
erarchical dependence was thus replaced by the dignity of self-reliance. 
Over the period studied millions of Americans by such means provided 
themselves with sickness insurance (worker’s compensation), life insur- 
ance, and health insurance. 

A member’s social right depended on proper conduct according to ac- 
cepted rules and rituals. These were enforced by visiting committees and 
interpersonal communication. Beito points out that a fraternal society 
worked much like a formalized extended family. It taught members the 
virtues of thrift, mutualism, and individual responsibility. Unlike a family, 
however, the fraternal lodge was an organization of consenting adults. It 
was usually democratic, with elected leaders and functionaries. Its rules 
and rituals made the obligations explicit, and perhaps most important, if
the value of membership was doubted, members could exit. It organized
solidarity around membership, it pooled resources and extended them to 
members in need, and it practiced social control. Members also learned
valuable organizational skills. 

Thus, fraternal societies offered social inclusion and economic security 
to uprooted individuals with scarce material resources. Their success was
enormous. By 1920 members of fraternal societies carried over $9 billion 
worth of life insurance. At least one-third of adult males were members, 
including large segments of workers, blacks, and recent immigrants. 

The first and last parts of the book are about the rise and fall of mutual
aid fraternities, while the chapters in between contain many fascinating 
stories beyond insurance and mutual aid. Beito shows that fraternal so- 
cieties were multifaceted, spirited organizations of great variety and that 
organizations sometimes pulled together some rather grand, even heroic, 
endeavors. For instance, he outlines the story of the Ladies of the Mac- 
cabees, a woman-only, feminist fraternity that organized around 20,000 
members in the 1920s. And the United Order of True Reformers, a black 
fraternity that ran a 150-room hotel, a bank, a newspaper, and several 
retail stores as well as provided their members life insurance. Or the Loyal 
Order of the Moose, which established Mooseheart to feed, clothe, and 
educate the children of deceased members. By 1929 Mooseheart housed 
110 mothers and 1,274 children. In his chapter on Mooseheart, Beito 
describes the quality, financing, and everyday workings of the orphanage, 
and gives accounts of the children’s social background, their adult lives, 
and attitudes about their childhood. His careful and detailed review of 
the evidence suggests that the disrepute of orphanages was quite unfair 
and overstated. 

Beito goes on to examine the lodge practice, another fraternal institution 
in disrepute. Through lodge practice, fraternal societies provided members 
with low-cost medical services, costing on average $2 per year, or a laborer’s
day’s pay. After interviews and voting, a doctor was signed up to provide
basic health care in exchange for a fixed salary based on the size of the 
membership. Lodge practice spread rapidly—until leaders of American
Medical Societies decided that it was an evil. The evil consisted in it
“pos[ing as] a danger to prevailing fees and subject[ing] doctors to the
exploitative whims of the laity” (p. 109). The lodge practice evil elicited 
almost universal condemnation from medical societies. This, coupled with 
declining numbers of doctors, led to the demise of the lodge practice by 
the end of the 1920s. 

Using these and other examples, Beito explains the rise and fall of some 
specific fraternal institutions and the long-run fall of fraternal mutual aid. 
The challenges presented at the time seem to have shifted the emphasis 
of fraternities to service rather than mutual aid. Beito suggests that the 
shift might have been a strategic mistake because fraternities thereby lost 
their special character. But the demise of mutual aid societies may have 
been inevitable. With growing wealth among the new generation of Amer-
icans, the demand for mutuality declined. With increasing regulations and
legislation, fraternities could no longer offer hospitals, orphanages, and 
old age homes. With taxpayer-financed mother’s pensions, workers com- 
pensations, and social security, the demand for voluntary insurance de- 
clined. As the title of the book suggests, mutual aid was indeed crowded 
out by the welfare state. 

Beito’s excellent study sheds light on an important yet neglected part 
of the social past. It is exciting in the way that it exposes the selfishness 
and arrogance of the elite of the time. It has insights especially for so- 
ciologists interested in social movements, voluntary organizations, social 
work, empowerment, and American social history. 


The Invention of the Passport: Surveillance, Citizenship and the State. 
By John Torpey. Cambridge: Cambridge University Press, 2000. Pp. 
xii+211. $59.95 (cloth); $22.95 (paper). 


Horng-luen Wang 
National Taiwan University 


At first glance, the passport deserves little scholarly attention from so- 
ciologists, as it appears to be no more than an official document needed 
only for foreign travel and rarely bearing any significance in our daily 
life. However, John Torpey has turned a seemingly insipid topic into a 
fascinating one by making an original contribution to the sociology of the 
(nation-)state. In this groundbreaking exploration of the passport’s vicis- 
situdes from the French Revolution to the present time, Torpey argues 
convincingly that the passport is important to our understanding of the 
nature of the state and the state system. 

Echoing Marx’s analysis of the means of production in capitalism, as 
well as Weber’s conception of the monopoly of legitimate violence by the 
state, Torpey adds a third type of expropriation in the modern world, 
namely the “monopolization of the legitimate means of movement” by
nation-states. Intended to be bounded, mutually exclusive communities, 
nation-states need to bring closure not only to their territories but also to 
their subjects. Accordingly, Torpey suggests that the traditional charac- 
terization of the state “penetrating” society be replaced by that of “em- 
bracing” it. To enhance its capacity of governing, the state must locate 
and lay claims to people by “surrounding” or “taking hold” of its members 
(p. 11). Therefore, documentary controls on movement and identification 
become essential to determine “who are in” and “who are out.” Drawing 
on extensively collected materials covering France, Germany, Italy, the 
United Kingdom and the United States, Torpey impressively provides us 
with a well-documented account of how ideas and practices concerning 
the passport developed in various historical contexts and became insti-
tutionalized, albeit at an uneven pace, in the contemporary state system.

Such a theoretically driven and empirically grounded study contains
rich themes that can sustain a range of discussion. Although Torpey hardly 
makes “institutions” his buzzword, this study can be best seen as exem- 
plifying the analytical strengths of the “institutionalist approach” to the 
study of world polity (following John Meyer) and nationhood/nationalism 
(following Rogers Brubaker). Indeed, Torpey is “institutionalist” to the 
bone when he asserts that “national communities must be codified in 
documents rather than merely imagined” (p. 6), or that “nationality is an 
ascribed status that cannot be established without reference to documents” 
(p. 155; emphasis in original). Torpey also nicely deals with the contra- 
dictions of the passport with cosmopolitan views in revolutionary France, 
and with economic liberalism in the nineteenth century, eventually leading 
us to the ongoing debates concerning the “postnational” argument. As 
hypermobility of transnational flows of people is deemed one of the de- 
fining features of globalization, we tend to assume that people are crossing 
national boundaries more and more freely. Torpey reminds us, however, 
that such is not the case. He shows us that transnational flows of people, 
no matter how mobile they may seem, have to move within the institu- 
tional channels set up by the state. Even where passport controls are 
loosened, the state remains the final authority for such discretions, and 
institutions of nationhood and citizenship remain predominant. Thus, the 
study provides a powerful refutation of the prevalent view of the
“decline of sovereignty/citizenship” or “postnational membership.” 

However, there are some limitations with the book due to its ambitious 
scope. Because it focuses on a few selected countries, there is a relative 
lack of coverage on how an overarching passport regime has been created 
worldwide. In addition to detailed biographies of individual trees, readers 
motivated by the book’s title might want to know more about how the 
entire forest has grown into what it looks like today. Besides, while the 
image of “embracing” is a refreshing way of conceptualizing the relation- 
ship between the state and society, the definition of “means of movement” 
is not clearly framed in this study, making the analogy hard to grasp. 
Unlike machinery (means of production) or the army (means of violence), 
the passport is neither inherent in nor necessitated by the movement itself. 
To say that the state “monopolizes” the means of movement through 
passport control implies that movement intrinsically invokes admissions 
or regulations of some sort. This, apparently, is not true. In discussing 
cosmopolitanism and laissez-faire, Torpey has already pointed out
that regulations on migratory flows can be eliminated under different 
credos. Thus, it is the state that sets barriers on the otherwise free move- 
ment of people. The passport regime simply embodies Foucault’s “gov- 
ernmentality,” Giddens’s “surveillance and administrative power,” or We- 
ber’s “legitimate violence” in a microscopic, capillary form. All these 
authors’ concepts Torpey has aptly drawn upon, but his novel notion of 
“means of movement” seemingly adds little to our theoretical stock. 

These limitations notwithstanding, this book on the whole is a laudable
effort that will interest students in a full array of topical fields: the modern
state, nations and nationalism, globalization, sovereignty and citizenship, 
international migration, world polity and international regimes, and oth- 
ers. All will find parts of this work either insightful or stimulating for 
further investigation. 


Geography and Social Movements: Comparing Antinuclear Activism in 
the Boston Area. By Byron A. Miller. Minneapolis: University of Min- 
nesota Press, 2000. Pp. xx+215. $21.95 (paper). 


Dingxin Zhao 
University of Chicago 


Barrington Moore is a geographer. Few would agree with this statement.
However, allow me to summarize Moore’s classic theory as follows: Moore 
argues that modernities are not universal but place-specific, that different 
types of modern political regimes, from dictatorship to democracy, are the 
result of place-specific modes of production and, consequently, the social- 
spatial relations among major classes in different places ranging from 
England to China. Presented with the preceding summary, geographers 
may think that Moore is indeed one of them. In fact, however, Moore is 
still not a geographer, and neither is geographer Byron A. Miller. 

The book studies antinuclear movements in Cambridge, Lexington, and 
Waltham, Massachusetts. It argues that antinuclear movements in the 
three Boston-area municipalities faced different structural constraints in 
terms of levels of education, class composition, political opportunities, 
economic history, and history of activism. Consequently, the activists in 
the three municipalities adopted different strategies in participant mo- 
bilization, issue framing, and in their interactions with the authorities. It 
also shows that while the antinuclear movements mainly operated at the 
municipal level, what they were against was a military policy determined 
by the central state. Since the central state for most of the time wanted 
to keep the arms production going and defense firms had plenty of re- 
sources to stage nationwide countermobilizations, the municipality-based 
antinuclear movements were not very effective in achieving their goals. 

In the above summary, I have disposed of all the geographic jargon 
used by the author. Once this jargon is avoided, the book becomes a 
rather typical comparative sociological work. The geographic jargon can 
be added to Moore’s work and removed from Miller’s without altering 
their main arguments, because the jargon carries no mechanism of its 
own. For example, the author argues that characteristics of places give 
rise to place-specific movement strategies. However, by the characteristics 
of a place, the author actually refers to people’s social backgrounds—such 
as levels of education, class composition, and political opportunities—in 
a municipality. What provides mechanisms here are the social back-
grounds, not the place. As for the place-specific strategies, the author only
supplies evidence that shows, for example, that the spatial distribution 
of memberships in a municipality reflected the social structural compo- 
sition of the population in that municipality. Without a demonstration 
that the spatial distribution indeed resulted from the strategies of the 
peace movement organizations, the distribution only shows us that people 
of certain social structural backgrounds were more likely to join the peace 
movement than others. Should the author provide evidence on how the 
peace organizations had targeted people with certain social structural 
backgrounds, it is again the social structural backgrounds of the popu- 
lation, not the places, that furnish mechanisms. In other words, the strat- 
egies are structure specific, not place specific. 

The author also argues that the antinuclear movements faced different 
political opportunity structures at the municipal and central-state levels, 
and that this difference had a great impact on the movements’ strategies 
as well as chances of success. The author uses “geographic scale” to label 
the problem. Obviously, what really matters here are the different political 
opportunity structures at the local and central-state levels. The concept 
of geographic scale again carries no mechanisms in itself and is therefore 
dispensable. 

Let me use Roger Gould’s study of mobilizations during the Paris Com- 
mune to illustrate a mechanism-based geographic analysis. Gould argues 
that Haussmann’s projects had altered Paris geography and destroyed
the class-based residential networks in Paris. Consequently, the mobili- 
zation during the Paris Commune was no longer based on working-class 
consciousness, as was the case during the June rising of 1848, but on 
neighborhood-based networks. The difference between Gould’s and the 
author’s approaches is clear. In the author’s analysis, none of the socio- 
logical factors that the author found important to participant mobilization, 
such as the level of education, class composition, and political opportunities,
has anything to do with the geographic characteristics of the three mu-
nicipalities. Geography makes no contribution to the analysis. In Gould’s
work, however, it was the change of Paris geography that contributed to 
the change of neighborhood structures and consequently the patterns of 
mobilization. One cannot understand the patterns of mobilization without 
understanding the new Paris geography. Geography provides a mecha- 
nism in Gould’s study and is thus indispensable. 

The book has an unnecessarily long theoretical discussion, with many 
analyses unrelated to the major arguments of the book (but they may be 
useful for students who want to know the literature). The book’s empirical 
analyses also remain thin. However, the most crucial problem of the book 
lies in the way the concept of geography is used. Given such a usage, as 
the Barrington Moore example shows, any comparative study involving
cases from more than one place can be treated as a geographic study. 
Such an all-inclusive, mechanismless use of geographical concepts will 
only undermine the legitimacy of geography as a discipline. A valid ge-
ographic analysis must assign space, place, and scale independent mech-
anisms—mechanisms that cannot be reduced to traditional sociological
factors such as class, gender, and state but make a significant contribution 
to a social process under investigation. 


Acts of Faith: Explaining the Human Side of Religion. By Rodney Stark 
and Roger Finke. Los Angeles and Berkeley: University of California 
Press, 2000. Pp. 343. $48.00 (cloth); $18.95 (paper). 


Rhys H. Williams 
Southern Illinois University, Carbondale 


It may surprise those not familiar with the sociology of religion to know 
that the “new paradigm” perspective currently in vogue is not synonymous 
with a “rational choice” theoretical approach. The term “new paradigm” 
came from R. Stephen Warner’s “Work in Progress toward a New Par-
adigm for the Sociological Study of Religion in the United States” (Amer-
ican Journal of Sociology 98 [1993]: 1044-93). Warner’s argument was
that particular social and institutional conditions in the United States 
made its religious scene distinctly different from Europe’s. Thus, secu- 
larization theories based on the European canon are not particularly useful 
for understanding religion here. The argument could have been considered 
historicist or institutionalist—in any case, it certainly was not premised 
on a rational choice theory of human motivation, nor did it propose 
development of “formal” or hypothetico-deductive sociological theory. 

Those reading this book may not recognize that distinction either. Rod- 
ney Stark and Roger Finke have added much to the sociological study 
of religion and helped open new vistas and new ideas to the subdiscipline. 
But intellectual caution and scholarly diplomacy are not their style. They 
claim much, in a vociferous voice, and pretty much assume they wear 
the mantle of the new paradigm. 

I hasten to add that Stark and Finke claim not to like the “rational 
choice” moniker. They use a “softened” and “expanded” conception of 
rational action, which emphasizes reasonableness to the actor. But this 
has its own problems, as the conception is so qualified and broad that it 
is not only unobjectionable but it is close to tautological. 

Nonetheless, the logic of the argument and the organization of the book 
are consistent with the classical liberalism that produced neoclassical ec- 
onomics (Gary Becker and Adam Smith are cited repeatedly) and soci- 
ological exchange theory. Stark and Finke begin with an analysis of the 
“religious individual” and her or his motivations for religious behavior 
and then proceed to sections on the “religious group” and then the “re- 
ligious economy.” Their case is codified in 99 propositions and 36 formal 
definitions presented over the course of the book—numbered sequentially 
throughout so as to emphasize their cumulative and logically deductive 
quality. Individual, group, and economy are not just analytic levels for 
Stark and Finke; rather they represent a necessary micro-to-macro the- 
oretical progression in which social groupings are aggregations of indi- 
vidual actions. 

For example, religious groups are analyzed by the extent to which they 
can demand commitment and behavior from members. Since individuals 
are trying to maximize religious benefits, the more membership “costs,” 
the more it must be worth and the greater the commitment. Thus, groups 
that are more demanding will do better because membership is more 
exclusive and thus more valuable, the certainty of salvation is higher (at 
least to believers), and religious capital is reinforced through social net- 
works. In some of the literature, the key to this loyalty is “strictness” in 
both doctrine and behavioral demands. Stark and Finke, however, gen- 
erally use the term “distinctiveness” rather than strictness, which shifts 
the focus to group-environment interactions. 

Concomitantly, when the authors discuss the venerable “church-sect” 
tradition of understanding different religious collectivities, they focus less 
on internal organizational characteristics and instead define the difference 
as between “low tension” (churches) and “high tension” (sects) with the 
surrounding society. This helps explain religious change; high-tension sects 
arise when low-tension groups cannot satisfy people’s religious needs. But 
when sectarian groups grow enough to become dominant, they necessarily 
lose some of their tension with society, thus leading to new sectarian 
movements. The logic is basically free market: monopolies produce la- 
ziness while market competition meets more people’s demands more 
efficiently. 

Obviously, there are anomalous cases, as some very strict and distinctive 
groups are not at all oriented to growth, or they place such high demands 
that they are too costly. However, growth need not be a synonym for 
health, unless we accept too completely the assumptions of capitalist ec- 
onomics. More troublesome cases for this formulation are the Southern 
Baptist Convention (SBC) and the Mormons. While the Mormons remain 
in some tension with U.S. society generally (even if they dominate some 
areas), the SBC is harder to figure. It has become a national denomination 
and is in near perfect concert with the recent conservative changes in 
American politics and public religion. How exactly is it “high tension”? 
If individualist, experiential, racially segregated, anti-intellectual, and lo- 
cally organized religion is a subculture, then what exactly is the dominant 
culture? One needs more of a theory of power to deal with such questions 
than this economistic theory provides. And perhaps the assumption that 
“you get what you pay for” is based in culture and not as universal or 
fundamental as assumed here. 

Those familiar with the sociology of religion literature will know that 
there is currently a lively debate over some of the empirical evidence 
Stark and Finke use to support their conclusions. Whether religious plu- 
ralism actually increases church attendance is very much in doubt. That 
debate is not engaged seriously here. Also, the authors’ bête noire, “secu- 
larization theory,” is caricatured, and the introductory chapter on atheism 
and the study of religion has little to teach serious students of social 
thought. 

While this book will probably not persuade those who do not already 
agree with Stark and Finke, it does provide a thorough account of their 
perspective and covers a great deal of ground. There is much to engage. 
No doubt this book is not the last word, but it will spur much future 
research. 


Conservation Tillage and Cropping Innovation: Constructing the New Cul- 
ture of Agriculture. By C. Milton Coughenour and Shankariah Chamala. 
Ames: Iowa State University Press, 2000. Pp. xii+360. $64.95. 


Donald E. Voth 
University of Arkansas 


This is a sociological analysis of a major sociotechnical change in agri- 
culture, the transition from farming systems based upon plowing and the 
plow to various minimum-till or no-till production systems. According to 
the authors, Christian County, Kentucky, was the site of the first no-tillage 
crop production experiments in the United States, experiments which 
occurred on the farm of Harry and Laurence Young in 1962. The authors 
report that in 1998 from 12% (for cotton) to 18% (for “other crops”) to 
as high as 49% of double cropped grain sorghum and 74% of double 
cropped soybeans in the United States are grown under some form of 
“conservation tillage.” The book describes, sometimes in almost tedious 
detail, how this sociotechnical transition occurred. Parallel with the U.S. 
experience, a similar transition in Australia is described. 

The book’s eleven chapters start with a conceptual treatment of the 
“social construction” of tillage and cropping systems. This is followed by 
a description of the “plow culture” in the United States and Australia, 
including its historical roots in Europe; by a description of the major 
forces which affected both U.S. and Australian crop production agricul- 
ture in the 1950s; and then a rather abstract treatment of “the social 
construction of innovative networks.” Then follow two chapters dealing 
with the social construction of new tillage and cropping systems, one 
focusing on the United States, the other on Australia. Here, the “action 
learning” processes involved in the innovation are described through use 
of numerous brief case descriptions. The spread of conservation tillage in 
Kentucky and Queensland, Australia, and ultimately throughout the 
United States is then presented with a dual focus upon statistical in- 
formation and the phases or cycles through which this transition passed. 
Here the authors emphasize that the process was not linear, but quite 
episodic, depending upon the economic environment for agriculture, tech- 
nical developments, and practical results found in the field. The final two 
chapters focus upon the role of research and development institutions and 
of public policies upon experimentation with and the adoption of con- 
servation tillage. 

The innovation with new tillage systems and transition away from the 
deeply ingrained plow culture is described as fundamentally a sociocul- 
tural process. The technology of agricultural production is a social prod- 
uct, created by human beings to facilitate achievement of their desired 
ends. Though the authors do not emphasize it, the book could be used 
as a detailed description of cultural inertia and resistance to change, even 
when the physical and economic environment seems to be forcing change. 
Rather, the authors focus upon change and the change processes. Though 
they do not set out a specific series of hypotheses to test, they do, from 
time to time, challenge conventional wisdom about the innovation and 
adoption processes. In the introduction they state that “the process of 
diffusion, as this study indicates, differs substantially from the classical 
theory of the diffusion of innovations (Rogers 1983). The principal dif- 
ference in our view is that because a practicing farmer and his farm are 
the core of a conservation tillage system, rather than simply of a particular 
machine or the chemical herbicide, diffusion takes place by the com- 
munication of a model of a tillage cropping system that individual farmers 
acquire and use as a basis for reconstructing a satisfactory management 
system of their own” (p. 13). They go on to identify what turns out to be 
a key feature of their entire story—the role of networks: networks of 
farmers, networks of institutions, even networks of ideas and concepts of 
agricultural production systems. 

The book is an excellent example of that rural sociology that focuses 
upon the dynamics of agriculture and agricultural production, in the context 
of the broader issue of the social construction of reality, in this case some 
very concrete reality that is in flux. As such it both makes a strong case 
for the importance of the sociocultural aspect of technological change and 
development, and provides a path, if not concrete methodological tools, 
for understanding this sociocultural context. Unfortunately, its length and 
the amount of detail it provides make it unlikely that those who need to 
be persuaded will read much of it. Hopefully those seeking to perform 
similar analyses will do so. In addition to the central topic of the book, 
they will find excellent summaries of relevant literature. Those who are 
simply interested in the history of agricultural production practices will 
also find the book informative and enjoyable. Theoretical and analytic 
material is interspersed and illustrated with a vast amount of concrete 
detail involving the real experiences of farmers, extension agents, and 
others involved in agricultural production. 





